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I. REAL PARTY IN INTEREST 

The real party in interest is Genentech, Inc., South San Francisco, California, by an 
assignment of the parent application, U.S. Patent Application Serial No. 09/665,350 recorded 
July 9, 2001, at Reel 01 1964 and Frame 0181. The present application is a continuation of U.S. 
Serial No. 09/665,350. 

II. RELATED APPEALS AND INTERFERENCES 

The claims pending in the current application are directed to a polypeptide referred to 
herein as "PR0232". Although there exist several applications directed to the "gene 
amplification" utility under Appeal, there are no applications related to PR0232 nucleic acids or 
antibodies. 

III. STATUS OF CLAIMS 

Claims 44-46 and 49-51 are in this application. 
Claims 1-44 and 47-48 have been canceled. 

Claims 44-46 and 49-51 stand rejected and Appellants appeal the rejection of these 

claims. 

A copy of the rejected claims in the present Appeal is provided in Section IX. 

IV. STATUS OF AMENDMENTS 

A summary of the prosecution history for this case is as follows: 

Previously, in response to a Final Office Action mailed on October 21, 2005, a Notice of 
Appeal was filed on February 21, 2006 and an Appeal Brief was filed on April 17, 2006. An 
RCE Response with additional references and affidavits supporting Appellants' arguments was 
filed on July 27, 2006. A Final Office Action was mailed on October 17, 2006, and a Notice of 
Appeal was filed on March 19, 2007. 

No amendments were submitted after the final rejection mailed October 17, 2006. All 
previous amendments have been entered. 

V. SUMMARY OF CLAIMED SUBJECT MATTER 

The invention claimed in the present application is related to an isolated polypeptide 
comprising the amino acid sequence of the polypeptide of SEQ JD NO: 18, referred to in the 
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present application as "PR0232". The PR0232 gene was shown for the first time in the present 
application to be significantly amplified in human lung cancers as compared to normal, non- 
cancerous human tissue controls (Example 92). This feature is specifically recited in Claim 124, 
and carried by all claims dependent from claim 44. In addition, the invention also claims the 
amino acid sequence of the polypeptide of SEQ ID NO: 18, lacking its associated signal-peptide; 
or the amino acid sequence of the polypeptide encoded by the full-length coding sequence of the 
cDNA deposited under ATCC accession number 209250 (Claim 44-46 and 49). The invention is 
further directed to a chimeric polypeptide comprising one of the above polypeptides fused to a 
heterologous polypeptide (Claim 50), and to a chimeric polypeptide wherein the heterologous 
polypeptide is an epitope tag or an Fc region of an immunoglobulin (Claim 51). The preparation 
of chimeric PRO polypeptides (Claims 50 and 51), including those wherein the heterologous 
polypeptide is an epitope tag or an Fc region of an immunoglobulin, is set forth in the 
specification at page 74, lines 23 to page 75, line 5. Examples 53-56, pages 192-199, describe 
the expression of PRO polypeptides in various host cells, including E. coli, mammalian cells, 
yeast and Baculovirus-infected insect cells. 

The amino acid sequence of the "PR0232" polypeptide and the nucleic acid sequence 
encoding this polypeptide (referred to in the present application as "DNA34435-1 140") are 
shown in the present specification as SEQ ID NOs: 18 and 17, respectively, and in Figures 9 and 
8, described on page 59, lines 4-7. The full-length PR0232 polypeptide having the amino acid 
sequence of SEQ ID NO: 18 is described in the specification at, for example, on page 4, pages 3- 
4 and page 100, page 131, line 9 to 16 and the isolation of cDNA clones encoding PR0232 of 
SEQ ID NO: 18 is described in Example 4, page 149-150 of the specification. The specification 
discloses that the PR0232 polypeptide possess significant sequence homology to cell surface 
stem cell antigen (35% sequence identity with a stem cell surface antigen from Gallus gallus) 
and may play a role in cell proliferation and/or differentiation, (see for example, page 4 and 
Example 4, line 14-15). 

Finally, Example 92, in the specification at page 222, line 26, to page 235, line 3, sets 
forth a 'Gene Amplification assay' which shows that the PR0232 gene is amplified in the 
genome of certain human lung cancers (see Table 9, pages 230-234). The profiles of various 
primary lung and colon tumors used for screening the PRO polypeptide compounds of the 
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invention in the gene amplification assay are summarized on Table 8, page 227 of the 
specification. 

VI. GROUNDS OF REJECTION TO BE REVIEWED ON APPEAL 

1 . Whether instant Claims 44-46 and 49-5 1 satisfy the utility/enablement 
requirement under 35 U.S.C. §§101/1 12, first paragraph. 

2. Whether Claims 44-46 and 49-51 are entitled to the priority date of U.S. 
Provisional Application 60/059121, filed September 17, 1997. 

3. Whether Claims 44-46 and 49-5 1 are anticipated under 35 U.S.C. § 1 02(b) by 
Rosenthal et al., DE19818619-A1 (October 1999). 

VII. ARGUMENTS 
Summary of the Arguments 
Issue 1; Utility 

As a preliminary matter, Appellants note that the Examiner refers to the microarray assay 
in several instances in several Office Actions, and cites references like Lilley et al, King et al, 
Lee et al, Wildsmith et al, Chen et al. etc. which use and analyze the microarray assay. In 
addition, the Examiner rejects the use of the "universal control" in the instant application. 
Appellants point out that neither the "universal control" nor the microarray were used in the 
claimed invention, hence rejections directed either to the limitations of the use of the microarray 
assay, or to the universal control, are moot. On the other hand, Appellants rely upon the gene 
amplification data of the PR0232 gene for patentable utility of the PR0232 polypeptides in the 
present application, and controls defined in Example 92 of the instant specification, and would 
like to distinguish between the gene amplification and the microarray assay. The gene 
amplification assay measures the level at which a certain gene (i.e. DNA) is amplified in the 
genome, whereas the microarray assay measures the level of expression of a mRNA encoding for 
a certain polypeptide in a sample. Throughout prosecution, the Examiner fails to distinguish 
between these two techniques, but Appellants submit that the two assays, although similar, are 
not the same. 

Appellants would like to bring to the Examiner's attention a recent decision in a 
microarray case by the Board of Patent Appeals and Interferences (Decision on Appeal No. 
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2006-1469). In its decision, the Board reversed the utility rejection, acknowledging that "there is 
a strong correlation between mRNA levels and protein expression, and the Examiner has not 
presented any evidence specific to the PR01866 polypeptide to refute that." (Page 9). 
Appellants submit that, in the instant application, the Examiner has likewise not presented any 
evidence specific to the PR0232 polypeptide to refute Applicant's assertion of a correlation 
between DNA levels, mRNA levels and protein expression. Appellants add that they discuss or 
analyze the microarray assay only in response to the Examiner's citations and in response to 
correlation of mRNA and protein levels. This is not to be construed as an admission that all that 
applies to the microarray assay, applies to the gene amplification assay as well. 

Appellants rely upon the gene amplification data of the PR0232 gene for patentable 
utility of the PR0232 polypeptides. The specification discloses that the gene encoding PR0232 
showed significant amplification, ranging from 2.056-fold to 5.28-fold, in five lung tumors or 
2.00-fold to 5.32-fold in seven colon tumors . 

Appellants have submitted, in their Response filed July 25, 2005, a Declaration by Dr. 
Audrey Goddard, which explains that a gene identified as being amplified at least 2-fold by the 
disclosed gene amplification assay in a tumor sample relative to a normal sample is useful as a 
marker for the diagnosis of cancer , and for monitoring cancer development and/or for measuring 
the efficacy of cancer therapy. Therefore, such a gene is useful as a marker for the diagnosis of 
lung or colon cancer , and for monitoring cancer development and/or for measuring the efficacy 
of cancer therapy. 

Appellants have also submitted, in. their Responses filed August 9, 2004, and July 27, 
2006, ample evidence to show that, in general, if a gene is amplified in cancer it is more likely 
than not that the encoded protein will be expressed at an elevated level. For instance, the articles 
by Orntoft et aL 9 Hyman et aL, and Pollack et al collectively teach that in general gene 
amplification increases mRNA expression . Second, the Declarations of Dr. Paul Polakis: 
(Polakis I and II), shows that, in general, there is a correlation between mRNA levels and 
polypeptide levels . Third, Appellants further submit that even if there were no. correlation 
between gene amplification and increased mRNA/protein expression, (which Appellants 
expressly do not concede to), a polypeptide encoded by a gene that is amplified in cancer would 
still have a specific, substantial, and credible utility. Appellants submit that, as evidenced by the 
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Ashkenazi Declaration and the teachings of Hanna and Mornin (both made of record in 
Appellants' Responses filed December 10, 2003 and August 9, 2004), simultaneous testing of 
gene amplification and gene product over-expression enables more accura te tumor classification, 
even if the gene-product, the protein, is not over-expressed. This leads to better determination of 
a suitable therapy for the tumor, as demonstrated by a real-world example of the breast cancer 
marker HER-2/neu. 

Appellants further note that the sale of gene expression chips to measure mRNA levels is 
a highly successful business, with a company such as Affymetrix recording 168.3 million dollars 
in sales of their GeneChip arrays in 2004. Clearly, the research community believes that the 
information obtained from these chips is useful (i.e., that it is more likely than not informative of 
the protein level). Therefore, as a general rule, one skilled in the art would find it more likely 
than not that PR0232 polypeptides are useful as a diagnostic tools for detecting lung or colon 
tumors. 

The Examiner relies on the teachings of Pennica et al, Konopka et al, Haynes et al, Hu 
et al, Lian et al, Fessler et al, Gygi et al, Lilley et al, Lee et al, King et al, Wildsmith et al, 
Nagaraja et al, Sagynaliev et al, Waghray et al, Madoz-Gurpide et al, Feroze-Merzoug et al, 
Bustin et al, Saito-Hisaminato et al, to allege that there is no correlation between increased 
gene amplification and protein levels. 

Appellants respectfully disagree and submit that the teachings within these cited 
references do not conclusively establish a prima facie case for lack of utility because the 
references are, either not contrary to the Appellants' arguments, or, actually lend support to the 
Appellants' position, or, are not applicable to the present application for various reasons, as 
discussed in detail below. On the other hand, Appellants submit that while the literature 
indicates that some references demonstrate a positive correlation between DNA levels, mRNA 
expression and protein levels, and some show no correlation, in general, there are more cases in 
literature that show a positive correlation than not . 

Taken together, although there are some examples in the scientific art that do not fit 
within the central dogma of molecular biology that there is generally a positive correlation 
between DNA, mRNA, and polypeptide levels, in general, in the majori ty of amplified genes, as 
exemplified by the teachings of Orntoft et al, Hyman et al, Pollack et al, the two Polakis 
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Declarations, the art overwhelmingly show that gene amplification influences gene expression at 
the mRNA and protein levels . The widespread, art accepted use of information obtained from, 
array chips for detecting diagnostic markers lend further support that in general, one of skill in 
the art would reasonably expect in this instance, based on the amplification data for the PR0232 
gene, that the PR0232 polypeptide is concomitantly overexpressed and has utility in the 
diagnosis of lung or colon cancer or for individuals at risk for developing lung or colon cancer. 

Accordingly, Appellants submit that when the proper legal standard is applied, one 
should reach the conclusion that the present application discloses at least one patentable utility 
for the claimed PR0232 polypeptides. Accordingly, one of ordinary skill in the art would also 
understand how to make and use the recited polypeptides for the diagnosis of lung or colon 
cancer without any undue experimentation. 

Issue 2: Priority 

The instant application has not been granted the earlier priority date of U.S. Provisional 
Application 60/059121, filed September 17, 1997 on the grounds that the 60/059121 application 
fails to provide a utility and lacks an enabling disclosure for the claimed invention under 35 
U.S.C.§§101/112, first paragraph. 

Appellants submit that, for the same reasons discussed above under Issue 1, U.S. 
Provisional Application 60/059121 also satisfies the utility requirements. Therefore, Appellants 
should be entitled to the priority date of September 17, 1997. 

Issue 3: Anticipation by Rosenthal et al. 

As discussed above under Issue 2, the present application should be entitled to the earlier 
filing date of September 17, 1997 and therefore, Rosenthal et al, DE19818619-A1, dated 
October 1999, is not prior art. Thus the instant claims are not anticipated by Rosenthal et al. 

These arguments are all discussed in further detail below under the appropriate headings. 

Response to Rejections 

ISSUE 1: The Instant Claims 44-46 and 49-51 Satisfy the Utility Requirement under 35 
U.S.C. S101/S112, First Paragraph based on the results of the gene amplification assay 

The sole basis for the Examiner's rejection of Claims 44-46 and 49-51 under this section 
is that the data presented in the instant Application, allegedly, does not satisfy the requirements 
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of 35 U.S.C. §§101/112, first paragraph. Appellants strongly disagree for the reasons discussed 
below. 

A. The Legal Standard For Utility Under 35 U.S.C. $101 

According to 35 U.S.C. §101: 

Whoever invents or discovers any new and useful process, machine, manufacture, or 
composition of matter, or any new and useful improvement thereof, may obtain a patent 
therefor, subject to the conditions and requirements of this title. (Emphasis added.) 

In interpreting the utility requirement, in Brenner v. Manson the Supreme Court held 
that the quid pro quo contemplated by the U.S. Constitution between the public interest and the 
interest of the inventors required that a patent applicant disclose a "substantial utility" for his or 
her invention, i.e. a utility "where specific benefit exists in currently available form." The Court 
concluded that "a patent is not a hunting license. It is not a reward for the search, but 
compensation for its successful conclusion. A patent system must be related to the world of 
commerce rather than the realm of philosophy." 

Later, in Nelson v. Bowler the C.C.P.A. acknowledged that tests evidencing 
pharmacological activity of a compound may establish practical utility, even though they may 
not establish a specific therapeutic use. The court held that "since it is crucial to provide 
researchers with an incentive to disclose pharmaceutical activities in as many compounds as 
possible, we conclude adequate proof of any such activity constitutes a showing of practical 

utility." 5 



' Brenner v. Manson, 383 U.S. 519, 148 U.S.P.Q. (BNA) 689 (1966). 

2 Id. at 534, 148 U.S.P.Q. (BNA) at 695. 

3 Id. at 536, 148 U.S.P.Q. (BNA) at 696. 

4 Nelson v. Bowler, 626 F.2d 853, 206 U.S.P.Q. (BNA) 881 (C.C.P.A. 1980). 

5 Id. at 856, 206 U.S.P.Q. (BNA) at 883. 
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In Cross v. Iizuka the C.A.F.C. reaffirmed Nelson, and added that in vitro results might 
be sufficient to support practical utility, explaining that "in vitro testing, in general, is relatively 
less complex, less time consuming, and less expensive than in vivo testing. Moreover, in vitro 
results with the particular pharmacological activity are generally predictive of in vivo test results, 

7 

i.e. there is a reasonable correlation there between." The court perceived "No insurmountable 
difficulty" in finding that, under appropriate circumstances, "in vitro testing, may establish a 

8 

practical utility." 

The case law has also clearly established that Appellants' statements of utility are usually 

9 ... 

sufficient, unless such statement of utility is unbelievable on its face. The PTO has the initial 
burden to prove that Appellants' claims of usefulness are not believable on their face. In 
general, an Applicant's assertion of utility creates a presumption of utility that will be sufficient 
to satisfy the utility requirement of 35 U.S.C. §101, "unless there is a reason for one skilled in 
the art to question the objective truth of the statement of utility or its scope." ' 

Compliance with 35 U.S.C. §101 is a question of fact. 13 The evidentiary standard to be 
used throughout ex parte examination in setting forth a rejection is a preponderance of the 

14 

totality of the evidence under consideration. Thus, to overcome the presumption of truth that 

6 Cross v. Iizuka, 753 F.2d 1047, 224 U.S.P.Q. (BNA) 739 (Fed. Cir. 1985). 

7 Id. at 1050, 224 U.S.P.Q. (BNA) at 747. 

8 

Id. 

9 In re Gazave, 379 F.2d 973, 154 U.S.P.Q. (BNA) 92 (C.C.P.A. 1967). 

10 Ibid. 

" In re hanger, 503 F.2d 1380,1391, 183 U.S.P.Q. (BNA) 288, 297 (C.C.P.A. 1974). 

12 See also In re Mies, 628 F.2d 1322, 206 USPQ 885 (C.C.P.A. 1980); In re Irons, 340 F.2d 974, 144 
USPQ351 (1965);//! reSichert, 566 F.2d 1154, 1159, 196 USPQ 209, 212-13 (C.C.P.A. 1977). 

13 Raytheon v. Roper, 724 F.2d 951, 956, 220 U.S.P.Q. (BNA) 592, 596 (Fed. Cir. 1983) cert, denied, 469 
US 835 (1984). 

14 In re Oetiker, 977 F.2d 1443, 1445, 24 U.S.P.Q.2d (BNA) 1443, 1444 (Fed. Cir. 1992). 
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an assertion of utility by the applicant enjoys, the Examiner must establish that it is more likely 
than not that one of ordinary skill in the art would doubt the truth of the statement of utility. 
Only after the Examiner made a proper prima facie showing of lack of utility, does the burden of 
rebuttal shift to the applicant. The issue will then be decided on the totality of evidence. 

The well established case law is clearly reflected in the Utility Examination Guidelines 
("Utility Guidelines")' 5 , which acknowledge that an invention complies with the utility 
requirement of 35 U.S.C. §101, if it has at least one asserted "specific, substantial, and credible 
utility" or a "well-established utility." Under the Utility Guidelines, a utility is "specific" when 
it is particular to the subject matter claimed. For example, it is generally not enough to state that 
a nucleic acid is useful as a diagnostic without also identifying the conditions that are to be 
diagnosed. 

In explaining the "substantial utility" standard, M.P.E.P. §2107.01 cautions, however, 
that Office personnel must be careful not to interpret the phrase "immediate benefit to the 
public" or similar formulations used in certain court decisions to mean that products or services 
based on the claimed invention must be "currently available" to the public in order to satisfy the 
utility requirement. "Rather, any reasonable use that an applicant has identified for the invention 
that can be viewed as providing a public benefit should be accepted as sufficient, at least with 
regard to defining a 'substantial' utility." 16 Indeed, the Guidelines for Examination of 
Applications for Compliance With the Utility Requirement,' 7 gives the following instruction to 
patent examiners: "If the applicant has asserted that the claimed invention is useful for any 
particular practical purpose ... and the assertion would be considered credible by a person of 
ordinary skill in the art, do not impose a rejection based on lack of utility." 

B. Proper Application of the Legal Standard 

Appellants respectfully submit that the data presented in Example 92 starting on page 222 
of the priority application and the cumulative evidence of record, which underlies the current 

15 66 Fed. Reg. 1092(2001). 

16 M.P.E.P. §2107.01. 

17 M.P.E.P. §2107 II (B)(1). 
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dispute, indeed support a "specific, substantial and credible" asserted utility for the presently 
claimed invention. 

Patentable utility for the PR0232 polypeptides is based upon the gene amplification data 
for the gene encoding the PR0232 polypeptide. Example 92 describes the results obtained using 
a very well-known and routinely employed polymerase chain reaction (PCR)-based assay, the 
TaqMan™ PCR assay, also referred to herein as the gene amplification assay. This assay allows 
one to quantitatively measure the level of gene amplification in a given sample, say, a tumor 
extract, or a cell line. It was well known in the art at the time the invention was made that gene 
amplification is an essential mechanism for oncogene activation. Appellants isolated genomic 
DNA from a variety of primary cancers and cancer cell lines that are listed in Table 9 (pages 222 
onwards of the specification), including primary lung and colon cancers of the type and stage 
indicated in Table 8 (page 227). The tumor samples were tested in triplicates with Taqman™ 
primers and with internal controls, beta-actin and GADPH in order to quantitatively compare 
DNA levels between samples (page 229). As a negative control, DNA was isolated from the 
cells often normal healthy individuals, which was pooled and used as a control (page 222, lines 
28-29). The results of TaqMan™ PCR are reported in ACt units, as explained in the passage on 
page 222, lines 37-39. One unit corresponds to one PCR cycle or approximately a 2-fold 
amplification, relative to control, two units correspond to 4-fold, 3 units to 8-fold amplification 
and so on. Using this PCR-based assay, Appellants showed that the gene encoding for PR0232 
was amplified, that is, it showed approximately 1.04-2.40 ACt units for five lung tumors and 
1.00-2.41 ACt units for seven colon tumors, which corresponds to 2 h04 -2 2A0 - fold amplification 
in lung or to 2 l 00 -2 2 41 - fold amplification in colon tumors; that is 2.056-fold to 5.28-fold in five 
lung tumors or 2.00-fold to 5.32-fold in seven colon tumors , which would be considered 
significant and credible by one skilled in the art. Therefore, the PR0232 gene and the PR0232 
polypeptide are important diagnostic markers to identify such malignant lung or colon cancers. 

A prima facie case of lack of utility has not been established 

As discussed above, the increase in DNA copy number for the PR0232 gene is 
significant. Further, the evidentiary standard to be used throughout ex parte examination of a 
patent application is a preponderance of the totality of the evidence under consideration. Thus, 
to overcome the presumption of truth that an assertion of utility by the applicant enjoys; the 
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Examiner must establish that it is more likely than not that one of ordinary skill in the art would 
doubt the truth of the statement of utility. 

Accordingly, it is not a legal requirement to establish a necessary correlation between an 
increase in the copy number of the DNA and protein expression levels that would correlate to the 
disease state or that it is imperative to find evidence that DNA amplification is "necessarily" or 
"always" associated with overexpression of the gene product. Appellants respectfully submit 
that when the proper evidentiary standard is applied, a correlation must be acknowledged. Only 
after the Examiner has made a proper prima facie showing of lack of utility, does the burden of 
rebuttal shift to the Applicant. 

Previously, the Examiner has indicated based on references Pennica et al, Konopka et 
al and Haynes et al to show that gene amplification data cannot reliably predict protein levels. 
Appellants have argued the references in great detail throughout prosecution and these arguments 
are incorporated by reference herein for brevity. Appellants summarize the rejections and the 
arguments submitted below. 

The teachings of Pennica et al are specific to WISP genes, a specific class of closely 
related molecules. Pennica et al showed that there was good correlation between DNA and 
mRNA expression levels for the WISP-1 gene, but not for WISP-2 and WISP-3 genes. WISPs 
1-3 have no structural relationship to the PR0232 polypeptides of the present application. The 
apparent finding that for two out of three specific molecules, that are related to each other but 
have no relationship to PR0232, that there was no correlation between gene amplification and 
the level of mRNA/protein expression does not establish, in general, that it is more likely than 
not that such correlation does not exist, and has no bearing whatsoever on determining the 
question whether such correlation is likely to exist between PR0232 gene amplification and 
mRNA/protein expression levels. As discussed above, the standard is not absolute certainty . 
Pennica et al has no teaching whatsoever about the correlation of gene amplification and protein 
expression for genes in general or PR0232 or related molecules in particular. 

Similarly, in Konopka et al, the Examiner has generalized a very specific result disclosed 
by Konopka et al to cover all genes. Konopka et al actually state that "[p]rotein expression is 
not related to amplification of the abl gene but to variation in the level of bcr-ab\ mRNA 
produced from a single Ph 1 template." (See Konopka et al Abstract, emphasis added). The 
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paper does not teach anything whatsoever about the correlation of protein expression and gene 
amplification in general and provides.no basis for the generalization that apparently underlies 
the present rejection. The statement of Konopka et al that "[p]rotein expression is not related to 
amplification of the abl gene ..." is not sufficient to establish a prima facie case of lack of 
utility. Therefore, the combined teachings of Pennica et al and Konopka et al are not directed 
towards genes in general but to a single gene or genes within a single family and thus, their 
teachings cannot support a general conclusion regarding a correlation between gene 
amplification and mRNA or protein levels. In addition, the abl gene has no structural 
relationship to the PR0232 gene of the present application and thus, Konopka et al. provides no 
information of specific relevance to the question whether for PR0232 there is a reasonable 
expectation that correlation between gene amplification and mRNA/protein expression levels is 
likely to exist. 

The Examiner also cited Haynes et al. to show that transcript levels and protein levels do 
not correlate. However, Appellants had shown that Haynes themselves admit that " there was a 
general trend, although no strong correlation between protein [expression] and transcript levels" 
(see Figure 1 and page 1863, paragraph 2.1, last line). Therefore, when the proper legal standard 
is used, Haynes clearly supports the Appellants' position that in general, a positive correlation 
exists between mRNA and protein expression levels . Since accurate prediction is not the 
standard , a prima facie case of lack of utility has not been met based on the cited references 
Pennica et al, Konopka et al. and Haynes et al Appellants respectfully submit that, contrary to 
the Examiner's assertion, none of the cited reference conclusively establish a prima facie case 
for lack of utility for the PR0232 molecule. 

Appellants have already discussed references Hu et al, Chen et al, Lian et al, Fessler et 
al, in great detail in their previous responses (at least see Appeal brief filed April 17, 2006), and 
these arguments are hereby incorporated by reference for brevity. 

Briefly, the analytical methods utilized by Hu et al have certain statistical drawbacks, as 
the authors themselves admit, and Hu et al.'s conclusions only apply to a specific type of breast 
tumor (estrogen receptor (ER)-positive breast tumor) and cannot be generalized to breast cancer 
genes in general, let alone to cancer genes in general. 
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Regarding Lian et al., Appellants respectfully submit that Lian et al only teach that 
protein expression may not correlate mRNA level in differentiating myeloid cells, and not about 
genes in general . In fact, the authors themselves admit that there were a number of problems 
with their data . For instance, at page 520 of this article, the authors explicitly express their 
concerns regarding the methods they utilized and the interpretation of their data stating that 
' Ttlhese data must be considered with several caveats: membrane and other hydrophobic proteins 
and very basic proteins are not well displayed bv the standard 2DE app roach, and proteins 
presented at low level will be missed. In addition, to simplify MS analysi s, we used a Coomassie 
dve stain rather than silver to visualize proteins, and this decreased the sensitivity of detection of 
minor proteins. " (Emphasis added). Besides, Lian et al.'s conclusions are based on the 
Coomassie dye staining method, which is not a. very sensitive method of measuring protein. 
Similarly, in Fessler et al, examined lipopoysaccharide-activated neutrophilins, in response to 
LPS stimulation. Fessler et al. also used the Coomassie Blue dye staining method, and concede 
that it is known to. have a limited protein binding range and a non-linear curve for protein 
detection. Protein identification in their study was also done using two-dimensional PAGE, 
which is, by their own admission, limited only to well-resolved regions of the gel , and therefore, 
may have performed less well with hydrophobic and high molec ular weight proteins (see page 
31301, col. 1). 

Gvei et al. 

The Examiner has cited new reference Gygi et al. in the Final Office Action of October 
17, 2006 addressing the correlation between mRNA and protein levels. 

Appellants submit that Gygi et al. never indicate that the correlation between mRNA and 
protein levels does not exist. Gygi et al. only state that the correlation may not be sufficient to 
accurately predict the protein level from the level of the corresponding mRNA transcript (see 
page 1270, Abstract). Gygi et al. may teach that protein levels cannot be "predicted" from 
mRNA levels in the sense that the exact numerical amounts of protein present in a tissue cannot 
be determined based upon mRNA levels. But Appellants respectfully submit that the PTO's 
emphasis on the need to "accurately predict" protein levels based on mRNA levels misses the 
point. The asserted utility for the claimed polypeptides is in the diagnosis of cancer. What is 
relevant to use as a cancer diagnostic is relative levels of gene or protein expression, not absolute 
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values, that is, that the gene or protein is differentially expressed in tumors as compared to 
normal tissues. 

Moreover, contrary to the Examiner's statement, the Gygi data indicate a general trend 
of correlation between protein [expression] and transcript levels (Emphasis added). For 
example, as shown in Figure 5, the mRNA abundance of 250-300 copies/cell correlates with the 
protein abundance of 500-1000 x 103 copies/cell. The mRNA abundance of 100-200 copies/cell 
correlates with the protein abundance of 250-500 x 103 copies/cell (emphasis added). Therefore, 
high levels of mRNA generally correlate with high levels of proteins. In fact, most data points 
in Figure 5 did not deviate or scatter away from the general trend of correlation. Thus, the Gygi 
data meets the "more likely than not standard" and shows that a positive correlation exists 
between mRNA and protein. 

In summary, Hu et al, Lian et al, Fessler et al 9 Gygi et al do not conclusively teach 
that, in general, protein levels cannot be accurately predicted from mRNA/gene amplification 
levels. These authors concede that either due to insensitive protein detection methods or due 
their methodologies utilized in their protocols, some protein species may have been under- 
represented over others. Therefore, the teachings of these references cannot be relied upon to 
establish a prima facie showing of lack of utility. On the other hand, as noted even in Haynes et 
al and Gygi et al most genes showed a positive correlation between increased gene 
amplification, mRNA and translated protein. 

The Examiner asserts that, of the references cited by Appellants in their IDS filed July 
27, 2006, "only Godbout et al is pertinent." 

Appellants respectfully submit that, as discussed in the RCE response of July 27, 2006, 
there are several other references; for instance, Bea et al who investigated gene amplification, 
mRNA expression, and protein expression of the putative oncogene BMI-1 in human lymphoma 
samples, and support Appellants' assertion that gene amplification is correlated with both 
increased mRNA and protein expression. 

Godbout et al (Item 46 of Evidence List) 

Regarding the Godbout reference, the Examiner alleges that the instant specification does 
not teach structure/function analysis; the Examiner says that she finds no reason to suspect if 
PR0232 can confer selective advantage to a cell. The Examiner questions whether the level of 
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genomic amplification of DDX1 gene is comparable to that of PR0232 (page 5 of Final Office 
Action of October 17, 2006). 

Appellants respectfully submit that it was never claimed that PR0232 is similar in any 
way to the DDX1 gene of Godbout et al., they never claimed PR0232 was an RNA helicase or 
that it confers selective advantage to cell survival; on the other hand, the Godbout reference was 
submitted to show good correlation between protein levels based upon genomic DNA 
amplification, which the Examiner clearly agrees with. Moreover, selective advantage to cell 
survival is not the only mechanism by which genes impact cancer and structure/function data, 
which the Examiner requests, is not a requirement for the utility requirement. Hence a prima 
facie case has not been established and this rejection is improper. 

Saito- Hisaminato et al and Bustin et al 

The Examiner refers to the microarray assay and the "universal control" and indicates 
that proper controls are needed for the microarray assay. The Examiner rejects the use of the 
universal control based on the teachings within Saito- Hisaminato et al (Pages 18-19 of the 
instant Final Office action). The Examiner further cites Bustin et al without any explanations 
for the citation. 

As discussed above under Section VII, Appellants respectfully submit that they rely upon 
the gene amplification data of the PR0232 gene for patentable utility of the PR0232 
polypeptides in the present application, not the microarray assay . Appellants would like to 
distinguish between the gene amplification and the microarray assay. The gene amplification 
assay measures the level at which a certain gene (i.e. DNA) is amplified in the genome, whereas 
the microarray assay measures the level of expression of a mRNA encoding for a certain 
polypeptide in a sample. Throughout prosecution, the Examiner fails to distinguish between 
these two techniques, but Appellants submit that the two assays, although similar, are not the 
same. Any rejection directed to the limitations of the microarray assay or to the universal control 
are moot and are not addressed here. Only relevance of microarray array results to correlate 
mRNA to protein levels is discussed. 

No "universal control" was used in the instantly claimed invention. Since "Saito- 
Hisaminato" primarily discusses the universal control, its teachings do not apply to the instant 
invention in any manner and is moot. 
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The Examiner has not provided reasons for why Bustin et al, which uses and analyzes 
microarray technology, was cited. Without conceding to the propriety of this rejection, 
Appellants note that Bustin et al do not dispute the use of microarrays to obtain biologically 
relevant data, noting that "several microarray experiments have generated clinically relevant 
quantitative gene profiles" (page 271, col. 1). In fact, Bustin et al conclude that "[m]icroarrays 
are already having a major impact on cancer biology, pharmacology and drug development" and 
state that "the major limiting factor in their further application is the current lack of data 
comparability, which is essential for appropriate comparisons between different arrays." 

Accordingly, a prima facie case has not been established based on Saito- Hisaminato et 
al and Bustin et al and this rejection is improper. 

Lillev et al.* King et al.. Lee et al and Wildsmith et al. 

First of all, the references Lilley et al, King et al, Lee et al and Wildsmith et al all 
utilize and analyze the microarray assay. The Examiner cites the Lee et al reference allege a 
limitation that requires replication of microarray assays (see page 14, line 3 of Final Office 
action). The Examiner cites King et al to allege a limitation that microarray has variability for 
high- or medium- abundance mRNAs (see page 14, lines 12-13 of Final Office action). The 
Examiner further indicates, based on Lilley et al, Wildsmith et al, and King et al, that the state 
of the art demonstrates that correlation between mRNA and protein abundance "cannot be 
accurately predicted" (page 12 of Final Office action). 

Again, as discussed above, without conceding to the propriety of the rejections stated 
above, Appellants submit that the rejections directed to the limitations of microarray assay are 
moot and are not addressed here, since Appellants rely upon the gene amplification data of the 
PR0232 gene for patentable utility of the PR0232 polypeptides and not the microarray assay . 

Appellants add that it is not a legal requirement for utility, to establish a necessary 
correlation between an increase in the mRNA level and protein expression levels; that is, the 
utility standard is not absolute certainty or it is not necessary to show that changes in transcript 
level should always result in corresponding changes in protein amount or activity. Accordingly, 
the question is not whether a correlation between an increase in mRNA and protein expression 
levels always exists, rather, if it is more likely than not to exist, and whether a person of ordinary 
skill in the pertinent art would recognize such a positive correlation. 
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Nowhere in the Lilley, Wildsmith or the King papers do the authors suggest that it is 
more likely than not that altered mRNA levels does not correlate with altered protein levels. In 
fact, the King reference discussed numerous advantages of the microarray technology, which 
offers tremendous advantages in the study of human diseases. For instance, on page 2287, the 
author states that "microarrays can be expected to prove extremely valuable as tools for the study 
of the generic basis of complex diseases. The ability to measure expression profiles across entire 

genomes provides a level of information not previously attainable Microarrays make it 

possible to investigate differential gene expression in normal vs. diseased tissue, in treated vs. 
non-treated tissue, and in different stages during the natural course of the disease, all on a 
genomic scale. Gene expression profiles may help to unlock the molecular basis of phenotype, 

response to treatment, and heterogeneity of disease " Therefore, if anything, the King 

reference supports the use of the microarray in the diagnosis of human diseases, which silently 
assumes that, most probably, increases in mRNA levels correlate well with increases in protein 
levels which in turn impacts disease. 

Similarly, the Wildsmith paper discusses examples of a number of successes of 
microarray applications in the detection of human diseases (see Page 284). For instance, the 
author points out that "one area of rapid progress using microarray technology is the increased 
understanding of cancer. Molecular pathologies are subgrouping cancers of tissues such as 
blood, skin, and breast, based on differential gene expression patterns. For example, within a 
small group of breast cancer tissue samples, Perou et al distinguished two broad subgroups 
representing those expressing or alternatively lacking expression of the oestrogen receptor- y- 
gene. The work was not conclusive, but never has progress in this field been so rapid when 
compared with the previous methods of gene amplification. Another example of the impact of 
this technology is in the identification of two biomarkers for prostate cancer, namely hepsin and 
PIM1 (Dhanasekaran et a/., 2001). Microarray technology has also accelerated the 
understanding of the molecular events surrounding pulmonary fibrosis. Specially, two distinct 
clusters of genes associated with inflammation and fibrosis have been identified in a disease 
where, for years, the pathogenesis and treatment have remained unknown (Katsuma et aL, 
2001)." 
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Therefore, contrary to the Examiner's position regarding the Lilley et al, Lee et al, 
Wildsmith et al, and King et al references, collectively, the references show that the art 
indicates that, generally, if a mRNA is overexpressed in cancer, it is more likely than not that the 
encoded protein will also be expressed at an elevated level 

Madoz Gurpide et al and Feroze-Merzoug et al 

Again, the Examiner cites Madoz-Gurpide et al and Feroze-Merzoug et al to show that 
one cannot accurately predict protein levels based on mRNA levels (made of record by the 
Examiner in the Final Office Action mailed October 17, 2006). 

Appellants respectfully disagree. Madoz-Gurpide et al explains that mRNA expression 
alone does not provide information regarding the "activation state, post-translational 
modification or localization of corresponding proteins" (emphasis added; page 168, col 1). 
That is, Madoz-Gurpide et al explain that mechanisms are not apparent from mRNA expression 
alone. Madoz Gurpide et al further state that, it is "unclear" how well the reported RNA levels 
correlate with protein levels. In support of this assertion, the authors cite only a single reference, 
namely, the Chen et al, which was discussed above and in detail in the Appeal Brief filed 
April 17, 2006. Madoz Gurpide et al also acknowledge that the DNA microarray studies, such 
as those carried out by Beer et al (specifically cited by the authors at page 52), "justify the use 
of this technology for uncovering patterns of gene expression that are clinically 
informative" (emphasis added; page 53). Thus, while Madoz-Gurpide et al note that it is "more 
difficult to develop an understanding of disease at a mechanistic level using DNA 
microarrays," (emphasis added; page 53), Appellants respectfully point out that that 
"understanding of a disease at the mechanistic level" is not relevant to Appellants' assertions of 
utility, as discussed above. Accordingly, a prima facie case cannot be made based on the 
teachings within the Madoz-Gurpide et al reference. 

Similarly, Feroze-Merzoug et al appear to mainly focus on "accurately predicting" the 
precise levels of protein expression, which is not required for utility as a cancer diagnostic, as 
discussed above. Moreover, the teachings of Feroze-Merzoug et al are directed specifically to 
androgen regulated genes, which clearly involve different biological processes than those 
involved in tumor development. Feroze-Merzoug et al are not directed to genes in general, 
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either, and hence this reference cannot be relied upon to establish a prima facie case for lack of 
utility. 

Naearaia et al.. Waghrav et al. Sagvnaliev et al. 

Again, new references Nagaraja et al, Waghray et al, Sagynaliev et al. all utilize and 
analyze the microarray assay. Any rejection directed to the limitations of the microarray assay or 
to the universal control are moot and are not addressed here. Only relevance of microarray array 
results to correlate mRNA to protein levels is discussed. 

The Examiner asserts that "[c]omprehensive studies where significantly large numbers of 
transcripts and proteins were examined report that increases in mRNA and protein samples are 
not correlated." (Final Office Action of October 17, 2006). The Examiner cites Nagaraja et al. 
as allegedly teaching that "the proteomic profiles indicated altered abundance of fewer proteins 
as compared to transcript profiles." (Final Office Action of October 17, 2006). 

Appellants respectfully submit that the fact that many more transcripts than' proteins were 
found to be differentially expressed does not mean that most mRNA changes did not result in 
correlating protein changes, but merely reflects the fact that expression levels were only 
measured at all for many fewer proteins than transcripts . In particular, the total number of 
proteins whose expression levels could be visualized on silver-stained gels was only about 300 
(page 2332, col. 1), as compared to the approximately 14,500 genes on the microarray chips for 
which mRNA levels were measured (page 2336, col. 1). Since the expression levels of so many 
fewer proteins than transcripts were measured, it is hardly surprising that a smaller absolute 
number of proteins than mRNAs were found to be overexpressed, because the protein products 
of most of the overexpressed mRNAs would not have been among the small number of proteins 
identified on the gels. 

The Examiner next cites Waghray et al., to the effect that "for most of the proteins 
identified, there was no appreciable concordant change at the RNA level," and that "[t]he change 
in intensity for most of the affected proteins identified could not be predicted based on the level 
of the corresponding RNA." (Final Office Action of October 17, 2006). Appellants reiterate that 
they need only show that there is a correlation between mRNA and protein levels, such that 
mRNA overexpression generally predict protein overexpression. A showing that mRNA levels 
can be used to "accurately predict" the precise levels of protein expression is not required . 
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Appellants also emphasize that Appellants are asserting that a measurable change in 
mRNA level generally leads to a corresponding change in the level of protein expression, not 
that changes in protein level can be used to predict changes in mRNA level. Waehrav et al did 
not take genes which showed significant mRNA changes and check the corresponding protein 
levels . Instead, the authors looked at a small and unrepresentative number of proteins, and 
checked the corresponding mRNA levels. Waghray et al acknowledge that only "[a] relatively 
small set of genes could be analyzed at the protein level, largely due to the limited sensitivity of 
2-D PAGE" (page 1337, col. 1). In particular, while the authors examined the expression levels 
of 16,570 genes (page 1329, col. 2), they were able to measure the expression levels of only 
1031 proteins (page 1333, col. 2). Waghray et al does not teach that changes in mRNA 
expression were not correlated with changes in expression of the corresponding protein. All 
Waghray et al state is that "for most of the proteins identified, there was no appreciable 
concordant change at the mRNA level" (page 1337, col. 2). This statement is not relevant to 
Appellants' assertion of utility, since Appellants are not asserting that changes in mRNA levels 
are the only cause of changes in protein levels. Waghray et al do not contradict Appellants' 
assertion that changes in mRNA expression, in general, correspond to changes in expression of 
the corresponding protein. 

Lastly, the Examiner cites Sagynaliev et al, as allegedly teaching that "it is also difficult 
to reproduce transcriptomics results with proteomics tools." In particular, the Examiner notes 
that according to Sagynaliev et al, of 982 genes found to be differentially expressed in human 
CRC, only 177 (18%) have been confirmed using proteomics technologies. (Final Office Action 
of October 17, 2006). 

The Sagynaliev et al reference, titled " Web-based data warehouse on gene expression in 
human colorectal cancer" (emphasis added), drew conclusions based upon a literature survey of gene 
expression data published in human CRC , and not from experimental data. While a literature survey 
can be a useful tool to assist researchers, the results may greatly over-represent or under-represent 
certain genes, and thus the conclusions may not be generally applicable. In particular, Appellants 
note that, as evidenced by Lian et al, Nagaraja et al, and Waghray et al, discussed above, the 
number of mRNAs examined in transcriptomics studies is typically much larger than the number of 
proteins examined in corresponding proteomics studies , due to the difficulties in detecting and 
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resolving more than a small minority of all expressed proteins on 2D gels. Thus the fact that only 
18% of all genes found to be differentially expressed in human CRC have been confirmed using 
proteomics technologies does not mean that the corresponding proteins are not also differentially 
expressed, but is most likely due to the fact that the corresponding proteins were not identified on 2D 
gels, and thus their expression levels remain unknown. 

The authors of Sagynaliev et al. acknowledge the many technical problems in finding 
proteomic data for CRC that can be matched to transcriptomic data to see if the two correlate. The 
authors state that "results have been obtained using heterogeneous samples in particular cell lines, 
whole tissue biopsies, and epithelial cells purified from surgical specimens." However, "Results 
obtained in cell lines do not allow accurate comparison between normal and cancer cells, and the 
presence/absence of proteins of interest has to be confirmed in biopsies." (Page 3072, left column.) 
In particular, the authors specifically note that "only a single study [1] provided differential display 
protein expression data obtained in the human patient, using whole tissue biopsy." (Page 3068, left 
column, second paragraph; see also, Table 2.) 

Appellants further note that Table 2 shows that 6 out of 8 published proteomics studies were 
done using 2-D PAGE. However, the authors state that "2-D PAGE or 2-D DIGE have well-known 
technological limitations . . . even under well-defined experimental conditions, 2-D PAGE parallel 
analysis of paired CRC samples is hampered by a significant variability." (Page 3077, left column, 
third paragraph.) Therefore, Appellants respectfully submit that it is well known in the art that there 
are problems associated with selecting only those proteins detectable by 2D gels. 

Accordingly, the Examiner cannot rely on the teachings of Nagaraja et al., Waghray et 
al, Sagynaliev et al. to establish a prima facie showing of lack of utility. 

Li et al. 

The Examiner cites new reference Li et al. as teaching that "68.8% of the genes showing 
over-representation in the genome did not show elevated transcript levels." (Final Office Action 
of October 17,2006). 

Appellants respectfully point out that Li et al. acknowledge that their results differed 
from those obtained by Hyman etal. and Pollack et al. (of record), who found a substantially 
higher level of correlation between gene amplification and increased gene expression. The 
authors note that "[t]his discordance may reflect methodologic differences between studies or 
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biological differences between breast cancer and lung adenocarcinoma" (page 2629, col. 1). For 
instance, as explained in. the Supplemental Information accompanying the Li article, genes were 
considered to be amplified if they had a copy number ratio of at least 1.40 . In the case of 
PR0232, as discussed in previously filed responses and in the Goddard Declaration (of record), 
an appropriate threshold for considering gene amplification to be significant is a copy number of 
at least 2.0 (which is a higher threshold). The PR0232 gene showed significant amplification of 
2.056-fold to 5.28-fold, in five lung tumors or 2.00-fold to 5.32-fold in seven colon tumors , 
and thus fully meets this standard. It is not surprising that, in the Li et al reference, by using a 
lower threshold of 1.4 for considering gene amplification, a higher number of genes not showing 
corresponding increases in mRNA expression were found. Nevertheless, the results of Li et aL 
do not conclusively disprove that a gene with a substantially higher level of gene amplification, 
such as PRQ232 , would be expected to show a corresponding increase in transcript expression. 

Therefore, the Patent Office has failed to meet its initial burden of proof that Appellants' 
claims of utility are not substantial or credible. The arguments presented by the Examiner based 
on references Pennica et al, Konopka et al, Haynes et al, Hu et aL, Lian et al, Fessler et al, 
Gygi et al, Lilley et al, Lee et al, King et al, Wildsmith et aL, Nagaraja et aL, Sagynaliev et 
al, Waghray et al, Madoz-Gurpide et al, Feroze-Merzoug et al, Bustin et al, Saito-Hisaminato 
et al, do not provide sufficient reasons to doubt the statements by Appellants that PR0232 has 
utility as a diagnostic marker for lung or colon cancer. Appellants once again remind the 
Examiner that only after the Examiner has made a proper prima facie showing of lack of utility, 
does the burden of rebuttal shift to the Appellant. Based on the above discussions, such a 
showing has not been made. Accordingly, the instant rejection should be withdrawn for the 
Examiner's lack of establishment of a prima facie showing. 

Moreover, Appellants acknowledge that, in certain instances, DNA/mRNA and protein 
levels do not correlate. In fact, Appellants have included several such references directed 
towards a single gene or genes that lack a correlation in their IDS filed July 27, 2006. The 
references discussed in the Preliminary Amendment filed July 27, 2006, and the arguments 
therein are hereby incorporated by reference for brevity. The IDS included references that 
studied single genes or gene families, multiple or large families of genes, and included studies 
that a wide variety of techniques including gene amplification and microarray. Regardless of the 
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techniques employed, by and large, increased genes/transcripts levels mostly correlated with 
increased protein levels, even if accurate predictions of proteins could not be made . As 
discussed throughout prosecution, the law does not require the existence of a "necessary" 
correlation between DNA/mRNA and protein levels, or that protein levels be "accurately 
predicted". In fact, authors in several of the cited references (cited both, by the Examiner, and 
by Appellants) themselves acknowledge that there is a general correlation between protein 
expression and transcript levels and DNA levels, which meets the "more likely than not 
standard". 

In summary, Appellants maintain that even though there are certain instances where a 
correlation, between DNA/mRNA and protein levels do not exist, in most cases, there is 
generally good correlation between them, and this was collectively demonstrated in the more 
than 100 references submitted by the Appellants in the IDS filed July 27, 2006. 

D. The Gene Amplification Data Establishes Credible. Substan tial and Specific 
Patentable Utility for the PRQ232 Polypeptide 

On the other hand, as discussed throughout prosecution, Appellants submit that Example 
170 of the specification further discloses that, "(a)mplificatio'n is associated with overexpression 
of the gene product, indicating that the polypeptides are usef ul targets for therapeutic 
intervention in certain cancers such as lung, colon, breast and other cancers and diagnostic 
determination of the presence of those cancers" (Emphasis added). Appellants have also 
submitted ample evidence to show that, in general, if a gene is amplified in cancer, it is "more 
likely than not" that the encoded protein will also be expressed at an elevated level. 

Besides the reference, the Declaration by Dr. Paul Polakis (Polakis I - made of record in 
Appellants' Response filed August 9, 2004), principal investigator of the Tumor Antigen Project 
of Genentech, Inc., the assignee of the present application, explains that in the course of Dr. 
Polakis' research using microarray analysis, he and his co-workers identified approximately 200 
gene transcripts that are present in human tumor cells at significantly higher levels than in 
corresponding normal human cells. Appellants submit that Dr. Polakis' Declaration was 
presented to support the position that there is a correlation between mRNA levels and 
polypeptide levels. The second Declaration by Dr. Polakis (Polakis II) presented evidentiary 
data in Exhibit B. Exhibit B of the Declaration identified 28 gene transcripts out of 31 gene 
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transcripts (i.e., greater than 90%) that showed good correlation between tumor mRNA and 
tumor protein levels. As Dr. Polakis' Declaration (Polakis II - made of record in Appellants 1 
Response filed July 27, 2006) says "[a]s such, in the cases where we have been able to 
quantitatively measure both (i) mRNA and (ii) protein levels in both (i) tumor tissue and (ii) 
normal tissue, we have observed that in the vast majority of cases, there is a very strong 
correlation between increases in mRNA expression and increases in the level of protein encoded 
by that mRNA." Accordingly, Dr. Polakis has provided the facts to enable the Examiner to draw 
independent conclusions regarding protein data. Appellants further emphasize that the opinions 
expressed in the Polakis Declaration, including in the above quoted statement, are all based on 
factual findings. For instance, antibodies binding to about 30 of these tumor antigens were 
prepared and mRNA and protein levels were compared. In approximately 80% of the cases , the 
researchers found that increases in the level of a particular mRNA correlated with changes in the 
level of protein expressed from that mRNA when human tumor cells are compared with their 
corresponding normal cells . Therefore, Dr. Polakis 1 research, which is referenced in his 
Declaration, shows that, in general there is a correlation between increased mRNA and 
polypeptide levels . Hence, one of skill in the art would reasonably expect that, based on the gene 
amplification data of the PR0232 gene, the PR0232 polypeptide is concomitantly overexpressed 
in lung or colon tumors studied as well. 

Appellants further note that the sale of gene expression chips to measure mRNA levels is 
a highly successful business, with a company such as Affymetrix recording 168.3 million dollars 
in sales of their GeneChip® arrays in 2004. Clearly, the research community believes that the 
information obtained from these chips is useful (i.e., that it is more likely than not that the results 
are informative of protein levels). 

Thus, based on the asserted utility for PR0232 in the diagnosis of selected lung or colon 
tumors, the reduction to practice of the instantly claimed protein sequence of SEQ ID NO: 18 in 
the present application (also see pages 3- 4 and Example 4, page 149-150), the step-by-step 
preparation of chimeric PRO polypeptides, including those wherein the heterologous polypeptide 
is an epitope tag or an Fc region of an immunoglobulin (page 74, lines 23 to page 75, line 5), the 
description of the expression of PRO polypeptides in various host cells, including E. coli, 
mammalian cells, yeast and Baculovirus-infected insect cells at least in Examples 53-56, pages 
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192-199, the disclosure of the step-by-step protocol for the preparation, isolation and detection of 
monoclonal, polyclonal and other types of antibodies against the PR0232 protein in the 
specification (monoclonal and polyclonal antibodies at page 139, line 32, to page 141, line 13; 
humanized antibodies at page 141, line 15, to page 142, line 16; antibody fragments at page 143, 
line 8 onwards; labeled antibodies at pages 144-145, line 16 onwards and page 146, line 33 to 
page 147, line 6) and the disclosure of the gene amplification assay in Example 92, the skilled 
artisan would know exactly how to make and use the claimed polypeptides for the diagnosis of 
lung or colon cancers. Appellants submit that based on the detailed information presented in the 
specification and the advanced state of the art in oncology, the skilled artisan would have found 
such testing routine and not 'undue.' 

Therefore, since the instantly claimed invention is supported by either a credible, specific 
and substantial asserted utility or a well : established utility, one skilled in the art would know 
"how to make and use" the claimed invention without undue experimentation, Appellants 
respectfully request reconsideration and reversal of the determination of priority for Claims 44- 
46 and 49-51. 

ISSUE 2. Claims 44-46 and 49-51 should be entitled to the priority date of U.S. P rovisional 
Application 60/059121. filed September 17. 1997 

The instant application has not been granted the earlier priority date of U.S. Provisional 
Application 60/059 1 2 1 , filed September 1 7, 1 997 on the grounds that the prior 60/059 1 2 1 
application fails to provide a utility and lacks an enabling disclosure for the claimed invention 
under 35 U.S.C.§§ 101/1 12, first paragraph." 

Appellants disagree and submit that, for the same reasons discussed above under Issue 1, 
U.S. Provisional Application 60/059121 also satisfies the utility requirements. Therefore, 
Appellants should be entitled to the priority date of September 17, 1997. 

ISSUE 3. Claims 44-46 and 49-51 are not anticipated by Rosenthal et aL, DE19818619-A1 
(dated 10/1999) 

Claims 44-46 and 49-51 remain rejected under 35 U.S.C. § 102(b) as being anticipated by 
Rosenthal et al., DE19818619-A1 (dated 10/1999). 
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For the reasons discussed above under Issue 2, Appellants maintain that they are entitled 
to an effective filing date of September 17, 1997 based on a properly claimed priority to 
International application PCT/US98/18824. Therefore, Rosenthal et al is not prior art and does 
not anticipate the instant claims. Accordingly, this rejection under 35 U.S.C. § 1 02(b) should be 
withdrawn. 



For the reasons given above, Appellants submit that present specification and the 
specification of U.S. Provisional Application 60/059121 dated September 17, 1997 clearly 
describes and provides at least one patentable utility for the instantly claimed invention. 
Moreover, it is respectfully submitted that the present specification clearly teaches "how to use" 
the presently claimed polypeptide based upon this disclosed patentable utility. Accordingly, 
Rosenthal et aL, DE19818619-A1 is not prior art. As such, Appellants respectfully request 
reconsideration and reversal of the outstanding rejection of claims 44-46 and 49-51. 

The Commissioner is authorized to charge any fees which may be required, including 
extension fees, or credit any overpayment to Deposit Account No. 08-1641 (referencing 
Attorney's Docket No. 39780-1618 P2C18 . 



HELLER EHRMAN LLP 

275 Middlefield Road 
Menlo Park, California 94025-3506 
Telephone: (650) 324-7000 
Facsimile: (650) 324-0638 



CONCLUSION 



Respectfully submitted, 



Date: July 23, 2007 




Ginger R. Dreger 
Reg. No. 33,055 
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VIII. CLAIMS APPENDIX 

Claims on Appeal 

44. An isolated polypeptide comprising: 

(a) . the amino acid sequence of the polypeptide of SEQ ID NO: 1 8 ; 

(b) the amino acid sequence of the polypeptide of SEQ ID NO: 1 8, lacking its 
associated signal peptide; or 

(c) the amino acid sequence of the polypeptide encoded by the full-length coding 
sequence of the cDNA deposited under ATCC accession number 209250; 

wherein, the nucleic acid encoding said polypeptide is amplified in lung or colon tumors. 

45. The isolated polypeptide of Claim 44 comprising the amino acid sequence of the 
polypeptide of SEQ ID NO: 18. 

46. The isolated polypeptide of Claim 44 comprising the amino acid sequence of the 
polypeptide of SEQ ID NO: 1 8, lacking its associated signal peptide. 

49. The isolated polypeptide of Claim 44 comprising the amino acid sequence of the 
polypeptide encoded by the full-length coding sequence of the cDNA deposited under ATCC 
accession number 209250. 

50. A chimeric polypeptide comprising a polypeptide according to Claim 44 fused to 
a heterologous polypeptide. 

51. The chimeric polypeptide of Claim 50, wherein said heterologous polypeptide is 
an epitope tag or an Fc region of an immunoglobulin. 
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X. RELATED PROCEEDINGS APPENDIX 

None- no decision rendered by a Court or the Board in any related proceedings identified 

above. 
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Beschreibung 

Die Erfindung betrifft menschliche Nuklcinsauresequenzen aus Blasentumorgewebe, die fur Genprodukte oder Teile 
davon kodieren, deren funktionale Gene, die mindestens ein biologisch aktives Polypeptid kcxiieren und deren \ferwen- 
5 dung. 

Die Erfindung betrifft weiterhin die iiber die Sequenzen erhaltlichen Polypeptide und deren Verwendung. 
Eine der Hauptkrebstodesursachen ist der Blasentumor, fiir dessen Bekampfung neue Therapien notwendig sind. Bis- 
her verwendete Therapien, wie z. B. Chemotherapie, Honnontherapie oder chirurgische Entfernung des TYimorgewebes, 
fuhren haufig nicht zu einer vollstandigen Heilung. 

10 Das Phanomen Krebs geht haufig cinher mit der tjber- oder Unterexpression gewisser Gene in den entarteten Zellen, 
wobei noch unklar ist, ob diese veranderten Expressionsraten Ursacbe oder Folge der malignen Transformation sind. Die 
Identifikation solcher Gene ware ein wesentlicher Schritt fiir die Entwicklung neuer Therapien gegen Krebs. Der spon- 
tanen Entstehung von Krebs geht haufig eine Vielzahl von Mutationen voraus. Diese kdnnen verschiedenste Auswixkun- 
gen auf das Expressionsmuster in dem betroffenen Gewebe haben, wie z. B. Unter- oder Uberexpression, aber auch Ex- 

15 pression verkiirzter Gene. Mehrere solcher Veranderungen durch solche Mutationskaskaden konnen schlieBlich zu bos- 
artigen Entartungen fuhren. Die Komplexitat solcher Zusammenhange erschwert die experimentelle Herangehensweise 
sehr. 

Fiir die Suche nach Kandidatengenen, d. h. Genen, die im Vergleich zum TYimorgewebe im normalen Gewebe starker 
exprimiert werden, wird eine Datenbank verwendet, die aus sogenannten ESTs besteht. ESTs (Expressed Sequence T&gs) 

20 sind Sequenzen von cDNAs, d. h. revers transkribierten mRNAs, den Molekulen also, die die Expression von Genen wi- 
derspiegeln. Die EST-Sequenzen werden fur nonnale und entartete Gewebe ermittelt. Solche Datenbanken werden von 
verschiedenen Betreibem z. T. kommerziell angeboten. Die ESTs der LifeSeq-Datenbank, die hier verwendet wird, sind 
in der Regel zwischen 150 und 350 Nukleotide lang. Sie reprasentieren ein fur ein bestimmtes Gen unverkennbares Mu- 
ster, obwohl dieses Gen normalerweise sehr viel langer ist (> 2000 Nukleotide). Durch Vergleich der Expressionsmuster 

25 von normalen und TYimorgewebe konnen ESTs identifiziert werden, die fur die TYimorentstehung und -proliferation 
wichtig sind. Es besteht jedoch folgendes Problem: Da durch unterschiedliche Konstruktionen der cDNA-Bibliotheken 
die gefundenen EST-Sequenzen zu unterschiedlichen Regionen eines unbekannten Gens gehoren konnen, ergabe sich in 
einem solchen Fall ein vollig falsches VerhaTtnis des Vorkommens dieser ESTs in dem jeweiligen Gewebe. Dieses wurde 
erst bemerkt werden, wenn das vollstandige Gen bekannt ist und somit die ESTs dem gleichen Gen zugeordnet werden 

30 konnen. 

Es wurde nun gefunden, daB diese Fehlermoglichkeit verringert werden kann, wenn zuvor samtliche ESTs aus dem je- 
weiligen Gewebstyp assembliert werden, bevor die Expressionsmuster miteinander verglichen werden. Es wurden also 
iiberlappende ESTs ein und desselben Gens zu langeren Sequenzen zusammengefaBt (s. Fig. 1, Fig. 2a und Fig. 3). 
Durch diese Verlangerung und damit Abdeckung eines wesentlich groBeren Genbereichs in jeder der jeweiligen Banken 
35 sollte der oben beschriebene Fehler weitgehendst vermieden werden. Da es hierzu keine bestehenden Softwareprodukte 
gab, wurden Programme fur das Assemblieren von genomischen Abschnitten verwendet, die abgewandelt eingcsetzt und 
durch eigene Programme erganzt wurden. Ein Flowchart der Assemblierungsprozedur ist in Fig. 2bl-2b4 dargestellt. 

Es konnten nun die Nukleinsaure-Sequenzen Seq. ID No. 1 50 gefunden werden, die als Kandidatengene beim Bla- 
sentumor eine Rolle spielen. 

40 Von besonderem Interesse sind die Nukleinsaure-Sequenzen Seq. ID Nos. 2-5, 7-13, 16, 18, 20, 23, 26-27, 31-32, 36, 
45. 

Die Erfindung betrifft somit Nukleinsaure-Sequenzen, die ein Genprodukt oder ein Teil davon kodieren, umfassend 

a) eine Nukleinsaure-Sequenz, ausgewahlt aus der Gruppe der Nukleinsaure-Sequenzen Seq. ID Nos. 2-5, 7-13, 
45 16, 18, 20, 23, 26-27, 31-32, 36, 45. 

b) cine allelische Variation der unter a) genanntcn Nukleinsaure-Sequenzen 
oder 

c) eine Nukleinsaure-Sequenz, die komplementar zu den unter a) oder b) genannten Nukleinsaure-Sequenzen ist. 

50 Die Erfindung betrifft weiterhin eine Nukleinsaure-Sequenz gemaB einer der Sequenzen Seq. ID Nos. 2-5, 7-13, 16, 
18, 20, 23, 26-27, 31-32, 36, 45 oder eine komplementare oder allelische Variante davon und die Nukleinsaure-Sequen- 
zen davon, die eine 90%ige bis 95%ige Homologie zu einer humanen Nukleinsaure-Sequenz aufweisen. 

Die Erfindung betrifft auch die Nukleinsaure-Sequenzen Seq. ID No. 1 bis Seq. ID No. 50, die im Blasentumorgewebe 
erhoht exprimiert sind. 

55 Die Erfindung betrifft ferner Nukleinsaure-Sequenzen, umfassend einen Teil der oben genannten Nukleinsaure-Se- 
quenzen, in solch einer ausreichenden GroBe, daB sie mit den Sequenzen Seq. ID Nos. 1-50 hybridisieren. 

Die erfindungsgemaBen Nukleinsaure-Sequenzen weisen im allgemeinen eine Lange von mindestens 50 bis 4500 bp, 
vorzugsweise eine Lange von mindestens 150 bis 4000 bp, insbesondere eine Lange von 450 bis 3500 bp auf. 

Mit den erfindungsgemaBen Teilsequenzen Seq. ID Nos. 1-50 konnen gemaB gangiger Verfahrenspraxis auch Expres- 
60 sionskassetten konstruiert werden, wobei auf der Kassette mindestens eine der erfindungsgemaBen Nukleinsaure-Se- 
quenzen zusammen mit mindestens einer dem Fachmann allgemein bekannten Kontroll- oder regulatorischen Sequenz, 
wie z. B. einem geeigneten Promoter, kombiniert wird. Die erfindungsgemaBen Sequenzen konnen in sense oder anti- 
sense Orientierung eingefugt sein. 
In der Literatur sind ist eine groBe Anzahl von Expressionskassetten bzw. Vektoren und Promotoren bekannt, die ver- 
65 wendet werden konnen. 

Unter Expressionskassetten bzw. Vektoren sind zu verstehen: 1. bakterielle, wie z. B., phagescript, pBs, (|>X174, pBlu- 
escript SK, pBs KS, pNH8a, pNH16a, pnH18a, pNH46a (Stratagene), pTVc99A, pKK223-3, pKK233-3, pDR540, pRTT5 
(Pharmacia), 2. eukaryontische, wie z. B. pWLneo, pSV2cat, pOG44, pXTl, pSG (Stratagene), pSVK3, pBPV, pMSG, 
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pSVL (Pharmacia). 

Unter Kontroll- oder regulatorischer Sequenz sind geeignete Promotoren zu verstehen. Hierbei sind zwei bevorzugte 
Vektoren der pKK232-8 und dcr PCM7 Vektor. Im einzelnen sind folgende Promotoren gemeint: lad, lacZ, 13, T7, gpt, 
lambda P R , trc, CMV, HSV Thymidin-Kinase, S V40, LTRs aus Retrovirus und Maus MetaUothionein-L 

Die auf der Expressionskassette befindlichen DNA-Sequenzen konnen ein Fusionsprotein kodieren, das ein bekanntes 5 
Protein und ein biologisch aktives Polypeptid-Fragment umfaBt. 

Die Expressionskassetten sind ebenfalls Gegenstand der vorliegenden Erfindung. 

Die erfindungsgemaBen Nukleinsaure-Fragmente konnen zur Herstellung von \bllangen-Genen verwendet werden. 
Die erhaltlichen Gene sind ebenfalls Gegenstand der vorliegenden Erfindung. 

Die Erfindung betrifft auch die Verwendung der erfindungsgemaBen Nukleinsaure-Sequenzen, sowie die aus der \er- 10 
wendung erhaltlichen Gen-Fragmente. 

Die erfindungsgemaBen Nukleinsaure-Sequenzen konnen mit geeigneten Vektoren in Wirtszellen gebracht werden, in 
denen als heterologer Teil die auf den Nukleinsaure-Fragmenten enthaltene genetischen Information befindet, die expri- 
miert wird. 

Die die Nukleinsaure-Fragmente enthaltenden Wirtszellen sind ebenfalls Gegenstand der vorliegenden Erfindung. 15 

Geeignete Wirtszellen sind z. B. prokaryontische Zellsysteme wie E. coli oder eukaryontische Zellsysteme wie tieri- 
sche oder humane Zellen oder Hefen. 

Die erfindungsgemaBen Nukleinsaure-Sequenzen konnen in sense oder antisense Form verwendet werden. 

Die Herstellung der Polypeptide oder deren Fragment erfolgt durch Kultivierung der Wirtszellen gemaB gangiger Kul- 
tivierungsmethoden und anschlieBender Isolierung und Aufreinigung der Peptide bzw. Fragmente, ebenfalls mittels gan- 20 
giger Verfahren. Die Erfindung betrifft femer Nukleinsaure-Sequenzen, die mindestens eine Teilsequenz eines biolo- 
gisch aktiven Polypeptids kodieren. 

Ferner betrifft die vorliegende Erfindung Polypeptid-Teilsequenzen, sogenannte ORF (open-reading-frame)-Peptide, 
gemaB den Sequenzprotokollen ORF ID Nos. 51-106. 

Die Erfindung betrifTt femer die Polypeptid-Sequenzen, die mindestens eine 80%ige Homologie, insbesondere eine 25 
90%ige Homologie zu den erfindungsgemaBen Polypeptid-Teilsequenzen der ORF. ID Nos. 51-106 aufweisen. 

Die Erfindung betrifft auch Antikorper, die gegen ein Polypeptid oder Fragment davon gerichtete sind, welche von den 
erfindungsgemaBen Nukleinsauren der Sequenzen Seq. ID No. 1 bis Seq. ID 50 kodiert werden. 

Unter Antikorper sind insbesondere monoklonale und Phage-Display- Antikorper zu verstehen. 

Die erfindungsgemaBen Polypeptide der Sequenzen ORF ID Nos. 51-106 konnen auch als Tool zum Auffinden von 30 
Wirkstoffen gegen den Blasentumor verwendet werden, was ebenfalls Gegenstand der vorliegenden Erfindung ist. 

Ebenfalls Gegenstand der vorliegenden Erfindung ist die Verwendung der Nukleinsaure-Sequenzen gemaB den Se- 
quenzen Seq. ID No. 1-50 zur Expression von Polypeptiden, die als Tools zum Auffinden von WirkstofTen gegen den 
Blasentumor verwendet werden konnen. 

Die Erfindung betrifft auch die Verwendung der gefundenen Polypeptid-Teilsequenzen ORF. ID No. 51-106 als Arz- 35 
neimittei in der Gentherapie zur Behandlung gegen den Blasentumor, bzw. zur Herstellung eines Arzneimittels zur Be- 
handlung gegen den Blasentumor. 

Die Erfindung betriffl auch Arzneimittel, die mindestens eine Polypeptid-Teilsequenz ORF. ID No. 51-106 enthalten. 

Die gefundenen erfindungsgemaBen Nukleinsaure-Sequenzen konnen auch genomische oder mRNA-Sequenzen sein. 

Die Erfindung betrifft auch genomische Gene, ihre Exon- und Intronstruktur und deren SpleiBvarianten, erbaltlich aus 40 
den cDNAs der Sequenzen Seq. ID No. 1-50, sowie deren Verwendung zusammen mit geeigneten regulativen Elemen- 
ten, wie geeigneten Promotoren und/oder Enhancem. 

Mit den erfindungsgemaBen Nukleinsauren (cDNA-Sequenzen) Seq. ID No. 1-50 werden genomische BAC-, PAC- 
und Cosmid-Bibliotheken gescreent und uber komplementare Basenpaarung (Hybridisierung) spezifisch humane Klone 
isoliert. Die so isolierten BAC-, PAC- und Cosmid-Klone werden mit Hilfe der Ruoreszenz-in-situ-Hybridisation auf 45 
Metaphasenchromosomen hybridisiert und entsprechende Chromosomenabschnitte identifiziert, auf denen die entspre- 
chenden genomischen Gene liegen. BAG-, PAC- und Cosmid-Klone werden sequenziert, um die entsprechenden genom- 
ischen Gene in ihrer vollstandigen Struktur (Promotoren, Enhancer, Silencer, Exons und Introns) aufzuklaren. BAC-, 
PAC- und Cosmid-Klone konnen als eigenstandige Molekule fur den Gentransfer eingesetzt werden (s. Fig. 5). 

Die Erfindung betrifft auch BAC-, PAC- und Cosmid-Klone, enthaltend funktionelle Gene und ihre chromosomale 50 
Lokalisation, entsprechend den Sequenzen Seq. ID. No. 1 bis Seq. ID No. 50, zur Verwendung als Vehikel zum Gentrans- 
fer. 

Bedeutungen von Fachbegriffen und Abkiirzungen 

55 

Nukleinsauren = Unter Nukleinsauren sind in der vorliegenden Erfindung zu verstehen: mRNA, partielle cDNA, vollan- 
gen cDNA und genomische Gene (Chromosomen). 

ORF = Open Reading Frame, eine definierte Abfolge von Aminosauren, die von der cDNA-Sequenz abgeleitet werden 
kann. 

Contig = eine Menge von DNA-Sequenzen, die aufgrund sehr groBer Ahnlichkeiten zu einer Sequenz zusammengefaBt 60 
werden konnen (Consensus) 

Singleton = ein Contig, der nur eine Sequenz enthalt. 

Erklarung zu den Alignmentparametem 



minimal initial match = minimaler anfanglicher Identitatsbereich 
maximum pads per read = maximale Anzahl von Insertionen 
maximum percent mismatch = maximale Abweichung in %. 



65 
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Erklarung der Abbildungen 

Fig. 1 zeigt die systemalische Gen-Suche in der Incyle LifeSeq Datenbank. 
Fig. 2a zeigt das Prinzip der EST-Assemblierung 
5 Fig. 2bl-2b4 zeigt das gesamte Prinzip der EST-Assemblierung 

Fig. 3 zeigt die in silico Subtraktion der Genexpression in verschiedenen Geweben 

Fig. 4a zeigt die Bestimmung der gewebsspezifischen Expression uber elektronischen Northern. 

Fig. 4b zeigt den elektronischen Northern 

Fig. 5 zeigt die Isolierung von genomischen BAC- und PAC-Klonen. 
to Die nachfolgenden Beispiele erlautem die Herstellung der erfindungsgcmaBen Nukleinsaure-Sequenzen, ohne die Er- 
findung auf diese Beispiele und Nukleinsaure-Sequenzen zu beschranken. 

. Beispiel 1 

15 Suche nach Tumor-bezogenen Kandidatengenen 

Zuerst wurden samtliche ESTs des entsprechenden Gewebes aus der LifeSeq-Datenbank (vom Oktober 1997) extra- 
hiert. Diese wurden dann mittels des Programms GAP4 des Staden-Pakets mit den Parametern 0% mismatch, 8 pads per 
read und einem minimalen match von 20 assembliert Die nicht in die GAP4-Datenbank aufgenommenen Sequenzen 

20 (Fails) wurden erst bei 1 % mismatch und dann nochmals bei 2% mismatch mit der Datenbank assembliert. Aus den Con- 
tigs der Datenbank, die aus mehr als einer Sequenz bestanden, wurden Consensussequenzen errechnet Die Singletons 
der Datenbank, die nur aus einer Sequenz bestanden, wurden mit den nicht in die GAP4-Datenbank aufgenommenen Se- 
quenzen bei 2% mismatch emeut assembliert Wiederum wurden fur die Contigs die Consensussequenzen ermittelt. Alle 
ubrigen ESTs wurden bei 4% mismatch erneut assembliert Die Consensussequenzen wurden abermals extrahiert und 

25 mit den vorherigen Consensussequenzen sowie den Singletons und den nicht in die Datenbank aufgenommenen Sequen- 
zen abschlieBcnd bei 4% mismatch assembliert Die Consensussequenzen wurden gebildet und mit den Singletons und 
Fails als Ausgangsbasis fur die Gewebsvergleiche verwendet Durch diese Prozedur konnte sichergestellt werden, daB 
unter den verwendeten Parametern samtliche Sequenzen von einander unabhangige Genbereiche darstellten. 
Fig. 2bl-2b4 veranschaulicht die Verlangerung der Blasengewebe ESTs. 

30 Die so assemblierten Sequenzen der jeweiligen Gewebe wurden anschlieBend mittels des gleichen Programms mitein- 
ander verglichen (Fig. 3). Hierzu wurden erst alle Sequenzen des ersten Gewebes in die Datenbank eingegeben. (Daher 
war es wichtig, daB diese voneinander unabhangig waren.) 

Dann wurden alle Sequenzen des zweiten Gewebes mit alien des ersten verglichen. Das Ergebnis waren Sequenzen, 
die fur das erste bzw. das zweite Gewebe spezifisch waren, sowie welche, die in beiden vorkamen. Bei Letzteren wurde 

35 das Verhaltnis der Haufigkeit des Vorkommens in den jeweiligen Geweben ausgewertet Samtliche, die Auswertung der 
assemblierten Sequenzen betreff enden Programme, wurden selbst entwickelt 

Alle Sequenzen, die mehr als viermal in jeweils einem der verglichenen Gewebe vorkamen, sowie alle, die mindestens 
funfmal so haufig in einem der beiden Gewebe vorkamen wurden weiter untersucht Diese Sequenzen wurden einem 
elektronischen Northern (s. Beispiel 2.1) unterzogen, wodurch die Verteilung in samtlichen Tumor- und Normal-Gewe- 

40 ben untersucht wurde (s. Fig. 4a und Fig. 4b). Die relevanten Kandidaten wurden dann mit Hilfe samtlicher Incyte ESTs 
und alien ESTs dffentlicher Datenbanken verlangert (s. Beispiel 3). AnschheBend wurden die Sequenzen und ihre Uber- 
setzung in mogliche Proteine mit alien Nukleotid- und Proteindatenbanken verglichen, sowie auf mogliche, fur Proteine 
kodierende Regionen untersucht 

45 Beispiel 2 

Algorithms zur Identifikation und Verlangerung von partiellen cDNA-Sequenzen mit verandertem Expressionsmuster 

^ Im folgenden soli ein Algorithmus zur Auffindung uber- oder unterexprimierter Gene erlautert werden. Die einzelnen 
50 Schritte sind der besseren Ubersicht halber auch in einem HuBdiagramm zusammengefaBt (s. Fig. 4b). 

2.1 Elektronischer Northern-Blot 

Zu einer partiellen DNA-Sequenz S, z. B. einem einzelnen EST oder einem Contig von ESTs, werden mittels eines 
55 Standardprogramms zur Homolgiesuche, z. B. BLAST (Altschul, S. F., Gish W., Miller, W., Myers, E. W und Lipman 

D. J. (1990) J. Mol. Biol, 215, 403-410), BLAST2 (Altschul, S. E, Madden, T. L., Schaffer, A. A., Zhang, J., Zhang, Z.' 

Miller, W. und Lipman, D. J. (1997) Nucleic Acids Research 25 3389-3402) oder FASTA (Pearson, W. R. und Lipman! 

D. J. (1988) Proc. Natl. Acad. Sci. USA 85 2444-2448), die homologen Sequenzen in verschiedenen nach Geweben ge- 

ordneten (privaten oder offentlichen) EST-Bibliotheken bestimmt. Die dadurch ermittelten (relativen oder absoluten) 
60 Gewebe-spezifischen Vorkommenshaufigkeiten dieser Partial-Sequenz S werden als elektronischer Northern-Blot be- 

zeichnet 

2.1.1 

65 Analog der unter 2.1 beschriebenen Verfahrensweise wurde die Sequenz Seq. ID No. 16 gefunden, die 17,7.x starker 
im normalen Blasentumorgewebe als im normalem Blasengewebe vorkommt 
Das Ergebnis ist wie folgt: 
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Elektronischer Northern fur SEQ. ID. NO: 16 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit %Haeufigkeit N/T 


T/N 


Blase 


0.0039 


0.0690 


0.0565 17.6998 


Brust 


0.0000 


0.0000 


undef 


undef 


Duenndarra 


0.0031 


*0.0000 


undef 


0.0000 


Eierstock 


0.0180 


0.0078 


2.3025 0.4343 


Endokr ine s_Gewebe 


0.0000 


0.0000 


undef 


undef 


Gastrointestinal 


0.0000 


0.0000 


undef 


undef 


Gehirn 


0.0000 


0.0000 


undef 


undef 


Haematopoetisch 


0.0000 


0.0000 


undef 


undef 


Haut 


0.0000 


0.0000 


undef 


undef 


Hepatisch 


0.0000 


0.0000 


undef 


undef 


Her2 


0.0000 


0.0000 


undef 


undef 


Hoden 


0.0000 


0.0000 


undef 


undef 


Lunge 


0.0000 


0.0000 


undef 


undef 


Magen-Speiseroehre 


0.0000 


0.0000 


undef 


undef 


Muskel-Skelett 


0.0000 


0.0000 


undef* 


undef 


Niere 


0.0000 


0.0000 


undef 


undef 


Pankreas 


0.0000 


0.0000 


undef 


undef 


Penis 


0.0000 


0.0000 


undef 


undef 


Prostata 


0.0065 


0.0064 


1.0236 


0.9769 


Dte rus_Endometrium 


0.0000 


0.0000 


undef 


undef 


Uterus_Myometrium 


0.0000 


0.0000 


undef 


undef 


Uterus_allgemein 


0.0000 


o.opco 


undef 


undef 


Brust-Hyperplasie 


0.0000 









Prostata-Hyperplasie 0.0178 
Samenblase 0.0000 
Sinnesorgane 0.0000 
Weisse_Blutkoerperchen 0.0000 
Zervix 0.0000 



10 



15 



20 



25 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit 

0.0000 

0.0028 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 



30 



35 



40 



Brust 
Eierstock_n 
Eierstock_t 
EndokrinesjGewebe 
Foetal 
. Gastrointestinal 
. Haematopoetisch 
Haut-Muskel 
Hoden 



NORMIERTE/SUBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.0000 

0.0000 

0.0000 

0.0000 

0.0035 

0.0000 

0.0000 

0.0000 

0.0000 



Lunge 0.0000 
Nerven 0.0000 
Prostata 0.0068 
Sinnesorgane 0.0000 
Uterus n 0.0000 



45 



50 



55 



60 



In analoger Verfahrensweise wurden auch folgende Northerns gefunderi: 
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Elektronischer Northern fur SEQ. ID. NO: 8 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



Blase 
Brust 
Duenndarm 
Eierstock 
EndokrinesjGewebe 
Gastrointestinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus_Endometrium 
Oterus_Myomet rium 
Uterus__allgemein 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
Weisse_Blutkoerperchen 
Zervix 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
He rz-Blutge fa esse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



Brust 
Eierstock_n 
Eierstock_t 
EndokrinesjGewebe 
Foetal 
* Gastrointestinal 
' Haematopoetisch 
Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
" Sinnesorgane 
Uterus n 



NORMAL 
%Haeufigkeit 
0.0000 
0.0038 
0.0399 
0.0000 
0.0000 
0.0479 
.0000 
.0227 
.0037 
.0000 
.0000 
.0000 
0.0145 
0.0000 
0.0120 
0.0000 
0.0066 
0.0000 
0.0022 
0.0000 
0.0000 
.0000 
.0128 
.0030 
.0000 
.0000 
0.0009 
0.0000 



TUMOR 

%Haeufigkeit 

0.0281 

0.0056 

0.0000 

0.0078 

0.0000 

0.0000 

0-0010 

0.0000 

0.0000 

0.0323 

0.0000 

0.0585 

0.0123 

0.0000 

0.0120 

0.0274 

0.0U0 

0000 

0021 

00C0 

ccoo • - 

0000 



Verhaeltnisse 
N/T T/N 
0.0000 undef 
0.6805 1.4694 
undef 0.0000 
0.0000 undef 
undef undef 
undef 0.0000 
0.0000 undef 
.undef 0.0000 
undef 0.0000 
0.0000 undef 
undef undef 
0.0000 undef 
1.1854 0.8436 
undef undef 
0.9994 1.0006 
0.0000 undef 
0.5983 1.6714 
undef undef 
1.0236 0.9769 
undef undef. " 
undef "under 
undef undef 



FOETUS 

%Haeufigkeit 

0.0000 

0.0000 

0.0000 

0.0039 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 



NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0/0000 

0.0000 

0.0000 

o.oooo ; 

0.0000 
0.0000 
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Blase 
Brust 
Duenndarm 
Eierstock 
Endokrines_Gewebe 
Gastrointestinal 
Gehim 
Haematopoetisch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
' Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus_ Endometrium 
Uterus_Myometrium 
Oterus_allgemein 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
Weisse_Blutkoerperchen 
Zervix 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



Brust 
Eierstockja 
Eierstock^_t 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 
- "- Hau t-Mus ke 1 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Uterus n 



Elektronischer Northern fur SEQ. ID. NO: 2 



NORMAL TUMOR Verhaeltnisse 

%Haeufigkeit %Haeuf igkeit N/T T/N 



0,0000 


0.0307 


0.0000 undef 


0.0307 


0.0376 


. 0.8166 1.2245 


0.0337 


0.0165 


2.0391 0.4904 


0.0120 


0.0364 


0.3289 3.0402 


0.0255 


0.0075 


3.3962 0.2944 


0.0153 


0.0185 


0.8283 1.2072 


0.0185 


0.0216 


0.8571 1.1667 


0.0201 


0.0379 


0.5293 1.8892 


0.0844 


0.0000 


undef 0.0000 


0.0238 


0.0065 


3.6765 0.2720 


0.0148 


0.0000 


undef 0.0000 


0.0575 


0.0351 


1.6399 0.6098 


0.0145 


0.0082 


1.7781 0.5624 


0.0387 


0.0077 


.5.0421 0.1983 


0.0308 


0.0300 


1.0280 0.9728 


0.0217 


0.0000 


undef 0.0000 


0.0099 


0.0110 


0.8974 1.1143 


0.0240 


0.0000 


undef 0.0000 


0.0262 


0.0213 


1.2284 0.8141 


0.0135 


0.0000 


undef 0.0000 


0.0152 


0.0408 


0.3741 2.6732 


0.0204 


0.0954 


0.2135 4.6839 



5 



10 



15 



20 



25 



0.0512 
0.0268 
0.0089 
0.0235 

0.0286 30 
0.0106 



FOETUS 

%Haeufigkeit- 35 

0.0278 

0.0305 

0.0063 

0.0157 

0.0000 40 

0.0000 

0.0213 

0.0289 

0.0000 

0.0185 45 

0.0121 

0.0000 

0.0126 



50 

NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.0204 

0.0000 

0.0051 55 

0.0000 

0.0122 

0.0488 

0.0000 

0.0000 

0.0463 

0.0164 

0.0100 

0.0137 

0.0000 

0.0125 65 
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DE 198 18 619 A 1 

Elektronischer Northern fur SEQ. ID. NO: 3 



10 



15 



20 



25 



30 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T 


T/N 


Blase 


0.0000 


0.0256 


0.0000 undef 


Brust 


0.0000 


0.0000 


undef 


undef 


Duenndarm 


0.0000 


0.0000 


undef 


undef 


Eierstock 


0.0000 


0.0000 


undef 


undef 


Endokrines_Gewebe 


0.0000 


0.0000 


undef 


undef 


Gastrointestinal 


0-0000 


0.0000 


undef 


undef 


Gehirn 


0.0000 


0.0000 


undef 


undef 


Haematopoetisch 


0.0000 


0.0000 


undef 


undef 


Haut 


0.0000 


0.0000 


undef 


undef 


Hepatisch 


0.0000 


0.0000 


undef 


undef 


Herz 


0.0000 


0.0000 


undef 


undef 


Hoden 


0.0000 


0.0000 


undef 


undef 


Lunge 


0.0000 


0.0000 


undef 


undef 


Magen-Speiseroehre 


0.0000 


0.0000 


undef 


undef 


' Muskel-Skelett 


0.0000 


0.0000 


undef 


undef 


Niere 


0.0000 


0.0000 


undef 


undef 


Pankreas 


0.0000 


0.0000 


undef 


undef 


Penis 


0.0030 


0.0000 


undef 


0.0000 


Prostata 


0.0000 


0.0000 


undef 


undef 


UterusJEndometrium 


0.0000 


0.0000 


undef 


undef 


Uterus Myometrium 


0.0000 


0.0000 


undef 


undef 


Dterus_allgemein 


0.0000 


0.0000 


undef 


undef 


Brust-Hyperplasie 


0.0000 








Prostata-Hyperplasie 


0.0000 








Samenblase 


0.0000 








Sinnesorgane 


0.0000 








WeisseJBlutkoerperchen 


0.0000 








Zervix 


0.0000 









35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000. 



50 



55 



60 



65 



Brust 
Eierstock_n 
EierstockM: 
Endokrines_Gewebe 
. Foetal 
Gastrointestinal 
Haematopoetisch 
Haut-Muskel 
Hoden 
Lunge 
N erven 
Prostata 
Sinnesorgane 
Uterus n 



NORMIERTE/SUBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 
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DE 198 18 619 A 1 







Elektronischer Northern fur 


SEQ. ID. NO; 


:4 




NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit %Haeufigkeit N/T 


T/N 


Blase 


0. 


0000 


0. 


0230 


0.0000 


undef 


Brust 


0. 


0090 


0. 


0094 


0.9527 


1.0496 


Duenndarm 


0. 


0123 


0- 


0165 


0.7415 


1.3487 


Eier stock 


0. 


0150 


0. 


0104 


1.4391 


0.6949 


Endokrines_Gewebe 


0. 


0068 


0. 


0125 


0.5434 


1.8403 


Gastrointestinal 


0. 


0038 


. o. 


0046 


0.8283 


1.2072 


Gehirn 


0. 


0044 


0. 


0123 


0.3600 


2.7779 


Haema t opoe t is ch 


0. 


0187 


0. 


0000 


undef 


0.0000 


Haut 


0. 


0000 


0. 


0000 


undef 


undef 


Hepatisch 


0. 


0000 


0. 


0388 


0.0000 


undef 


Herz 


0. 


0138 


0. 


0000 


undef 


0.0000 


Hoden 


0. 


0173 


0. 


0000 


undef 


0.0000 


Lunge 


0. 


0093 


0. 


0143 


0.6532 


1.5310 


Magen-Speiseroehre 


0. 


0097 


0. 


0000 


.undef 


0.0000 


Muskel-Skelett 


0. 


0017 


0. 


0180 


0.0952 


10.5060 


Niere 


0. 


0136 


0. 


0137 


0.9913 


1.0088 


Pankreas 


0. 


0149 


0, 


0055 


2.6923 


0.3714 


Penis 


0. 


0150 


0. 


0000 


undef 


0.0000 


Prostata 


0. 


0065 


0. 


0064 


1.0236 


0.9769 


Uterus_Endometrium 


0. 


0000 


0. 


0000 


undef 


undef 


Uterus_Myometrium 


0. 


0000 


0. 


0136 


0.0000 


undef 


Dterus_allgemein 


0. 


0000 


0. 


0000 


undef 


undef 


Brust-Hyperplasie 


0. 


0032 











Prostata-Hyperplasie 0.0178 

Samenblase 0.0089 

Sinnesorgane 0.0000 

Weisse_ Blutkoerperchen 0 . 0035 

Zervix 0.0000 



10 



15 



20 



25 



30 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haeraatopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit* 
0.0000 
0.0167 
0.0000 
0.0354 
.0000 
.0000 
.0000 
.0072 
0.0000 
0.0062 
0.0000 
0.0000 
0.0000. 



35 



40 



45 



NORMIERTE / S DBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 

Brust 0.0272 

Eierstock_n 0.0000 

EierstoclcJ: 0.0051 

EndokrinesjGewebe 0.0000 

Foetal 0.0151 

Gastrointestinal 0.0000 

Haematopoetisch 0.0114 

Haut-Muskel 0.0259 

Hoden 0.0077 

Lunge 0.0000 

Nerven 0.0060 

Prostata 0.0068 

Sinnesorgane 0.0155 

Uterus n 0.0250 



50 



55 



60 



65 
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DE 198 18 619 A 1 

Bektronischer Northern fur SEQ. ID. NO: 5 



10 



15 



20 



25 



30 



Blase 
Brust 
Duenndann 
Eierstock 
Endokr ine s__Gewebe 
Gastrointestinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus_Endometrium 
Uterus_Myometrium 
Uterus__ allgemein 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
Weisse_Blutkoerperchen 
Zervix 



NORMAL 
%Haeufigkeit 
0.0000 
0.0026 
0.0031 
0.0060 
0.0051 
0.0134 
0.0052 
0.0000 
0.0000 
0.0000 
0.0032 
0.0000 
0.0031 
0.0000 
0.0034 
0.0190 
0.0017 
0.0000 
0.0131 
0.0068 
0.0000 
0.0000 
0.0000 
0059 
0000 
0000 
0000 



TUMOR 

%Haeufigkeit 

0.0204 

0.0056 

0.0000 

0.0156 

0.0000 

0.0093 

0.0051 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0020 

0.0000 

0.0060 

0.0068 

0.0000 

0.0000 

0.0043 

0.0000 

0.0000 

0.0000 



Verhaeltnisse 
N/T T/N 
0.0000 undef 
0.4537 2.2042 
undef 0.0000 
0.3838 2.6058 
undef 0.0000 
1.4496 0.6898 
1.0079 0.9921 
undef undef 
undef undef 
undef undef 
undef 0.0000 
undef undef 
1.5241 0.6561 
undef undef 
0.5711 1.7510 
2.7756 0.3603 
undef 0.0000 
undef undef 
3.0709 0.3256 
undef 0.0000 
undef undef 
undef undef 



0.0000 



35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit' 

0.0000 

0.0083 

0.0063 

0.0079 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000- 



50 



55 



60 



65 



Brust 
Eierstock_n 
Eierstock_t 
Endokr ines_Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 
— ' Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Uterus n 



NORMIERTE / SUBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.0000 

0.0000 

0.0051 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0082 

0.0050 

0.0137 

0.0000 

0.0000 
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DE 198 18 619 A 1 



Elektronischer Northern fur SEQ. ID. NO: 6 





NORMAL 


T0MOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T T/N 


Blase 


0.0000 


0.0204 


0.0000 undef 


Brust 


0.0038 


0.0075 


0.5104 1.9593 


Duenndarm 


0.0031 


0.0000 


■undef 0.0000 


Eierstock 


0.0060 


0.0052 


1.1513 0.8686 


Endokrines_Gewebe 


0.0017 


0.0000 


undef 0.0000 


Gastrointestinal 


0.0038 


0.0000 


undef 0.0000 


Gehirn 


0.0022 


0.0021 


1.0799 0.9260 


Haematopoetisch 


0.0053 


0.0000 


undef 0.0000 


Haut 


0.0000 


0.0000 


undef undef 


Hepatisch 


0.0048 


0.0000 


undef 0.0000 


Herz 


0.0042 


0.0137 


0.3084 3.2426 


Hoden 


0.0000 


0.0000 


undef undef 


Lunge 


0.0052 


0.0143 


0.3629 2.7557 


Magen-Speis eroehr e 


0.0000 


0.0000 


undef undef 


Muskel-Skelett 


0.0017 


0.0060 


.0.2856 3.5020 


Niere 


0.0000 


0.0000 


undef undef 


Pankreas 


0.0017 


0.0110 


0.1496 6.6857 


Penis 


0.0000 


0.0000 


undef undef 


Prostata 


0.0000 


0.0021 


0.0000 undef 


Uterus_Endometrium 


0.0135 


0.0000 


undef 0.0000 


Uterus_Myometrium 


0.0229 


0.0068 


3.3668 0.2970 


Uterus_allgemein 


0.0204 


0.0000 


undef 0.0000 


Brust-Hyperplasie 


0.0000 







Prostata-Hyperplasie 0.0000 

Samenblase 0.0000 

Sinnesorgane 0 . 0353 

Weisse_Blutkoerperchen 0.0009 

Zervix 0.0000 



FOETUS 

%Haeufigkeit 

Entwicklung 0.0000 

Gastrointenstinal 0.0000 

Gehirn 0.0000 

Haematopoetisch 0 . 0118 

Haut 0.0000 

Hepatisch 0.0000 

Herz-Blutgefaesse 0.0000 

Lunge 0.0036 

Nebenniere 0.0000 

Niere 0.0124 

Placenta 0.0242 

Prostata 0.0000 

Sinnesorgane 0.0000 



NORMIERTE/SOBTRAHIERTE BIBLIOTHEKEN 
%Haeuf igkeit ** 

Brust 0.0000 

Eierstock_n 0.0000 

Eierstock_t 0.0000 

Endokrines_Gewebe 0.0245 

Foetal 0.0151 

Gastrointestinal 0.0000 

Haematopoetisch 0.0000 

Haut-Muskel 0.0259 

Hoden 0.0000 

Lunge 0.0164 

Nerven 0.0080 

Prostata 0.0000 

Sinnesorgane 0.0000 

Uterus n 0.0250 
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DE 198 18 619 A 1 



Elektronischer Northern fur SEQ. ID. NO: 7 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeuf iakeit 


%Haeufigkeit N/T T/N 


Blase 


0.0000 


0.0179 


0. 0000 tmrtef 


Brust 


0.0038 


0.0132 


0 2917 3 45>ft7 


Duenndarm 


0.0061 


0.0000 


undef 0 0000 


Eierstock 


0.0060 


0.0052 


1.1513 0.8686 


Endokrines__Gewebe 


0.0017 


0. 0100 


0.1698 5 8889 


Gastrointestinal 


0.0019 


0.0370 


0.0518 19 3158 


Gehirn 


0.0096 


0.0051 1 




Haematopoetis ch 


0.0067 


0 0000 


nnHof ft flftftft 


Haut 


0.0000 


0 0000 


<•LUU.tr J. UIJUc X. 


Hepatisch 


0. 0048 


n ooks 


ft 1 ^Cftft 


Herz 


0.0138 


0. 0137 


1 ftftO** ft GQ*7"7 


Hoden 


0.0288 






Lunge 


0 0031 




n om a coon 


Ma gen— Spei s e r oehr e 


0.0387 


0. 0153 




* Muskel-Skelett 


0. 0017 


0.0000 


undef 0.0000 


Niere 


0.0054 


0.0000 


undef 0.0000 


Pankreas 


0.0050 


0.0055 


0.8974 1.1143 


Penis 


0.0000 


0.0000 


undef undef 


Prostata 


0.0000 


0.0021 


0.0000 undef 


Uterus_Endometrium 


0.0000 


0.0000 


undef undef 


Uterus_Myametrium 


0.0000 


0.0000 


undef undef 


Uterus_allgemein 


0.0102 


0.0000 


undef 0.0000 


Brus t-Hyperplas ie 


0.0000 







Prostata-Hyperplasie 0.0000 

Samenblase 0.0000 

Sinnesorgane 0.0000 

Weisse_Blutkoerperchen 0 . 0026 

Zervix 0.0000 



FOETUS 

%Haeufigkeit 
Entwicklung 0.0000 
Gastrointenstinal 0.0028 
Gehirn 0.0751 
Haematopoetis ch 0.0079 
Haut 0.0000 
Hepatisch 0.0000 
Herz-Blutgefaesse 0.0036 
Lunge 0.0036 
Nebenniere 0.0000 
Niere 0.0000 
Placenta 0.0061 
Prostata 0.0499 
Sinnesorgane 0.0000. 



NORMIERTE/SOBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0068 
Eierstock_n 0.0000 
EierstockM: 0.0000 
Endokrines__Gewebe 0.0000 
Foetal 0.0087 
Gastrointestinal 0.0122 
Haematopoetisch 0.0114 
Haut-Muskel 0.0097 
Hoden 0.0540 
Lunge 0.0082 
Nerven 0.0201 
Prostata 0.0205 
Sinnesorgane 0 . 0000 
Uterus n 0.0375 



12 



Blase 
Brust 
Duenndarm 
Eierstock 
Endokrines_Gewebe 
Gastrointestinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus_Endometrium 
Uterus_Myoiaetrium 
Uterus_allgemein 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
Weisse_Blutkoerperchen 
Zervix 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



Brust 
Eierstock n 
Eierstock^t 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 
— Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Uterus n 



DE 198 18 619 A 1 

Elektronischer Northern fur SEQ. ID. NO: 8 



NORMAL 


TUMOR 


Verhaeltnisse 


%Haen£icrkeifc 


%Haeufigkeit N/T T/N 


0 0000 


0.0179 


0 0000 nnrfpf 


0. 0013 


0. 0038 


0 ^40*3 2 Q^fiQ 


0 0123 


0 0000 


miuci u.uuw 


0 0060 


0 0078 


0 7675 1 3029 


0.0068 


0 0100 


0 67Q2 1 4722 


0.0038 


0. 0093 


0 4142 2 4145 


0.0044 


0.0144 


0 3086 3 2409 


0.0080 


0 0000 


iir»rff»'f r o noon 


0.0037 


0 0000 


und^'F 0 0000 


0.0000 


0.0000 


undef undef 


0.0021 


0 0000 


nndpf o onno 


0.0058 


0 0000 


iinflp -F A 0000 


0.0031 


0 0082 


0 3810 2 6245 


0.0000 


0.0307 


0.0000 undef 


0.0051 


0.0060 


0.8567 1.1673 


0.0000 


0.0000 


undef undef 


0.0066 


0.0000 


undef 0.0000 


0.0060 


0.0000 


undef 0.0000 


0.0087 


0.0064 


1.3648 0.7327 


0.0000 


0.0000 


undef undef 


0.0000 


0.0068 


0.0000 undef 


0.0051 


0.0000 


undef 0.0000 



0.0000 
0.0059 
0.0000 
0.0000 
0.0035 
0.0000 

FOETUS 

%Haeufigkeit 

0.0000 

0.0056 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0036 

0.0000 

0.0000 

0.0061 

0.0000 

0.0000: 



NORMIERTE /SUBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.0204 

0.0000 

0.0608 

0.0000 

0.0029 

0.0244 

0.0000 

0.0097 

0.0154 

0.0000 

0.0080 

0.0000 

0.0000 

0.0000 
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DE 198 18 619 A 1 

Hektronischer Northern fur SEQ. ID. NO: 9 



10 



15 



20 



25 



30 



35 



40 



45 





NORMAL 


T0M0R 


Verhaeltnisse 




%Haeuf igkeit 


%Haeufigkeit N/T T/N 


Blase 


0.0000 


0.0179 


0 0000 iirtH^^ 

U.UUUU LU.1UC i 


Brust 


0.0064 


n onsfi 

U • UUJD 




Duennda nn 


0.0000 


U • UIOJ 


n nnnn nn^af 
u. uuuu uuaei 


Eierstock 


0 0030 


0 009G 


l.XOJLO U.OOOD 


Endokrinf**j Gew^fa^ 


0.0153 


0 009-1 




Cast* T*f>"i n^^^t* inal 


0 0038 


ft ft ft ji £ 
U. UU4t> 


ft oooo 1 om*> 


Gehirn 


0 0015 


n nn*ii 
U. UUjX 


ft OQOft Q iITl>>l 

U. ZOoU 4 /24 


IlctdUci UUpUc Llo CXI 


n nn£*7 


ft nnnn 
U. UUUU 


ft ftftftft 

unaer u.uUUO 


Haut 


0 00*^7 
u . uu j / 


n np^7 

u • UO-i / 






o nnoo 

U * UUUU 


n nn£R 

u . UUDO 


ft ftftftft V-l-VX jr^ #-t 

u.uuuu unaer 


Herz 


0 00S3 

U . VUJJ 


ft nnnn 
U • UUUU 


-3 ^7 /\ ftftftft 

unaer u.uuuu 


Hoden 


0 0230 


ft ft'ysA 
U* U^j^ 


u.yttjy jl.uido 


Lunge 


0 0049 


a nn^i 

U. UU41 


1 Aid ft QOii 0 




0 0000 


ft 0077 
u . uu / / 


n nnnn nnriaf 

U • UUUU UIlUcI 


" Muskel— Skelefcfc 


0 0034 


ft nnnn 

U • UUUU 


im^A^ n nnnn 
unQci u.uuuu 


Niere 


0.0054 


n nnco 

U . UUDO 


ft 7Q7ft 1 OC1 ft 

u. /you j. . zoiu 




0.0017 


n nnnn 

u * UUUU 


nn/laf ft ftftftft 

unaei u.uuuu 


Penis 


0 00*^0 


n n^d 
u • u^o / 


n ii 9"^ p Qn^*^ 


r^uo l.u La 


0 006S 


0.0064 


1.0236 0.9769 


Otenis Endometrium 


0.0135 


n nnnn 
U . UUUU 


unaer U.UUUU 


Oterus_Myometrium 


0.0152 


0.0000 


undef 0.0000 


UlclUS al J.u.GJlle.LIl 


n nnnn 

u • UUUU 


0. 0000 


undef undef 


Brust-Hyperplasie 


0.0032 






xriOST.ata~uypeipia.5ie 


u . uuuu 








n nnno 

u • uuuu 








n nnno 
u . uuuu 








n nn^ ^ 






Zervix 


n nnnn 








r U£il UO 














Hill LWllyA,! Lilly 


o nooo 

u . uuuu 






baStruinLcnoLlIlal 


n nnnn 

U • UUUU 






Gehiim 


ft ftftftft 
U • UUUU 






Haematopoetisch 


n nnoo 






naiit. 


n nnnn 






Hepat i s ch 


0.0000 






Herz-Blutgefaesse 


0.0036 






Lunge 


0.0108 






Nebenniere 


0.0000 






Niere 


0.0062 






Placenta 


0.0000 






Prostata 


0.0499 






Sinnesorgane 


0.0000 







50 



55 



60 



65 



Brust 
Eierstock_n 
Eierstock^_t 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 0.0114 
Haut-Muskel 0.0194 
Hoden 0.0309 
Lunge 0.0000 
Nerven 0.0040 
Prostata 0.0000 
Sinnesorgane 0.0000 
Dterus n 0.0125 



NORMIERTE/ SDBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
0.0068 
0.0000 
0.0101 
0.0000 
0.0047 
0.0000 



14 



DE 198 18 619 A 1 



Elektronischer Northern fur SEQ. ID. NO: 10 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeuf igkeifc 


%Haeufigkeit N/T 


T/N 


Blase 


0,0000 


0.0153 


0.0000 undef 


Brust 


0. 0051 


0.0019 


2.7221 0.3674 


Duenndarm 


0.0000 


0.0000 


undef 


undef 


Rierstonk 


0 0000 


0.0000 

w • w v W 


undef 


undef 




0 0000 


0.0000 


undef 


undef 


Ga <st rn intestinal 


0 0000 

V * w w 


0 009*5 


0.0000 undef 


Gehim 


0,0000 


0.0113 


0.0000 undef 


XiQ.cxug tupuc UAOUll 


o nooo 


0 0000 


undef 


undef 


Haut 


0 0000 

V . w w 


0 0000 


undef 


undef 


H#*nat"i ^f*h 

IlC^/O JL «P VU 


0 0000 


0. 0129 


0.0000 undef 


Herz 


0-0011 


0 0000 


undef 


0.0000 


Hoden 


0 0058 


0 0000 


undef 


0.0000 


Lung@ 


0, 0021 


0 . 0000 


undef 


0.0000 




0 0000 

V • w v W 


0.0000 


undef 


undef 


Muskel-Skelett 


0,0000 


0.0000 


undef 


undef 


Niere 


0.0000 


0.0000 


undef 


undef 


Fankreas 


0.0000 


0.0000 


undef 


undef 


Penis 


0.0000 


0.0000 


undef 


undef 


Prostata 


0.0000 


0.0000 


undef 


undef 


Dte rusJBndome t r ium 


0.0000 


0.0000 


undef 


undef 


Uterus_ Myomet r ium 


0.0000 


0.0000 


undef 


undef 


Uterus_allgemein 


0.0000 


0.0000 


undef 


undef 



Brust-Hyperplasie 0.0000 

Prostata-Hyperplasie 0 . 0000 

Samenblase 0.0000 

Sinnesorgane 0.0000 

Weisse_Blutkoerperchen 0.0000 

Zervix 0.0000 



FOETUS 

%Haeuf igkeit ' 
Entwicklung 0.0000 
Gastrointenstinal 0 . 0000 
Gehirn 0.0000 
Haematopoetisch 0.0000 
Haut 0.0000 
Hepatisch 0.0000 
Herz-Blutgefaesse 0.0036 
Lunge 0.0000 
Nebenniere 0.0000 
Niere 0.0000 
Placenta 0.0000 
Prostata 0.0000 
Sinnesorgane 0.0000. 



NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 
%Haeuf igkeit 
Brust 0.0000 
Eierstock_n 0.0000 
Eierstock^t 0.0000 : - 
Endokrines_Gewebe. 0.0000 
Foetal 0.0006 
Gastrointestinal 0.0000 
Haematopoetisch 0.0000 
— Haut-Muskel 0.0000 
Hoden 0.0000 
Lunge 0.0000 
Nerven 0.0010 
Prostata* 0.0000 
Sinnesorgane 0.0000 
Uterus n 0.0000 



15 



DE 198 18 619 A 1 

Elektronischer Northern fur SEQ. ID. NO: 11 



10 



15 



20 



25 



30 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeuf igkeit %Haeufigkeit N/T 


T/N 


Blase 


0. 


0000 


0. 0153 


0. 


0000 


undef 


Brust 


n 

\J » 


0038 




1. 


0208 


0.9796 


Duennda nn 


n 
u • 


nnnn 


n nnnn 


uncle f 


undef 


R i o rs t" or 1c 




0000 


0.0104 


0. 


0000 


undef 


Endokrines Gewebe 


0. 


0068 


0.0025 


2. 


7170 


0.3681 


Ga s t roint e s t inal 


n 
u • 


Uli 1 / 




0. 


8283 


1.2072 




n 
u • 


0000 




0. 


0000 


undef 


HaP7nat"ODOPt3 sf*h 






o nnnn 


undef 


0.0000 


Haut 


0 . 


0073 


0 000D 


undef 


0.0000 


Heoatisch 


0. 


0000 


0.0194 


0. 


0000 


undef 


Herz 


0. 


0021 


0 0000 


undef 


0.0000 


Hoden 


0. 


0000 


0 0000 


undef 


undef 


Lunge 


0. 


0010 


0. 0041 


0. 


2540 


3.9367 


Magen— Spei.se r oehr e 


0. 


0097 


0.0077 


.1. 


2605 


0.7933 


Muskel-Skelett 


0. 


0000 


0.0000 


undef 


undef 


Niere 


0. 


0054 


0.0068 


0. 


7930 


1.2610 


Pankreas 


0, 


0050 


0.0000 


undef 


0.0000 


Penis 


0. 


0000 


0.0267 


0. 


0000 


undef 


Prostata 


0. 


0065 


0.0298 


0. 


2193 


4.5590 


Uterus_Endometriuih 


0. 


0068 


0.0528 


0. 


1280 


7.8106 


Uterus__Myometrium 


0. 


0076 


0.0068 


1. 


1223 


0.8911 


Uterus_allgemein 


0. 


0000 


0.0000 


undef 


undef 


Brust-Hyperplasie 


0. 


0000 











Prostata-Hyperplasie 0 . 0238 

Samenblase 0.0000 

Sinnesorgane 0 . 0000 

Weiss e__ Blutkoerperchen 0 . 002 6 

Zervix 0.0106 



35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit" 
0.0139 
0.0194 
0.0063 
0.0000 
0.0000 
0.0260 
0071 
.0108 
,0000 
,0000 
0.0121 
0.0499 
0.0000. 



50 



55 



60 



65 



Brust 
EierstocK^n 
Eierstock^t 
Endo kr ine s_Ge webe 
Foetal 
Gastrointestinal 
Haematopoetisch 
Haut-Muskel 



NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.0000 

0.0000 

0.0557 ;; 

0.0000 

0.0076 

0.0000 

0.0000 

0.0000 



Hoden 0.0000 
Lunge 0.0000 
Nerven 0.0030 
Prostata 0.0137 
Sinnesorgane 0 . 0387 
Uterus n 0.0042 



16 



DE 198 18 619 A 1 



Elektronischer Northern fur SEQ. ID. NO: 12 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit 


N/T T/N 


Blase 


0.0000 


0.0153 


0.0000 undef 


Brust 


0.0013 


0.0000 


undef 0.0000 


Duenndarm 


0.0031 


0.0000 


undef 0.0000 


Eierstock 


0.0000 


0.0026 


0.0000 undef 


Endokrines_Gewebe 


0.0017 


0.0075 


0.2264 4.4166 


Gastrointestinal 


0.0000 


0.0093 


0.0000 undef 


Gehirn 


0.0044 


0.0000 


undef 0.0000 


Haematopoetisch 


0.0000 


0.0000 


undef undef 


Haut 


0.0037 


0.0000 


undef 0.0000 


Hepatisch 


0.0000 


0.0000 


undef undef 


Herz 


0.0000 


0.0000 


undef undef 


Hoden 


0.0000 


0.0117 


0.0000 undef 


Lunge 


0.0000 


0.0000 


undef undef 


Magen-Speiseroehre 


0.0000 


0.0000 


undef undef 


Muskel-Skelett 


o.oboo 


0.0000 


undef undef 


Niere 


0.0000 


0.0000 


undef undef 


Pankreas 


0.0017 


0.0000 


undef 0.0000 


Penis 


0.0000 


0.0000 


undef undef 


Prostata 


0.0000 


0.0021 


0.0000 undef 


Uterus Endometrium 


0.0000 


0.0000 


undef undef 


Uterus_Myometrium 0-0000 


0.0000 


undef undef 


Dt e rus_al lgemein 


0.0000 


0.0000 


undef undef 


Brust-Hyperplasie 


0.0032 







Pros tat a-Hyperplasie 0.0000 

Samenblase 0.0000 

Sinnesorgane 0.0000 

Weisse_Blutkoerperchen 0 . 0000 

Zervix 0.0000 



FOETUS 

%Haeufigkeit" 
Entwicklung 0.0000 
Gastrointenstinal 0 . 0000 
Gehirn 0.0125 
Haematopoetisch 0.0000 
Haut 0.0000 
Hepatisch 0.0000 
Herz-Blutgefaesse 0.0000 
Lunge 0.0036 
Nebenniere 0.0000 
Niere 0.0000 
Placenta 0.0000 
Prostata 0.0000 
Sinnesorgane 0.000Q: 



NORM IERTE / S UBTRAH I ERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0000 
Eierstockji 0.0000 
EierstockJ: 0.0000 > 
Endokrines_Gewebe 0.0245 
Foetal 0.0023 
Gastrointestinal 0.0000 
Haematopoetisch 0-0000 
Haut-Muskel 0.0000 
Hoden 0.0000 
Lunge 0.0082 
Nerven 0.0020 
Prostata 0.0000 
Sinnesorgane 0.0000 
Uterus n 0.0000 



17 



DE 198 18 619 A 1 

Elektronischer Northern fur SEQ. ID. NO: 13 



10 



15 



20 



25 



30 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeuf igkeit 


%Haeufigkeit N/T T/N 


Blase 


0.0000 


0.0153 


0.0000 undef 


Brust 


0.0090 


0.0169 


0.5293 1.8893 


Duenndarm 


0.0031 


0.0000 


undef 0.0000 


Eierstock 


0.0060 


0.0052 


1.1513 0.8686 


Endokrines_Gewebe 


0.0051 


0.0226 


0.2264 4.4166 


Gastrointestinal 


0.0057 


0.0231 


0.2485 4.0241 


Gehirn 


0.0052 


0.0082 


0.6300 1.5874 


Haematopoetisch 


0.0107 


0.0000 


undef 0.0000 


Haut 


0.0037 


0.0000 


undef 0.0000 


Hepatisch 


0.0000 


0.0000 


undef undef 


Herz 


0.0138 


0.0000 


undef 0.0000 


Hoden 


0.0345 


0.0117 


2.9518 0.3388 


Lunge 


0.0021 


0.0123 


0.1693 5.9051 


Magen-Speiseroehre 


0.0000 


0.0000 


undef undef 


Muskel-Skelett 


0.0051 


0.0060 


0.8567 1.1673 


Niere 


0.0163 


0.0068 


2.3791 0.4203 


Pankreas 


0.0000 


0.0055 


0.0000 undef 


Penis 


0.0030 


0.0267 


0.1123 8.9035 


Prostata 


0.0174 


0.0128 


1.3648 0.7327 


Uterus_Endometrium 


0.0068 


0.0000 


undef 0.0000 


Uterus_ Myometrium 


0.0000 


0.0000 


undef undef 


Uterus_allgemein 


0.0051 


0.0000 


undef 0 . 0000 


Brus t-Hyperplas ie 


0.0000 







Prostata-Hyperplasie 0.0149 

Samenblase 0.0267 

Sinnesorgane 0.0118 

Weiss e_Blutkoerperchen 0 . 0087 

Zervix 0.0000 



35 



40 



45 



50 



55 



60 



65 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeuf igkeit* 
0.0139 
0.0139 
0.0063 
0.0079 
0.0000 
0.0000 
0107 
0072 
0000 
0000 
0061 
0249 



0.0000. 



Brust 
Eierstock_n 
EierstocJc_t 
Endokrines_Gewebe 

Foetal 0^0146 

Gastrointestinal 0.0000 

Haematopoetisch 0.0057 

Haut-Muskel 0.0000 

Hoden 0.0309 

Lunge 0.0000 

Nerven 0.0010 

Prostata 0.0137 

Sinnesorgane 0 . 0000 

Uterus n 0.0083 



NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
0.0000 
0.0000 
0.0101 
0.0000 



18 



DE 198 18 619 Al 

Elektronischer Northern fur SEQ. ID. NO: 14 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T T/N 


Blase 


0.0000 


0.0153 


0.0000 undef 


Brust 


0.0051 


0.0395 


0.1296 7.7146 


Duenndann 


0.0000 


6.0000 


undef undef 


Eierstock 


0.0060 


0.0234 


0.2558 3.9088 * 


Endokrines_Gewebe 


0.0000 


0.0000 


undef undef 


Gastrointestinal 


0.0115 


0.0000 


undef 0.0000 


Gehirn 


0.0000 


0.0041 


0.0000 undef 


Haematopoetisch 


0.0027 


0.0000 


undef 0.0000 


Haut 


0.0037 


0.0000 


undef 0.0000 


Hepatisch 


0.0381 


0.0129 


2.9412 0.3400 


Herz 


0.0021 


0.0000 


undef 0.0000 


Hoden 


0.0000 


0.0234 


0.0000 undef 


Lunge 


0.0073 


0.0245 


0.2964 3.3743 


Magen-Speiseroehre 


0.0000 


0.0537 


0.0000 undef 


Muskel-Skelett 


0.0137 


0.0420 


0.3263 3.0643 


Niere 


0.0163 


0.0000 


undef 0.0000 


Pankreas 


0.0017 


0.0331 


0.0499 20.0570 


Penis 


0.0000 


0.0267 


0.0000 undef 


Prostata 


0.0022 


0.0128 


0.1706 5.8615 


Dterus_Endometriura 


0.0068 


0.0000 


undef 0.0000 


Uterus_Myometriura 


0.0305 


0.0136 


2.2445 0.4455 


Uterus_allgemein 


0.0000 


0.0000 


undef undef 


Brust-Hyperplasie 


0.0000 






Prostata-Hyperplasie 


0.0000 






Samenblase 


0.0000 






Sinnesorgane 


0.0118 






Weisse_ Blutkoerperchen 


0.0000 






Zervix 


0.0319 







FOETUS 

%Haeufigkeit' 

Entwicklung 0.0278 

Gastrointens tinal 0 . 0583 

Gehirn 0.0000 

Haematopoetisch 0.0354 

Haut 0.5025 

Hepatisch 0.0000 

Herz-Blutgefaesse 0.0071 

Lunge 0.0542 

Nebenniere 0.0761 

Niere 0.1235 

Placenta 0.0727 

Prostata 0.0249 

Sinnesorgane 0.1255: 



NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 
%Haeuf igkeit 

Brust 0.0204 

Eierstock_n 0.0000 

Eierstock^t 0.0709 

Endokr ines jGewebe 0.0245 

Foetal 0.0175 

Gastrointestinal 0.0122 

Haematopoetisch 0.0000 

Haut-Muskel 0.0162 

Hoden 0.0000 

Lunge 0.0000 

Nerven 0.0000 

Prostata 0.0000 

Sinnesorgane 0.0000 

Uterus n 0.0000 



19 



DE 198 18 619 A 1 



Elektronischer Northern fur SEQ. ID. NO: 15 



10 



15 



20 



25 



30 



35 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeuf igkeit 


%Haeufigkeit N/T T/N 


Blase 


0.0039 


0.1278 


0.0305 32.7774 


Brust 


0.0179 


0.0414 


' 0 4331 2 3091 


Duenndarm 


0.0031 


0.1323 


0 0232 43 1571 


Eierstock 


0.0090 


0. 0234 


0 3838 2 6058 


Endokrines Gewebe 


0.0358 


0.0301 


1.1887 0.8413 


Gastrointestinal 


0.0019 


0 0000 


Hnrjpf 0 0000 


Gehirn 


0.0007 


0 0010 

U . U U X U 


0 7?00 1 3890 




0 0000 


0 0000 
u • uuuu 


UilUC X LLL Uic X, 


Haut 


0.0037 


0 0000 
u . uuuu 


tinri^'F 0 0000 




0 0000 


0 006S 

u . u U U«J 


0 0000 unrlo-F 


Herz 


0.0244 


0 0000 
u .uuuu 


iinri^"F 0 0000 

UUUC X. u • uuuu 


Hoden 


0.0000 


0.0000 


undef undef 


Lunge 


0.0374 


0. 0491 


0.7621 1.3122 


Mag en— Spe i s e r oenr e 


0.0290 


0. 0077 


-3 7816 0 2644 


Muskel-Skelett 


0.0000 


0 0000 


unde f untie f 


NIere 


0.0027 


0 0068 
u . uuuu 


0 39fi^ ? Q 


Pankreas 


0.0017 


0.2209 


0.0075 133 7133 


Penis 


0.0000 


0.0000 


undef undef 


Prostata 


0.0065 


0.0085 


0.7677 1.3026 


Uterus Endometrium 


0.0000 


o nnoo 

u . UUUU 


unaer unaex. 


Uterus_Myometrium 


0.0152 


0.0000 


undef 0.0000 




0 0000 

U . Uvwv 


u • UUUU 


undef undef 


Brust-Hyperplasie 


0.0192 






rrostata nyperpias le 








Oct UlcXlUXcto c 


0 0967 
u • u^ D / 






O iiiiicoUL y aim 


n 0000 






Weisse Blutkoerperchen 


u • uuuu 






Zervix 


u.UlUO 








FOETUS 








%Haeuf iakeit 






F.nt"wi f*kl tirifi 


0 0000 








0 0000 






ocuiiu 


0 0000 






ClClCsUtCL t>VWC I»*L«?Vj>A,L 


0 0000 

v • u u u u 






Haut 


0 0000 






Hepatisch 


0.0000 






Herz-Blutgefaesse 


0.0000 






Lunge 


0.0000 






Nebenniere 


0.0000 






Niere 


0.0000 






Placenta 


0.0667 






Prostata 


0.0000 






Sinnesorgane 


o.oooo- 







50 

NORMIERTE / SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.1224 
Eierstocl^_n 0.0000 
55 Eierstock_t 0.1013 

Endokrines_Gewebe 0.0000 
Foetal 0.0268 
Gastrointestinal 0.0122 
Haematopoetisch 0 . 0057 
60 ~" Haut-Muskel 0.0065 

Hoden 0.0000 
Lunge 0.0246 
Nerven 0.0000 
Prostata 0.0205 
65 Sinnesorgane 0.0000 

Uterus n 0.0125 



20 



DE 198 18 619 A 1 







Elektronischer 


Northern fiir ^"Ptf"^ 
I^IUI LlJCxll 1 UI OIa^. 


. ID. NO: 


17 




NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeuf igkeit 


N/T 


T/N 


Blase 


0. 


.0039 


0.0537 


0.0726 


13.7665 


Brust 


0. 


.0077 


0.0207 


0.3712 


2.6940 


Duenndarm 


0. 


.0368 


0.0000 


undef 


6.0000 


Eierstock 


0. 


.0150 


0.0078 


1.9188 


0.5212 


Endokrines_Gewebe 


0. 


.0102 


0.0100 


1.0189 


0.9815 


Gastrointestinal 


0. 


0421 


0.0093 


4.5559 


0.2195 


Gehirn 


0. 


0118 


0.0195 


0.6063 


1.6494 


Haematopoetisch 


0. 


0174 


0.0379 


0.4587 


2.1798 


Haut 


0. 


0110 


0.0000 


undef 


0.0000 


Hepatisch 


0. 


0048 


0.0518 


0.0919 


10.8799 


Herz 


0. 


0127 


0.0275 


0.4626 


2.1618 


Hoden 


0. 


0115 


0.0117 


0.9839 


1.0163 


Lunge 


0. 


0114 


0.0061 


1.8628 


0.5368 


Magen-Speis eroehr e 


0. 


0000 


0.0460 


0.0000 


undef 


Muskel-Skelett 


0. 


0154 


0.0060 


2.5700 


0.3891 


Niere 


0. 


0054 


0.0068 


0.7930 


1.2610 


Pankreas 


0. 


0050 


0.0331 


0.1496 


6. 6857 


Penis 


0. 


0090 


0.0533 


0.1685 


5.9357 


Prostata 


0. 


0174 


0.0192 


0.9099 


1.0990 


Uterus Endometrium 


0. 


0068 


0 0000 


undef 


0 0000 


Uterus Myometrium 


0. 


0152 


0.0000 


undef 


0.0000 


Uterus allgemein 


0. 


0204 


0.0000 


undef 


0.0000 


BT*ll*?t— HVTlPTTll P**?"! A 

U L Li **Jf Wv^M^CLO 


0. 


0064 








Prostata-Hyperplasie 


0. 


0238 








Samenblase . 


0. 


0000 








Sinnesorgane 


0. 


0000 








Weisse_Blutkoerperchen 


0. 


0251 








Zervix 


0. 


0106 









FOETUS 

%Haeufigkeit 
Entwicklung 0.0000 
Gastrointenstinal 0.0167 
Gehirn 0.0438 
Haematopoetisch 0.0118 
Haut 0.0000 
Hepatisch 0.0000 
Herz-Blutgefaesse 0.0107 
Lunge 0.0181 
Nebenniere 0.0000 
Niere. 0.0247 
Placenta 0.0061 
Prostata 0.0249 
Sinnesorgane 0.0000. 



Brust 
Eierstock^n 
EierstockM; 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 
- - Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Uterus n 



NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.0408 

0.0000 

o.oioi ; 

0.0000 
0.0087 
0.0122 
0.0000 
0.0130 
0.0000 
0.0082 
0.0010 
0.0068 
0.0000 
0.0000 
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DE 198 18 619 A 1 

Elektronischer Northern fiir SEQ. ID. NO: 18 



10 



15 



20 



25 



30 



Blase 
Brust 
Duenndarm 
Eierstock 
Endokrines_Gewebe 
Gastrointestinal 
Gehirn 
Haemat opoe t is ch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus_Endometrium 
Uterus_Myometrium 
Uterus_allgemein 
Brust-Hyperplasie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
Weisse_Blutkoerperchen 
Zervix 



NORMAL 
%Haeufigkeit 
0.0195 
0,0166 
0.0061 
0.0389 
0.0392 
0.0019 
0.0007 
0.0107 
0.0220 
0.0238 
0.0085 
0.0115 
0.0104 
0.0000 
0600 
0407 
0198 
0030 
0.0000 
0.0405 
0.0305 
0.0153 
0.0064 
0.0030 
0.0000 
0.0118 
0.0000 
0.0000 



TOMOR 


Verhaeltnisse 


%Haeufigkeit N/T T/N 


0.2556 




0.0357 




0.0662 


0 1 ft 7ftCn 


0.0052 


7 AR*ko f\ i 


0. 0326 


1 901 7 n Q^91 
JL»£.\J±f U.OOZl 


0.0000 


und^f n noon 


0.0606 


0 0122 81 Q4Q1 


0.0000 


undef 0 ftftftn 


0.5085 


0.0433 23. 0839 


0.0518 


0.4596 2.1760 


0.0000 


undef 0.0000 


0.0000 


undef 0*0000 


0.0041 


2.5402 0.3937 


0.0077 


0.0000 undef 


0.0480 


1.2493 0.8005 


0.0068 


5.9478 0.1681 


0.0331 


0.5983 1.6714 


0.1066 


0.0281 35.6140 


0.0021 


0.0000 undef 


0.0000 


undef 0.0000 


0.1155 


0.2641 3.7870 


0.0000 


undef 0.0000 



35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Ha ema t opo e t i s ch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS - 

%Haeufigkeit 

0.0696 

0.3332 

0.0000 

0.2202 

0.0000 

1.6381 

0.0285 

0.1337 

1.0903 

0.6301 

0.6786 

0.0499 

0.0000; 



50 



55 



60 



65 



Brust 
Eierstock^n 
Eierstock_t 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 0.0000 
Haut-Muskel 0.0000 
Hoden 0.0000 
Lunge 0.0000 
Nerven 0.0030 
Prostata 0.0000 
Sinnesorgane 0.0000 
Oterus n 0.0250 



NORMIERTE/SOSTRAHIERTE bibliotheken 
%Haeufigkeit 
0.0544 
0000 

1063 > 
0000 
4264 
0000 
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Elektronischer Northern fur SEQ. ID. NO: 19 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeuf xgkeit 


%Haeuf igkeit N/T T/N 


Blase 


0.0195 


0.2301 


0 0847 1 1 7QQ8 


Brust 


0. 0192 


0. 0113 


1 701 ^ 0 ^R"7R 


Duennda mi 


0.0061 


0 0331 




Eier stock 


0.0180 


0 0000 


nnrifa'F 0 OOOO 
mmcL u . uuuu 


Endokrines Gewebe 


0 0034 


0 1555 


0 0219 45 6387 


Gastrointestinal 


0 0000 


0 0370 


o onoo liriHof 

u . uuuu miucJt 




0.0214 


0 1 Sfil 

U • X.J0X 


0 1 7*74 7 2001 


Ha ema t opo e t i s ch 


0.0134 


o noon 


nn/4a-F n nnnn 

UUttcJL u. UUUU 


Haut 


0 0073 


0 0000 


nnHof n nnnn 

UIlUc X u* uuuu 


Hp'oati ^r*h 

UC^O \« •!> O I..A 1 


0 0048 


ft noon 

U • UUUU 


unuer u. uuuu 


Herz 


0 0085 


0 0000 

U . UUUU 


IXLLUCX UtUUUU 


ilVAtCll 


0 0000 


0 0000 


UIJUCJL IXllUtfX 


Lunge 


0. 0197 


0 0020 


Q fi^27 0 1 O^fi 

27 • OJL / U . IUj D 


Maaen- Sueiseroehi-e 


0 0000 


0 0077 
u • uu / # 


.n nnnn 
'U.uuuu unuei 




0. 0034' 


n i **2n 


n n9^n R22i 


Niere 


0. 0109 


ft nnco 
u . uuoo 


X.OooX U.OOUO 


Panic T-^a ^ 

£ fill IVXCOJ 


0 0083 


o nnnn 

U • UUUU 


unaex u.uuuu 




o noon 


ft nnnn 


undef undef 


nua LaLa 


0 0000 


0.0021 


0.0000 undef 


IF L.CX UJ ffiiUUUlc? UXXU-UL 


n n203 


ft Aftftft 

U . UUUU 


ft ft ft ft ft 

under u.uuuu 


Dterus_Myometrium 


0.0000 


0.0475 


0.0000 undef 


Uterus allgemexn 


U . UIOJ 


ft ft rtft ft 

0. ouuo 


under 0.0000 


Brust-Hyperplasie 


0.0128 






Prostata - Hyperplasia 


n nmn 






■JCLULCl JiJ x a d c 


o nono 








0 0235 






1 «; e;£» T\~\ 11 "t* Vr^O TTK* T"^Vl Oft 


0 0000 






Zervix 


0 021 3 








FOETUS 








oiTClt;UJL_LyTi.t;X U 






LW±L.A.± Lilly 


n 0417 






uasx.ivoxnuensuj.nax 


n i 

U . IJOl 






Gehirn 


ft Aft £1 

U . UUbJ 






Haematopoetiscn 


u . xoo / 






n.„f 

Jtiaut 


n nnnn 
U. UUUU 








0.3380 






Herz-Blutgefaesse 


0.0249 






Lunge 


0.0578 






Nebenniere 


0.5071 






Niere 


0.2594 






Placenta 


0.4120 






Prostata 


0.0000 






Sinnesorgane 


0.0000- 







NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufxgkeit 

Brust 0.0612 

Exerstock_n 0.0000 

Eierstock_t 0.0000 

Endokrines_Gewebe 0.0000 

Foetai 0.1188 

Gastrointestinal 0.0244 

Haematopoetisch 0.0000 

Haut-Muskel 0.0000 

Hoden 0.0154 

Lunge 0.0000 

Nerven 0.0141 

Prostata 0.0000 

Sinnesorgane 0.0000 

Uterus n 0.0416 
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DE 198 18 619 A 1 



Elektronischer Northern fur SEQ. ID. NO: 20 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T 


T/N 


Blase 


0.0039 


0.0383 


0.1017 9. 


.8332 


Brust 


0.0077 


0.0075 


1.0208 0. 


.9796 


Duenndann 


0.0061 


0.0496 


0.1236 8, 


.0920 


Eierstock 


0.0000 


0.0130 


0.0000 undef 


Endokrines_Gewebe 


0.0153 


0.0000 


undef 


0. 


.0000 


Gastrointestinal 


0.0115 


0.0000 


undef 


0. 


.0000 


Gehirn 


0.0022 


0.0031 


0.7200 1. 


3890 


Haematopoetisch 


0.0013 


0.0000 


undef 


0. 


0000 


Haut 


0.0073 


0.0000 


undef 


0. 


0000 


Hepatisch 


0.0048 


0.0129 


0.36762. 


7200 


Herz 


0.0000 


0.0000 


undef 


undef 


Hoden 


0.0000 


0.0000 


undef 


undef 


Lunge 


0.0042 


0.0102 


0.4064 


2. 


4605 


Magen-Speiseroehxe 


0.0290 


0.0077 


3.7816 


; o. 


2644 


Muskel-Skelett 


0.0000 


0.0000 


undef 


undef 


Niere 


0.0136 


0.0000 


undef 


0. 


0000 


Pankreas 


0.0017 


0.0000 


undef 


0. 


0000 


Penis 


0.0030 


0-0000 


undef 


0. 


0000 


Prostata 


0.0065 


0.0106 


0.6142 


1. 


6282 


D t e rus_Endome t r ium 


0.0068 


0.0528 


0.1280 


7. 


8106 


Uterus__Myometrium 


0.0000 


0.0000 


undef 


undef 


Uterus_allgemein 


0.0000 


0.0000 


undef 


undef 


Brust-Hyperplasie 


0.0032 











Prostata-Hyperplasie 0. 0268 

Samenblase 0.0267 

Sinnesorgane 0.0235 

Weisse_Blutkoerperchen 0.0000 

Zervix 0.0319 



FOETUS 

%Haeuf igkeit 
Entwicklung 0.0139 
Gastrointenstinal 0.0139 
Gehirn 0.0000 
Haematopoetisch 0.0039 
Haut 0.0000 
Hepatisch 0.0000 
Herz-Blutgefaesse 0.0000 
Lunge 0.0036 
Nebenniere 0.0254 
Niere 0.0062 
Placenta 0.0000 
Prostata 0.0000 
Sinnesorgane 0.0000. 



NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 

Brust 0.0204 

Eierstock_n 0.0000 

Eierstock^_t 0.0203 

Endokrines_Gewebe 0.0245 5 

Foetal 0.0128 

Gastrointestinal 0.0122 

Haematopoetisch 0.0000 

Haut-Muskel 0.0000 

Hoden 0.0154 

Lunge 0.0082 

Nerven 0.0090 

Prostata 0.0068 

Sinnesorgane 0.0000 

Uterus n 0.0000 
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DE 198 18 619 A 1 

Elektronischer Northern fur SEQ. ID. NO: 21 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T T/N 


Blase 


0.0039 


0.0358 


0.1090 9.1777 


Brust 


0.0077 


0.0207 


0.3712 2.6940 


Duenndarm 


0.0215 


0.0331 


0.6488 1.5413 


Eierstock 


0.0030 


0.0078 


0.3838 2. 6058 


EndokrinesjGewebe 


0.0000 


0.0000 


undef unrfpf 


Gastrointestinal 


0.0575 


0.0046 


12.4251 1 


Gehirn 


0.0000 


0.0072 


0-0000 undef 


Haematopoetisch 


0.0214 


0.0000 


undef 0.0000 


Haut 


0.0037 


0.0000 


undef 0.0000 


Hepatisch 


0.0000 


0.0194 


0. 0000 undef 


Herz 


0.0000 


0.0137 


0. 0000 undef 


Hoden 


0.0000 


0.0585 


0. 0000 undef 


Lunge 


0.0104 


0.0225 


0.4618 2.1652 


Magen-Speiseroehre 


0.0000 


0.0000 


undef undef 


Muskel-Skelett 


0.0257 


0.0120 


2.1416 0.4669 


Niere 


0.0054 


0.0000 


undef 0.0000 


Pankreas 


0.0099 


0.0221 


0.4487 2.2286 


Penis 


0.0000 


0.0000 


undef undef 


Prostata 


0.0022 


0.0000 


undef 0.0000 


0 terusJEndome t r ium 


0.0000 


0.0000 


undef undef 


Uterus_Myomet r ium 


0.0000 


0.0000 


undef undef 


Uterus_allgemein 


0.0000 


0.0000 


undef undef 


Brust-Hyperplasie 


0.0096 







Prostata-Hyperplasie 0.0000 

Samenblase 0.0000 

Sinnesorgane 0.0000 

Weisse_Blutkoerperchen 0 . 0000 

Zervix 0.0000 



0.0805 



10 



IS 



20 



25 



30 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000. 



35 



40 



45 



Brust 
Eierstockja 
Eierstock^t 
Endokrines_Gewebe 

Foetal 0.0000 
Gastrointestinal 0.0122 
Haematopoetisch 0.0000 
Haut-Muskel 0.0000 
Hoden 0.0000 
Lunge 0.0164 
Nerven 0.0000 



NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
0.0136 
0.0000 
0.0051 
0.0000 



Prostata 
Sinnesorgane 
Uterus n 



0.0000 
0.0000 
0.0000 



50 



55 



60 



65 
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Elektronischer Northern fur SEQ. ID. NO: 22 



10 



is 



20 



25 



30 



Blase 
Brust 
Duenndarm 
Eierstock 
Endokrines_Gewebe 
Gastrointestinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz 
Hoden 
Lunge 

Magen-Speiseroehre 
' Muskel-Skelett 
Niere 
Pankreas 
Penis 
Prostata 
Uterus_Endometrium 
Uterus_Myometrium 
Uterus_allgemein 
B rust-Hype rplas ie 
Prostata-Hyperplasie 
Samenblase 
Sinnesorgane 
Weisse_Blutkoerperchen 
Zervix 



NORMAL 
%Haeufigkeit 
0.0351 
0.0281 
0.0092 
0.0569 
0.0596 
0.0019 
0.0022 
0.0174 
0220 
0285 
0223 
0173 
0.0145 
0.0290 
0.0788 
0.0489 
0.0264 
0.0090 
0.0000 
0.1013 
0.0381 
0.0153 
0.0096 
0.0059 
0.0000 
0.0118 
0.0000 
0.0000 



TUMOR 


Verhaeltnisse 


%Haeufigkeit N/T T/N 


0.3144 


0.1116 8.9591 


0.0470 


0.5989 1. 6698 


0.0662 


0.1390 7.1929 


0.0208 


2.7342 0.3657 


0.0527 


1.1321 0.8833 


0.0139 


0.1381 7.2434 


0.0945 


0.0235 42.5950 


0.0000 


undef 0.0000 


0.5085 


0.0433 23.0839 


0. 0582 


0.4902 2 0400 


0.0000 


undef 0 . 0000 


0.0000 


undef 0.0000 


0.0082 


1.7781 0.5624 


0.0077 


•3.7816 0.2644 


0.0540 


1.4595 0.6852 


0.0137 


3.5687 0.2802 


0.0442 


0.5983 1.6714 


0.1066 


0.0842 11.8713 


0.0064 


0.0000 undef 


0.0000 


undef 0.0000 


0.1494 


0.2551 3.9206 


0.0954 


0.1601 6.2452 



35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit 
0.0696 
0.4554 
0.0000 
0.2753 
0.0000 
6381 
,0605 
,1879 
,1663 
0.8215 
0.7816 
0.0499 
0.0000. 



50 



55 



60 



65 



Brust 
Eierstock_n 
Eierstock|_t 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 
Haut-Muskel 
Hoden 



NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.0612 

0.0000 

0.1164 

0.0000 

0.4665 

0.0000 

0.0000 

0.0000 

0.0000 



Lunge 0.0000 
Nerven 0.0030 



Prostata 
Sinnesorgane 
Uterus n 



0.0068 
0.0000 
0.0291 
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DE 198 18 619 A 1 



Elektroniscber Northern fur SEQ. ID. NO: 23 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T T/N 


Blase 


0.0039 


0.0332 


0.1173 8.5221 


Brust 


0.0038 


0.0263 


. 0.1458 6.8574 


Duenndarm 


0.0184 


0.0331 


0.5561 1.7982 


Eierstock 


0.0150 


0.0156 


0.9594 1.0423 


Endokrines_Gewebe 


0.0170 


0.0075 


2.2642 0.4417 


Gastrointestinal 


0.0192 


0.0324 


0.5917 1.6901 


Gehirn 


0.0133 


0.0113 


1.1781 0.8488 


Haematopoetisch 


0.0174 


0.0000 


undef 0.0000 


Haut 


0.0220 


0.0000 


undef 0.0000 


Hepatisch 


0.0048 


0.0388 


0.1225 8.1599 


Herz 


0.0339 


0.0137 


2.4671 0.4053 


Hoden 


0.0288 


0.0234 


1.2299 0.8130 


Lunge 


0.0218 


0.0184 


1.1854 0.8436 


Magen-Speiseroehre 


0.0000 


0.0307 


.0.0000 undef 


' Muskel-Skelett 


0.0291 


0.0240 


1.2136 0.8240 


Niere 


0.0136 


0.0137 


0.9913 1.0088 


Pankreas 


0.0050 


0.0166 


0.2991 3.3428 


Penis 


0.0120 


0.0000 


undef 0.0000 


Prostata 


0.0153 


0.0149 


1.0236 0.9769 


Uterus_Endometrium 


0.0203 


0.0000 


undef 0.0000 


Uterus_Myometrium 


0.0000 


0.0204 


0.0000 undef 


Uterus_allgemein 


0.0051 


0.0000 


undef 0.0000 


Brus t-Hyperpla s ie 


0.0128 






Prostata-Hyperplasie 


0.0178 






Samenblase 


0.0267 






Sinnesorgane 


0.0118 






Weisse_Blutkoerperchen 


0.0243 






Zervix 


0.0213 








FOETUS 








%Haeufigkeit- 






Entwicklung 


0.0139 






Gastrointenstinal 


0.0056 






Gehirn 


0.0125 






Haematopoetisch 


0.0000 






Haut 


0.0000 






Hepatisch 


0.0000 






Herz-Blutgefaesse 


0.0320 






Lunge 


0.0036 






Nebenniere 


0.0254 






Niere 


0.0124 






Placenta 


0.0121 






Prostata 


0.0000 






Sinnesorgane 


0.0000: 







NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 

Brust 0.0204 

Eierstock_n 0.0000 

Eierstock^t 0.0354 ; : 

Endokrines_Gewehe 0.0000 

Foetal 0.0105 

Gastrointestinal 0.0000 - m , 

Haematopoetisch 0.0171 

-** Haut-Muskel 0.0454 

Hoden 0.0000 

Lunge 0.0246 

Nerven 0.0211 *- 

Prostata 0.0000 

Sinnesorgane 0.0000 

Uterus n 0.0000 
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DE 198 18 619 A 1 

Elektronischer Northern fur SEQ. ID. NO: 24 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T T/N 


Blase 


0.0195 


0.1457 


0.1338 7.4732 


Brust 


0.0013 


0.0169 


. 0.0756 13-2250 


Duenndarm 


0.0061 


0.0496 


0.1236 8.0920 


Eierstock 


0.0090 


0.0260 


0.3454 2.8954 


Endokrines_Gewebe 


0.0119 


0.0125 


0.9509 1.0516 


Gastrointestinal 


0.1648 


0.0231 


7.1237 0.1404 


Gehirn 


0.0030 


0.0308 


0.0960 10.4173 


Haematopoetisch 


0.2099 


0.0000 


undef 0.0000 


Haut 


0.0000 


0.0000 


undef undef 


Hepatisch 


0.0000 


0.2006 


0. 0000 undef 


Herz 


0.0032 


0.0137 


0.2313 4.3235 


Hoden 


0.0000 


0.0819 


0. 0000 undef 


Lunge 


0.0956 


0.2965 


0.3223 3.1023 


Magen-Speiseroehre 


0.0097 


0.1917 


0.0504 19.8329 


" Muskel-Skelett 


0.0959 


0.0240 


3.9977 0.2501 


Niere 


0.0081 


0.0205 


0.3965 2.5219 


Pankreas 


0.0182 


0.0497 


0.3656 2.7350 


Penis 


0.0030 


0.1333 


0.0225 44.5175 


Prostata 


0.0131 


0.0064 


2.0473 0.4885 


Uterus_Endometrium 


0.0000 


0.0000 


undef undef 


Uterus_Myometrium 


0.0000 


0.0000 


undef undef 


Uterus_allgemein 


0.0051 


0.0000 


undef 0.0000 


Brus t-Hyperplas ie 


0.0096 







Prostata-Hyperplasie 0.0030 
Samenblase 0.0267 
Sinnesorgane 0.0706 
Weisse_Blutkoerperchen 0.0017 
• Zervix 0.0000 



FOETUS 

%Haeufigkeit 

Entwicklung 0.0000 

Gastrointenstinal 0.0000 

Gehirn 0.0000 . 

Haematopoetisch 0.0000 

Haut 0.0000 

Hepatisch 0.0000 

Herz-Blutgefaesse 0.0000 

Lunge 0.0000 

Nebenniere 0.0000 

Niere 0.0000 

Placenta 0.0000 

Prostata 0.0000 

Sinnesorgane 0.000Q. 



NORMIERTE / SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0068 
Eierstockji 0.0000 
EierstockM: 0.2076 
Endokrines_Gewebe 0 . 0000 
Foetal 0.0006 
Gastrointestinal 0.0000 
Haematopoetisch 0.0000 
Haut-Muskel 0.0000 
Hoden 0.0000 
Lunge 0.0082 
Nerven 0.0030 
Prostata 0.0068 
Sinnesorgane 0.0000 
Uterus n 0.0042 
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DE 198 18 619 A 1 



Elektronischer Northern fur SEQ. ID. NO: 25 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T 


T/N 


Blase 


0,0039 


0.0281 


0.1387 


7. 


.2110 


Brust 


0.0217 


0.0132 


1.6527 


0. 


.6051 


Duenndarm 


0.0123 


0.0000 


undef 


0. 


.0000 


Eierstock 


0.0000 


0.0026 


0.0000 


undef 


Endokrines_Gewebe 


0.0102 


0.0150 


0.6792 


1. 


4722 


Gastrointestinal 


0.0115 


0.0046 


2.4850 


0. 


4024 


Gehirn 


0.0148 


0.0154 


0.9599 


1. 


0417 


Haematopoetisch 


0.0107 


0.0000 


undef 


0. 


0000 


Haut 


0.0220 


0.0847 


0.2599 


3. 


8473 


Hepatisch 


0.0000 


0.0000 


undef 


undef 


Herz 


0.0085 


0.0000 


undef 


0. 


0000 


Hoden 


0.0058 


0.0117 


0.4920 


2. 


0326 


Lunge 


0.0145 


0.0061 


2.3708 


0. 


4218 


Magen-Speiseroehre 


0.0000 


0.0077 


0.0000 


undef 


' Muskel-Skelett 


0.0086 


0.0360 


6.2380 


4. 


2024 


Niere 


0.0081 


0.0000 


undef 


0. 


0000 


Pankreas 


0.0050 


0.0166 


0.2991 


3. 


3428 


Penis 


0.0030 


0.0000 


undef 


0. 


0000 


Prostata 


0.0109 


0.0064 


1.7060 


0. 


5862 


0terus_Endometrium 


0.0203 


0.0000 


undef 


0. 


0000 


Uterus_Myometrium 


0.0076 


0.0068 


1.1223 


0. 


8911 


Uterus_allgemein 


0.0102 


0.0000 


undef 


0. 


0000 


Brust-Hyperplasie 


0.0096 











Pros tat a-Hyperplasie 0.0059 

Samenblase 0.0089 

Sinnesorgane 0.0000 

Weiss e_Blutkoerperchen 0.0286 

Zervix 0.0106 



FOETUS 

%Haeufigkeit- 
Entwicklung 0.0000 
Gastrointenstinal 0. 0028 
Gehirn 0.0125 
Haematopoetisch 0.0000 
Haut 0.0000 
Hepatisch 0.0000 
Herz-Blutgef aesse 0 . 0000 
Lunge 0.0036 
Nebenniere 0.0000 
Niere 0.0000 
Placenta 0.0061 
Prostata 0.0000 
Sinnesorgane 0.0000, 



NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 

Brust 0.0204 

Eierstock_n 0.0000 

EierstockM; 0.0051 

Endokrines_Gewebe 0.0000 

Foetal 0.0029 

Gastrointestinal 0 . 0000 

Haematopoetisch 0.0000 

Haut-Muskel 0.0000 

Hoden 0.0000 

Lunge 0.0000 

Nerven 0.0010 

Prostata 0.0068 

Sinnesorgane 0.0000 

Uterus n 0.0000 



29 



DE 198 18 619 A 1 



Elektronischer Northern fur SEQ. ID. NO: 26 



5 



10 



15 



20 



25 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T 


T/N 


Blase 


0.0039 


0.0256 


0.1525 6. 


.5555 


Brust 


0.0038 


0.0056 


0.6805 1, 


,4694 


Duenndarm 


0.0031 


0.0000 


undef 


0. 


,0000 


Eierstock 


0.0210 


0.0052 


4.0294 0. 


.2482 


Endokrines_Gewebe 


0.0017 


0.0000 


undef 


0. 


0000 


Gastrointestinal 


0.0057 


0.0000 


undef 


0. 


0000 


Gehirn 


0.0037 


0.0031 


1.1999 0. 


8334 


Haematopoetisch 


0.0040 


0.0000 


undef 


0. 


0000 


Haut 


0.0073 


0.0000 


undef 


0. 


0000 


Hepatisch 


0.0000 


0.0000 


undef 


undef 


Herz 


0.0000 


0.0000 


undef 


undef 


Hoden 


0.0000 


0.0000 


undef 


undef 


Lunge 


0.0073 


0.0020 


3'. 5562 


: 0. 


2812 


Magen-Speiseroehre 


0.0000 


0.0000 


undef 


undef 


Muskel-Skelett 


0.0069 


0.0000 


undef 


0. 


0000 


Niere 


0.0027 


0.0000 


undef 


0. 


0000 


Pankreas 


0.0017 


0.0000 


undef 


0. 


0000 


Penis 


0.0030 


0.0000 


undef 


0. 


0000 


Prostata 


0.0000 


0.0021 


0.0000 


undef 


Uterus_Endometrium 


0.0000 


0.0000 


undef 


undef 


Uterus_Myometrium 


0.0076 


0.0068 


1.1223 


0. 


8911 


Dt e rusallgeme in 


0.0102 


0.0000 


undef 


0. 


OOOC 



Brust-fiyperplasie 0.0000 
Prostata-Hyperplasie 0 . 0030 
Samenblase 0.0089 
Sinnesorgane 0.0000 
Weiss e_Blutkoerperchen 0.0121 
Zervix 0.0000 



FOETUS 

35 %Haeufigkeit 
Entwicklung 0.0000 
Gastrointenstinal 0.0139 
Gehirn 0.0063 
Haematopoetisch 0.0039 
40 Haut 0.0000 

Hepatisch 0.0000 
Herz-Blutgefaesse 0.0036 
Lunge 0.0036 
Nebenniere 0.0000 
45 1 Niere 0.0062 

Placenta 0.0000 
Prostata 0.0000 
Sinnesorgane 0 . 0000 t 



50 

NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0000 
Eierstockjn 0.0000 
55 Eierstock[_t 0.0101 

Endokrines_Gewebe 0.0000 
Foetal 0.0029 
Gastrointestinal 0.0488 
Haematopoetisch 0.0114 
^ -- Haut-Muskel 0.0130 

Hoden 0.0154 
Lunge 0.0000 
Nerven 0.0020 
Prostata 0.0274 
^ Sinnesorgane 0.0155 

Uterus n 0.0083 



30 



DE 198 18 619 A 1 



Elektronischer Northern fur SEQ. ID. NO: 27 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T T/N 


Blase 


0.0039 


0.0256 


0.1525 6.5555 . 


Brust 


0.0115 


0.0150 


0.7656 1.3062 


Duenndarm 


0.0307 


0.0000 


undef 0.0000 


Eierstock 


0.0150 


0.0052 


2.8781 0.3474 


Endokrines_Gewebe 


0.0119 


0.0150 


0.7925 1.2619 


Gastrointestinal 


0.0057 


0.0139 


0.4142 2.4145 


Gehirn 


0.0074 


0.0092 


0.8000 1.2501 


Haematopoetisch 


0.0147 


0.0000 


undef 0.0000 


Haut 


0.0037 


0.1695 


0.0217 46.1678 


Hepatisch 


0.0048 


0.0000 


undef 0.0000 


Herz 


0.0074 


0.0137 


0.5397 1.8529 


Hoden 


0.0000 


0.0234 


0.0000 undef 


Lunge 


0.0104 


0.0082 


1.2701 0.7873 


Magen-Speiseroehre 


0.0000 


0.0230 


.0.0000 undef 


Muskel-Skelett 


0.0086 


0.0000 


undef 0.0000 


Niere 


0.0190 


0.0000 


undef 0.0000 


Pankreas 


0.0066 


0.0055 


1.1966 0.8357 


Penis 


0.0150 


0.0267 


0.5616 1.7807 


Prostata 


0.0087 


0.0149 


0.5849 1.7096 


Uterus_Endometrium 


0.0203 


0.0000 


undef 0.0000 


Uterus_Myometrium 


0.0000 


0.0136 


0. 0000 undef 


Uterus_allgemein 


0.0051 


0.0000 


undef 0.0000 


Brust-Hyperplasie 


0.0064 







Prostata-Hyperplasie 0.0178 

Samenblase 0.0178 

Sinnesorgane 0 . 0235 

Weisse_Blutkoerperchen 0 . 0104 

Zervix 0.0000 



FOETUS 

%Haeuf igkeit * 

Entwicklung 0.0000 

Gastrointenstinal 0.0111 

Gehirn 0.0125 

Haematopoetisch 0.0000 

Haut 0.0000 

Hepatisch 0.0260 

Herz-Blutgefaesse 0.0071 

Lunge 0.0217 

Nebenniere 0.0254 

Niere 0.0062 

Placenta 0.0061 

Prostata 0.0000 

Sinnesorgane 0.0000-. 



NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0136 
Eierstock_n 0.0000 
EierstocYt 0.0101 
Endokrines_Gewebe 0.0245 
Foetal 0.0181 
Gastrointestinal 0.0244 
Haematopoetisch 0.0114 
= : Haut-Muskel 0.0065 
Hoden 0.0000 
Lunge 0.0164 
Nerven 0.0090 
Prostata 0.0068 
Sinnesorgane 0.0232 
Uterus n 0.0000 



31 



DE 198 18 619 A 1 



Elektronischer Northern fur SEQ. ID. NO: 28 



15 



20 



25 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T T/N 


Blase 


0,0312 


0.1968 


0.1585 6.3096 


Brust 


0.0281 


0.0526 


0.5347 1.8702 


Duenndarm 


0.2177 


0.0000 


undef 0.0000 


Eierstock 


0.0000 


0.0494 


0 . 0000 undef 


Endokrines_Gewebe 


0.0017 


0.0000 


undef 0.0000 


Gastrointestinal 


0.2778 


0.0324 


8.5792 0.1166 


Gehim 


0.0007 


0.0133 


0.0554 18.0566 


Haematopoetisch 


0.1470 


0.0000 


undef 0.0000 


Haut 


0.0551 


0.0000 


undef 0.0000 


Hepatisch 


0.0000 


0.1876 


0.0000 undef 


Herz 


0.0011 


0.0137 


0.0771 12.9706 


Hoden 


0.0000 


0.2339 


0.0000 undef 


Lunge 


0.0623 


0.1206 


0.5166 1.9356 


Magen-Speiseroehre 


0.0000 


0.0153 


0.0000 undef 


Muskel-Skelett 


0.1216 


0.0300 


4.0548 0.2466 


Niere 


0.0081 


0.1027 


0.0793 12.6097 


Pankreas 


0.0743 


0.2209 


0.3365 2.9714 


Penis 


0.0000 


0.0000 


undef undef 


Prostata 


0.0065 


0.0149 


0.4387 2.2795 


Uterus_Endometrium 


0.0068 


0.0000 


undef 0.0000 


Dterus_Myometrium 


0.0000 


0.0000 


undef undef 


Uterus_allgemein 


0.0000 


0.0000 


undef undef 


Brus t- Hype rpl as i e 


0.0384 






Prostata-Hyperplasie 


0.0030 






Samenblase 


0.0000 






Sinnesorgane 


0.0118 






Wei s s eJBlutkoe rper chen 


0.0017 






Zervix 


0.0000 






; : 


FOETUS 








%Haeufigkeit. 






Entwicklung 


0.0000 






Gastrointenstinal 


0.0000 






Gehirn 


0.0000 






Haematopoetisch 


0.0079 






Haut 


O.0000 






Hepatisch 


0.0000 






Herz-Blutgef aesse 


0.0000 






Lunge 


0.0000 






Nebenniere 


0.0000 






Niere 


0.0000 






Placenta 


0.0000 






Prostata 


0.0000 






Sinnesorgane 


0.0000 







50 

NORMIERTE/SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0068 
Eierstockji 0.0000 
55 Eierstock^t 0.0051 

Endokrines_Gewebe 0.0000 
Foetai 0.0000 
Gastrointestinal 0.0000 
Haematopoetisch 0-0000 
Haut-Muskel 0.0000 
Hoden 0.0000 
Lunge 0.0082 
Nerven 0.0000 
Prostata 0.0000 
Sinnesorgane 0.0000 
65 Uterus n 0.0000 



32 



DE 198 18 619 A 1 



Elektroniscbcr Northern fur SEQ. ID. NO: 29 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T T/N 


Blase 


0.0273 


0.1713 


0.1594 6.2745 


Brust 


0.0217 


0.0395 


0.5509 1.8152 


Duenndarm 


0.1901 


0.0000 


undef 0.0000 


Eierstock 


0.0000 


0.0390 


0.0000 undef 


Endokrines_Gewebe 


0.0017 


0.0000 


undef 0.0000 


Gastrointestinal 


0.2491 


0.0278 


8.9737 0.1114 


Gehirn 


0.0000 


0.0113 


0 . 0000 undef 


Haeraatopoetisch 


0.1297 


0.0000 


undef 0.0000 


Haut 


0.0477 


0.0000 


undef 0.0000 


Hepatisch 


0.0000 


0.1423 


0 . 0000 undef 


Herz 


0.0032 


0.0000 


undef 0.0000 


Hoden 


0.0000 


0.2105 


0. 0000 undef 


Lunge 


0.0457 


0.0818 


0.5588 1.7894 


Magen-Speiseroehre 


0.0000 


0.0153 


p. 0000 undef 


Muskel-Skelett 


0.0822 


0.0240 


3.4266 0.2918 


Niere 


0.0054 


0.1027 


0.0529 18.9146 


Pankreas 


0.0694 


0.2430 


0.2855 3.5020 


Penis 


0.0000 


0.0000 


undef undef 


Prostata 


0.0153 


0.0106 


1.4331 0.6978 


Uterus_ Endometrium 


0.0068 


0.0000 


undef 0.0000 


Uterus Myometrium 


0.0000 


0.0000 


undef undef 


Uterus__allgemein 


0.0000 


0.0000 


undef undef 


Brust-Hyperplasie 


0.0288 






Prostata-Hyperplasie 


0.0030 






Samenblase 


0.0000 






Sinnesorgane 


0.0118 






WeisseJBlutkoerperchen 


0.0009 






Zervix 


0.0000 







Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit 
0.0000 
0.0000 
0.0000 
0.0079 
0.0000 
0.0000 
0.0000 
0.0000 
.0000 
.0000 
.0000 
,0000 



0.0000, 



NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0068 
Eierstock_n 0.0000 
Eierstock^t 0.0000 
Endokrines_Gewebe 0.0000 
Foetal 0.0000 
Gastrointestinal 0.0000 
Haematopoetisch 0.0000 
Haut-Muskel 0.0000 
Hoden 0.0000 
Lunge 0.0082 
Nerven 0.0000 
Prostata 0.0000 
Sinnesorgane 0.0000 
Uterus n 0.0000 



33 



DE 198 18 619 A 1 



Elektroniscber Northern fur SEQ. ID. NO: 30 



5 



10 



15 



20 



25 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeuf igkeit 


%Haeufigkeit N/T T/N 


Blase 


0.0858 


0.4525 


0. 1896 5 2742 


Brust 


0.1279 


0.0977 


1.3087 0 7641 


Duenndarm 


0.4998 


0.2150 


2.3242 0 4302 


Eierstock 


0.0180 


0.1353 


0.1328 7 5280 


Endokrines Gewebe 


0.0307 


0.0176 


1.7466 0 57?5 


Gastrointestinal 


0.7434 


0.1341 


5 5413 0 1 ftftS 


Gehirn 


0.0030 


0. 0524 


0 0565 17 70Q3 


Haema t opoe t i s ch 


0. 6884 




una ex. u* uuuu 


Haut 


0.0551 


0. 1695 


fl 394Q 3 ft77Q 


Heria t i *5 r*h 


0 0381 


0 7635 


33 £U* UJ3D 


Herz 


0. 0095 


fl 041 7 

U(U4 X£ 




Hod en 


0 0000 


n 1 4fi3 


u * uuuu unoer 


Lunge 


0 9773 


ft tkAl R 
U ■ O4X0 


ft 1 QCQC 






0.3450 


0 1961 5 OQQQ 

,0 .ajux Jt V333 




n 3?n3 


0.0660 


4*8544 0.2060 


Niere 


0.0163 


0.1780 


0.0915 10.9284 


Pankreas 


0.0529 


0.2319 


0.2279 4.3875 


Penis 


0.0000 


0.0000 


undef undef 


Prostata 


0.0828 


0.0192 


4.3220 0.2314 


Uterus_Endometriura 


0.0068 


0.0000 


undef 0.0000 


Uterus_MyometriuiQ 


0.0076 


0.0204 


0.3741 2.6732 


Uterus_allgemein 


0.0102 


0.0000 


undef 0.0000 


Brust-Hyperplasie 


0.1343 







Prostata-Byperplasie 0.0268 

Samenblase 0.0267 

Sinnesorgane 0 . 0353 

Wei ss e__Blut koe rpe r chen 0.0043 

Zervix 0.0852 



FOETUS 

%Haeuf igkeit - 
0.0000 
0.0000 
0.0000 
0.0118 
0.0000 
0.0000 
0.0000 
0.0000 
0.0000 
0.0000 
0.0000 
0.0000 
0.0000 

50 

NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
0.1837 
0.0000 
0.0405 
0.0000 
0.0023 
0.0976 
0.0000 
0.0000 
0.0000 
0.0000 
0.0020 
0.0000 
0.0000 
0.0042 



35 

Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
40 Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
45 - Niere 

Placenta 
Prostata 
Sinnesorgane 



Brust 
Eierstock_n 
EierstockM: 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 
z ; Hau t-Mus ke 1 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Uterus n 



34 



DE 198 18 619 A 1 

Elektronischer Northern fur SEQ. H>. NO: 31 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit %Haeufigkeit N/T T/N 


Blase 


0.0039 


0.0204 


0.1907 5.2444 


Brust 


0.0077 


0.0056 


1.3611 0.7347 


Duenndarm 


0.0184 


0.0331 


0.5561 1.7982 


Eierstock 


0.0150 


0.0104 


1.4391 0.6949 


Endokrines Gewebe 


0.0102 


0.0000 


undef 0.0000 


Gastrointestinal 


0.0019 


0.0046 


0.4142 2.4145 


Gehirn 


0.0074 


' 0.0072 


1*0285 0.9723 


Haematopoetisch 


0.0067 


0.0000 


undef 0.0000 


Haut 


0.0147 


0.0000 


undef 0.0000 


Hepatisch 


0.0000 


0.0065 


0. 0000 undef 


Herz 


0.0064 


0.0000 


undef 0.0000 


Hoden 


0.0345 


0.0234 


1.4759 0.6775 


Lunge 


0.0073 


0.0143 


0.5080 1. 9684 


Magen— Speiseroehre 


0.0000 


0.0000 


undef undef 


• Muskel-Skelett 


0.0034 


0.0000 


•undef 0.0000 


Niere 


0.0054 


0.0000 


undef 0.0000 


Pankreas 


0.0050 


0.0055 


0.8974 1.1143 


Penis 


0.0030 


0.0000 


undef 0.0000 


Prostata 


0.0153 


0.0043 


3.5827 0.2791 


Uterus_Endometrium 


0.0068 


0.0000 


undef 0.0000 


Ut e rus_Myome trium 


0.0000 


0.0068 


0.0000 undef 


Uterus_allgemein 


0.0000 


0.0000 


undef undef 


Brust-Hyperplasie 


0.0000 







Prostata-Hyperplasie 0.0089 

Samenblase 0.0178 

Sinnesorgane 0 . 0000 

WeisseJBlutkoerperchen 0.0069 

Zervix 0.0106 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit. 

0.0000 

0.0111 

0.0000 

0.0039 

0.2513 

0.0000 

0.0071 

0.0036 

0.0000 

0.0062 

0.0061 

0.0249 

0.0000 



NORMIERTE/SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 

Brust 0.0136 

Eierstock_n 0.0000 

Eierstocto_t 0.0152 

Endokrines_Gewebe 0.0000 

Foetal 0.0041 

Gastrointestinal 0.0122 

Haematopoetisch 0.0000 

Haut-Muskel 0.0130 

Hoden 0.0077 

Lunge 0.0000 

Nerven 0.0040 

Prostata 0.0137 

Sinnesorgane 0 . 0000 

Uterus n 0.0000 



35 



DE 198 18 619 A 1 



Elektronischer Northern fur SEQ. ID. NO: 32 







NORMAL 


TDMOR 


Verhaeltnisse 






%Haeufigkeit 


%Haeufigkeit N/T T/N 


5 


Blase 


0.0039 


0.0204 


0.1907 5.2444 




Brust 


0.0141 


0.0282 


0.4991 2.0038 




Duenndarm 


0.0061 


0.0000 


undef 0.0000 




Eierstock 


0.0090 


0 . 0312 


0.2878 3.4745 




Endokrines_Gewebe 


0.0153 


0.0176 


0.8733 1.1451 


in 


Gastrointestinal 


0.0192 


0.0231 


0.8283 1.2072 


Gehirn 


0.0222 


0.0113 


1.9635 0.5093 




Haematopoetisch 


0.0094 


0.0000 


undef 0.0000 




Haut 


0.0110 


0.0000 


undef 0.0000 




Hepatisch 


0.0048 


0.0129 


0.3676 2.7200 




Herz 


0.0159 


0.0000 


undef 0.0000 


15 


Hoden 


0.0000 


0.0234 


0.0000 undef 




Lunge 


0.0135 


0.0123 


1.1007 0.9085 




Magen-Speiseroehre 


0.0097 


0.0153 


0.6303 1.5866 




Muskel-Skelett 


0.0188 


0.0300 


0.6282 1.5918 


20 


Niere 


0.0163 


0.0205 


0.7930 1 2610 


Pankreas 


0.0083 


0.0221 


0.3739 2. 6743 




Penis 


0.0090 


0.0267 


0.3369 2.9678 




Prostata 


0.0109 


0.0043 


2.5591 0.3908 




Oterus_Endometrium 


0.0068 


0.0000 


undef 0.0000 


25 


Uterus_Myometrium 


0.0076 


0.0204 


0.3741 2.6732 


Dt erus_al 1 gemein 


0.0102 


0.0954 


0.1067 9.3678 




Brust-Hyperplasie 


0.0032 








Prostata-Hyperplasie 


0.0119 








Samenblase 


0.0089 








Sinnesorgane 


0.0235 






30 


Weisse_Blutkoerperchen 


0.0087 








Zervix 


0.0000 




■* 






FOETUS 






35 




%Haeuf igkei t * 








Entwicklung 


0.0000 








Gastrointenstinal 


0.0111 








Gehirn 


0.0000 








Haematopoetisch 


0.0039 






40 


Haut 


0.0000 








Hepatisch 


0.0260 








Herz-Blutgefaesse 


0.0107 








Lunge 


0.0072 








Nebenniere 


0.0000 






45 


: Niere 


0.0124 






Placenta 


0.0000 








Prostata 


0.0000 








Sinnesorgane 


0.0000* 







NORMIERTE/SCTdTRAHIERTE BIBLIOTHEKEN 

%Haeuf igkeit 

0.0000 

0.1595 

0.0203 ; 

0.0000 

0.0058 

0.0488 

0.0114 

0.0032 

0.0154 

0.0164 

0.0060 

0.0068 

0.0000 

0.0083 



Brust 
Eierstock n 
Eierstock_t 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 
Haematopoet is ch 
Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Uterus n 
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Elektronischer Northern fur SEQ. K>. NO: 33 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T T/N 


Blase 


0.0429 


0.2173 


0.1974 5.0656 


Brust 


0.0409 


0.0357 


1.1462 0-8725 


Duenndarm 


0.1870 


0.0662 


2.8269 0.3537 


Eier stock 


0.0030 


0.0833 


0.0360 27.7957 


Endokrines Gewebe 


0.0034 


0.0050 


0.6792 1.4722 


Gastrointestinal 


0.2740 


0.0740 


3.7016 0.2702 


Gehirn 


0.0007 


0.0216 


0.0343 29.1683 


Haematopoetisch 


0.2165 


0.0000 


undef 0.0000 


Haut 


0.0220 


0.0000 


undef 0.0000 


Hepatisch 


0.0095 


0.3429 


0.0277 36.0397 


Her z 


0.0074 


0.0275 


0.2698 3.7059 


Hoden 


0.0000 


0.0585 


0. 0000 undef 


Lunge 


0. 1039 


0. 1738 


0.5977 1. 6731 


Mag en— Spe i s e r o ehr e 


0.0387 


0.0920 


0.4202 2.3799 


Muskel-Skelett 


0.0994 


0.0420 


2.3660 0.4227 


Niere 


0.0054 


0.0753 


0.0721 13.8707 


Pankreas 


0.0231 


0.1160 


0.1994 5.0142 


Penis 


0.0000 


0.0000 


undef undef 


Prostata 


0.0262 


0.0000 


undef 0 . 0000 


Uterus_Endometrium 


0.0068 


0.0000 


undef 0.0000 


Uterus_Myometrium 


0.0076 


0.0000 


undef 0.0000 


Uterus_allgemein 


0.0051 


0.0000 


undef 0.0000 


Bmst-Hyperplasie 


0.0512 







Prostata-Hyperplasie 0.0178 

Samenblase 0.0089 

Sinnesorgane 0.0118 

Weisse__Blut koerperchen 0.0000 

Zervix 0.0319 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit- 

0.0000 

0.0000 

0.0000 

0.0079. 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 



NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0612 
Eierstock_n 0.0000 
EierstocfcM: 0.0152 
Endokrines_Gewebe 0.0000 
Foetal 0.0012 
Gastrointestinal 0.0366 
Haematopoetisch 0.0000 
Haut-Muskel 0.0000 
Hoden 0.0000 
Lunge 0.0000 
Nerven 0.0000 
Prostata 0.0000 
Sinnesorgane 0.0000 
Uterus n 0.0000 



37 
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Elektronischer Northern fur SEQ. ID. NO: 34 



10 



15 



20 



25 





NORMAL 


TDMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T 


T/N 


Blase 


0.0117 


0.0588 


0. 


1990 


5. 


0259 


Brust 


0.0102 


0.0094 


1. 


0888 


0. 


.9184 


Duenndarm 


0.0153 


0.0331 


0. 


4634 


2. 


1579 


Eierstock 


0.0539 


0.0130 


4. 


1445 


0. 


,2413 


Endokrines_Gewebe 


0.0187 


0.0075 


2. 


4906 


0. 


4015 


Gastrointestinal 


0.0134 


0.0093 


1. 


4496 


0. 


6898 


Gehirn 


0.0133 


0.0123 


1. 


0799 


0. 


9260 


Haematopoetisch 


0.0134 


0.0000 


undef 


0. 


0000 


Haut 


0.0147 


0.0000 


undef 


0. 


0000 


Hepatisch 


0.0000 


0.0129 


0. 


0000 


undef 


Herz 


0.0170 


0.0137 


1. 


2336 


0. 


8107 


Hoden 


0.0173 


0.0234 


0. 


7380 


1. 


3551 


Lunge 


0.0062 • 


0.0020 


3. 


0482 


0. 


3281 


Magen-Speiseroehre 


0.0676 


0.0000 


undef 


0. 


0000 


• Muskel-Skelett 


0.0171 


0.0180 


0. 


9518 


1. 


0506 


Niere 


0.0109 


0.0274 


0. 


3965 


2. 


5219 


Pankreas 


0.0066 


0.0110 


0. 


5983 


1. 


6714 


Penis 


0.0269 


0.0533 


0. 


5054 


1. 


9786 


Prostata 


0.0327 


0.0213 


1. 


5354 


0. 


6513 


Uterus_Kndometrium 


0.0068 


0.0000 


undef 


0. 


0000 


Uterus_Myometrium 


0.0534 


0.0000 


undef 


0. 


0000 


Uterus_allgemein 


0.0306 


0.0000 


undef 


0. 


0000 


Brust-Hyperplasie 


0.0000 













Prostata-Hyperplasie 0.0030 
Samenblase 0.0000 
Sinnesorgane 0.0118 
30 Weisse_Blutkoerperchen 0.0061 
Zervix 0.0213 



FOETUS 

35 %Haeufigkeit. 

Entwicklung 0.0139 

Gastrointenstinal 0 . 0555 

Gehirn 0.0000 

Haematopoetisch 0-0079 

40 Haut 0.0000 

Hepatisch 0.0260 

Herz-Blutgefaesse 0.0071 

Lunge 0.0253 

Nebenniere 0.0254 

45 -- Niere 0.0000 

Placenta 0.0485 

Prostata 0.0000 

Sinnesorgane 0.0000 



NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 
%Haeuf igkeit 

Brust 0.0204 

Eierstock_n 0.0000 

Eierstock^t 0.2430 

Endokrines_Gewebe 0.0245 

Foetal 0.0338 

Gastrointestinal 0.0122 

Haematopoetisch 0.0171 

Haut-Muskel 0.0680 

Hoden 0.0077 

Lunge 0.0000 

Nerven 0-0151 

Prostata 0.0342 

Sinnesorgane 0.0000 

Uterus n 0.1166 
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Elektronischer Northern fur SEQ. ID. NO: 35 





NORMAL 


TUMOR 


Verhaeltnisse 




St Hri f*i tF i cr lef* i. i* 

v *l CI w U JuXM w .A- ^ 


%Haeufigkeit N/T T/N 


Blase 


0.0273 


0.1227 


0.2225 4.4952 


Brust 


0 0243 


0 0263 


0 9236 1 0828 


Duennda inn 


0.1625 


ft 0000 


nnHpf ft 0000 




0.0000 


0.0364 


0 0000 w\de£ 




0 0000 


0 0000 


tinde f unde f 




0-2146 


0. 0093 


23 1935 


Gehirn 


0.0000 


0 00 F>9 


0 0000 nnHpf 




0.1203 


ft 0000 






0. 0441 


n onnn 


iirtriof o noon 




ft 0000 


0 1 


o noon HTiHof 




ft 0091 


o noon 


nn^of O flflflrt 
LUlvLc JL U ■ UUUU 


Hoden 


0 0000 


0-0935 


0 0000 nnd*s'F 


xiiinge 


0.0353 


0 0900 


0 3926 2 5473 




0.0000 


0.0077 


0.0000 undef 


" Muskel-Skelett 


0.0857 


0.0120 


7.1388 0.1401 


Niere 


0.0081 


0.0753 


0.1081 9.2471 


Pankreas 


0.0529 


0.1381 


0.3829 2.6116 


Penis 


0.0000 


0.0000 


undef undef 


Prostata 


0.0109 


0.0064 


1.7060 0.5862 


Uterus_Endometrium 


0.0000 


0.0000 


undef undef 


Uterus_Myometrium 


0.0000 


0.0000 


undef undef 


Uterus_allgemein 


0.0000 


0.0000 


undef undef 


Brust-Hyperplasie 


0.0160 







Prostata-Hyperplasie 0.0000 

Samenblase 0.0000 

Sinnesorgane 0-0118 

Weisse_Blutkoerperchen 0.0009 

Zervix 0.0106 



0.0431 



10 



15 



20 



25 



30 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Repatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit- 

0.0000 

0.0000 

0.0000 

0.0039 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000.. 



35 



40 



45 



NORMIERTE / SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 

Brust 0.0000 

Eierstockja 0.0000 

EierstockM: 0-0000 ;. 

Endokrines_Gewebe 0.0000 

Foetal 0.0000 

Gastrointestinal 0.0000 

Haematopoetisch 0.0000 

Haut-Muskel 0.0000 

Hoden 0.0000 

Lunge 0.0082 

Nerven 0.0000 

Prostata 0.0000 

Sinnesorgane 0.0000 

Uterus n 0.0000 



50 



55 



60 



65 
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Elektromscher Northern fair SEQ. ID. NO: 36 







NORMAL 


TUMOR 


Verhaeltnisse 






%Haeufigkeit 


%Haeufigkeit 


N/T T/N 


5 


Blase 


0.0078 


0.0332 


0.2347 4.2611 




Brust 


0,0153 


0.0169 


0.9074 1.1021 




Duenndarm 


0.0092 


0.0165 


0.5561 1.7982 




Eierstock 


0.0210 


0.0130 


1.6118 0.6204 




Endokrines_Gewebe 


0.0085 


0.0150 


0.5660 1.7667 


i n 


Gastrointestinal 


0.0153 


0.0231 


0.6627 1.5090 


1U 


Gehirn 


0.0185 


0.0226 


0.8181 1.2223 




Haematopoetisch 


0.0227 


0.0379 


0.5999 1.6669 




Haut 


0.0073 


0.0000 


undef 0.0000 




Hepatisch 


0.0048 


0.0194 


0.2451 4.0800 




Herz 


0.0201 


0.0137 


1.4649 0.6827 


15 


Hoden 


0.0115 


0.0234 


0.4920 2.0326 




Lunge 


0.0166 


0.0164 


1.0161 0.9842 




Magen-Speiseroehre 


0.0000 


0.0153 


0.0000 undef 




Muskel-Skelett 


0.0137 


0.0060 


•2.2844 0.4378 


20 


Niere 


0.0081 


0.0274 


0.2974 3.3626 


Pankreas 


0.0050 


0.0055 


0.8974 1.1143 




Penis 


0.0180 


0.0267 


0.6739 1.4839 




Prostata 


0.0065 


0.0106 


0.6142 1.6282 




Uterus_Endometriuia 


0.0135 


0.0528 


0.2561 3.9053 




Uterus Myometrium 


0.0152 


0.0408 


0.3741 2.6732 


25 


Uterus_allgemein 


0.0458 


0.0000 


undef 0.0000 




Brust-Hyperplasie 


0.0256 








Prostata-Hyperplasie 


0.0208 








Samenblase 


0.0178 








Sinnesorgane 


0.0118 






30 


Wei s s e_Blut koerperchen 


0.0061 








Zervix 


0.0106 







FOETUS 

%Haeufigkeit. 

0.0417 

0.0167 

0.0250 

0.0079 

0.2513 

0.0000 

0.0142 

0.0181 

0.0000 

0.0000 

0.0242 

0.0249 

0.0000 



NORMIERTE / SUBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.0136 

0.0000 

0.0101 

0.0000 

0.0082 ' 

0.0000 

0.0000 

0.0292 

0.0077 

0.0164 

0.0110 

0.0068 

0.0000 

0.0208 



35 

Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
40 Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
45 Niere 
Placenta 
Prostata 
Sinnesorgane 



Brust 
Eierstock_n 
Eierstock^_t 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 
Haema t opoet i s ch 
„ Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Uterus n 
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DE 198 18 619 A 1 



Elektronischer Northern fur SEQ. ID. NO: 37 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit %Haeufigkeit N/T 


T/N 


Blase 


ft nmo 
U. UU /o 


ft A.O "30 


0.2347 4.2611 


Brust 


U. 0026 


ft ftftftft 

0. 0000 


. under 


ft ft ftftn 

0. 0000 


U U Gill 1U.CI X1U 


ft ft nftft 

o. uooo 


ft ft A i%ft 
0. 0000 


undef 


undef 


11*1 xt-^- 


U . UUUU 


ft ftftftft 

0. 0000 


undef 


undef 


Kndokrlnps Rouphp 

i-lllVAW Jvi. XUCJ UCItCL/C 


u • uuuu 


ft ftftftft 

u. uuuu 


undef 


undef 


Gastrointe**?!* inal 


ft m i c 
0. Olio 


ft ftftftft 

0.0000 


undef 


0.0000 


Gehirn 


a ft ft ft ft 

U. uuuu 


ft ftftftft 

0. 0000 


undef 


undef 


XlCltaUiq U.L£>L>il 


n ftl jn 
0. U14 / 


ft ftftftft 

0. 0000 


undef 


0.0000 


naut 


ft ftftftft 
u. UUUU 


ft ftftftft 

u. uuuu 


undef 


undef 




ft nftftn 


ft ftftftft 

u.uuou 


undef 


undef 




ft ft ftftft 

u. uuuu 


ft ftftftft 

u. uuuu 


undef 


undef 


U/^iH on 
nuucu 


ft ftftftft 

u • uuuu 


ft ftftftft 

u . uuuu 


undef 


undef 


Lunge 


0.0021 


0.0020 


1.0161 0.9842 


Manpn— Sni^T or* op* hi t~o 


0.0000 


0.0000 


undef 


undef 


' Muskel-Skelett 


0.0069 


0.0000 


undef 


0.0000 


Niere 


0.0000 


0.0000 


undef 


undef 


Pankreas 


0.0017 


0.0607 


0.0272 36.7712 


Penis 


0.0000 


0.0000 


undef 


undef 


Prostata 


0.0000 


0.0000 


undef 


undef 


Uterus_Endometrium 


0.0000 


0.0000 


undef 


undef 


Uterus_Myometrium 


0.0000 


0.0000 


undef 


undef 


Uterus_allgemein 


0.0000 


0.0000 


undef 


undef 



Brust-Hyperplasie 0.0000 
Prostata-Hyperplasie 0.0000 
Samenblas e 0.0000 
Sinnesorgane 0 . 0000 
Weisse_Blutkoerperchen 0.0009 
Zervix 0.0000 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeuf igkeit 
0.0000 
0.0000 
.0000 
,0039 
,0000 
,0000 
,0000 
.0000 
0.0000 
0.0000 
0.0000 
0.0000 
0.0000, 



Brust 
Eierstock_n 
Eierstock[_t 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 
Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Uterus n 



NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 
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DE 198 18 619 A 1 



Elektronischer Northern fur SEQ. ID. NO: 38 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T T/N 


Blase 


0, 


.0624 


0. 


.2607 


0. 


.2393 4. 


.1791 


Brust 


0. 


.0051 


0, 


.0282 


0. 


.1815 5. 


.5104 


Duenndarm 


0. 


.0337 


0, 


.0000 


undef 0 . 


.0000 


Eierstock 


0, 


,0030 


0. 


.0937 


0. 


0320 31.2702 


Endokrines_Gewebe 


0. 


0034 


0, 


0025 


1. 


3585 0. 


.7361 


Gastrointestinal 


0. 


.1360 


0. 


.0694 


1. 


9604 0. 


.5101 


Gehirn 


0. 


0015 


0. 


.0318 


0. 


0464 21.5290 


Haematopoetisch 


0. 


1337 


0. 


0000 


undef 0 . 


0000 


Haut 


0. 


0147 


0. 


0000 


undef 0. 


0000 


Hepatisch 


0. 


0000 


0. 


4594 


0. 


0000 undef 


Herz 


0. 


0127 


0. 


0137 


0. 


9252 1. 


0809 


Hoden 


0. 


0000 


0. 


1754 


0. 


0000 undef 


Lunge 


0. 


0540 


0. 


2086 


0. 


2590 3. 


8610 


Magen-Speiseroehre 


0. 


0193 


0. 


0767 


•0. 


2521 3. 


9666 


Muskel-Skelett 


0. 


1747 


0. 


0240 


7. 


2815 0. 


1373 


Niere 


0. 


0190 


0. 


0205 


0. 


9252 1. 


0808 


Pankreas 


0. 


0611 


0. 


0773 


0. 


7906 1. 


2649 


Penis 


0. 


0030 


0. 


0000 


undef 0. 


0000 


Prostata 


0. 


0196 


0. 


0021 


9. 


2126 0. 


1085 


Uterus_EndQmetrium 


0. 


0135 


0. 


0000 


undef 0. 


0000 


Oterus_Myometrium 


0. 


0000 


0. 


0000 


undef undef 


Dt erus_allgeme in 


0. 


0000 


0. 


0000 


undef undef 


Brust-Hyperplasie 


0. 


0064 













b Prostata-Hyperplasie 0.0089 

Samenblase 0.0089 

Sinnesorgane 0 . 0353 

Weisse_Blutkoerperchen 0.0009 

Zervix 0.0426 



FOETUS 

%Haeufigkeit 
Entwicklung 0.0000 
Gastrointenstinal 0 . 0000 
Gehirn 0.0000 
Haematopoetisch 0 - 0000 
Haut 0.0000 
Hepatisch 0.0000 
Herz-Blutgefaesse 0.0000 
Lunge 0.0000 
Nebenniere 0.0000 
Niere 0.0000 
Placenta 0.0061 
Prostata 0.0000 
Sinnesorgane 0.0000-; 



NORMIERTE/SDBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 

Brust 0.0000 

Eierstock^n 0.0000 

Eierstock_t 0.0101 

EndokrinesGewebe 0.0000 

Foetal 0.0000 

Gastrointestinal 0.0000 

Haematopoetisch 0 . 0000 

Haut-Muskel 0.0000 

Hoden 0.0000 

Lunge 0.0000 

Nerven 0.0010 

Prostata 0.0000 

Sinnesorgane 0.0000 

Uterus n 0.0000 
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Elektronischer Northern fur SEQ. ID. NO: 39 





NORMAL 


TUMOR 


Verhaeltnisse 




4 Ha t»n -f i nV"<=» i 1- 


%Haeufigkeit N/T T/N 


Blase 


0.0702 


0.2761 


0.2542 3 9333 


Brust 


0.0614 


0.0620 


0.9899 1.0102 


Duennda rm. 


0.2024 


0.1489 


1.3594 0.7356 


Eierstock 


0.0060 


0.0442 


0.1354 7.3832 


Endokrines Gewebe 


0.0255 


0.0025 


10.1887 0 


Gastrointestinal 


0.4138 


0.0740 


5.5913 0.1788 


Gehirn 


0.0044 


0.0431 


0.1029 9.7228 


Haematopoetisch 


0.5774 


0.0000 


undef 0.0000 


Haut 


0.0257 


0.0000 


undef 0.0000 


Hepatisch 


0.0285 


0.3364 


0.0848 11.7866 


Herz 


0.0095 


0.0412 


0.2313 4.3235 


Hoden 


0.0000 


0.2689 


0. 0000 undef 


Lunge 


0.1735 


0.2678 


0.6476 1.5441 


Magen- Spe i s e r oehr e 


0.0870 


0.1917 


0.4538 2.2037 


Muskel-Skelett 


0.2193 


0.0180 


12.1835 0 


Niere 


0.0136 


0.1164 




Pankreas 


0.0677 


0. 1822 


0 3717 2 GQOfi 


Penis 


o oo^o 


0. 1333 


fl 0295 AA 1175 


Prostata 


0.0327 


0.0085 


3.8386 0.2605 


TH"AT11Q KrtriOTIlP't'T**! ITTtl 

\J ^ X- tuiUUUIC U> X. J.UIU 


o nooo 


n nnnn 
u . uuuu 


undef undef 


Uterus_Myometrium 


0.0000 


0.0000 


undef undef 


u t. e rus_a x x g eine m 


a nn^i 


n nnnn 
U. UUUU 


<inf)A-F n nnnn 

under u.uuuu 


Brust-Hyperplasie 


0.0927 






rtootata nypcipxaoie 


U . Ul / o 






Samenblase 


0.0267 






Sinnesorgane 


0 2823 






Tdp> ■? o o o "Rl 111" VoA"r~P»OTT*VkO'n 


0. 0087 






Zervix 


0.0426 






















Hill Lnl OfvX LUllj 


n nnnn 

u * uuuu 






uaS t to 111 Utiliy t. Ilia X 








Gehirn 


n nnnn 

u. uuuu 






Ha eniat opoe t i s ch 


n m 
u * uxo / 






naUL 


n nnnn 
u ■ uuuu 






Hepatisch 


0.0000 






He r z-Blut gef aess e 


0.0000 






Lunge 


0.0000 






Nebenniere 


0.0254 






Niere 


0.0000 






Placenta 


0.0000 






Prostata 


0.0000 






Sinnesorgane 


0.000Q: 







5 



20 



35 



40 



45 



Brust 
Eierstock_n 
Eierstock^t 
Endokrines__Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 
Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Uterus n 



NORMIERTE/SOBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.4082 

0.0000 

0.3493 

0.0000 

0.0082 

0.1953 

0.0000 

0.0000 

0.0000 

0.0491 

0.0030 

0.0000 

0.0000 

0.0000 



50 



55 



60 



65 



43 



DE 198 18 619 A 1 



Elektronischer Northern fur SEQ. ID. NO: 40 



5 



10 



15 



20 



25 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T 


T/N 


Blase 


0, 


0195 


0.0690 


0.2825 


3. 


.5400 


Brust 


0. 


,0166 


0.0320 


0.5204 


1. 


9216 


Duenndarm 


0. 


0031 


0.0000 


undef 


0. 


0000 


Eierstock 


0. 


0150 


0,0130 


1.1513 


0. 


8686 


EndokrinesGewebe 


0. 


0085 


0.0100 


0.8491 


1. 


1778 


Gastrointestinal 


0. 


0019 


0.0093 


0.2071 


4. 


8289 


Gehirn 


0. 


0067 


0.0442 


0.1507 


6. 


6362 


Haematopoetisch 


0. 


0187 


0.0000 


undef 


0. 


0000 


Haut 


0, 


0073 


0.0000 


undef 


0. 


0000 


Hepatisch 


0. 


0238 


0.0194 


1.2255 


0. 


8160 


Herz 


0. 


0625 


0.1512 


0. 4135 


2. 


4182 


Hoden 


0. 


0345 


0.0117 


2.9518 


0. 


3388 


Lunge 


0. 


0322 


0.0286 


1.1249 


0. 


8889 


Magen-Speiseroehre 


0. 


0000 


0.0307 


.0.0000 


undef 


MuskeX-Skelett 


0. 


0668 


0.1260 


0.5303 


1. 


8857 


Niere 


0. 


0190 


0.0342 


0.5551 


1. 


8014 


Pankreas 


0. 


0050 


0.1160 


0.0427 


23 


.3998 


Penis 


0. 


0299 


0.0000 


undef 


0. 


0000 


Prostata 


0. 


0131 


0.0170 


0.7677 


1. 


3026 


Uterus ^Endometrium 


0. 


0068 


0.0528 


0.1280 


7. 


8106 


Uterus__Myometriura 


0. 


0305 


0.0204 


1.4964 


0. 


6683 


Dterus_all gemein 


0. 


0357 


0.0000 


undef 


0. 


0000 


Brust-Hyperplasie 


0. 


0000 











Prostata-Hyperplasie 0 . 0208 

Samenblase 0.0356 

Sinnesorgane 0.0588 

30 Weisse_Blutkoerperchen 0.0000 

Zervix 0.0319 



FOETUS 

35 %Haeufigkeit' 
Entwicklung 0 . 0000 
Gastrointenstinal 0 . 0167 
Gehirn 0.0000 
Haematopoetisch 0.0118 
4Q Haut 0.0000 

Hepatisch 0.0520 
Herz-Blutgef aesse 0 . 0107 
Lunge 0.0325 
Nebenniere 0.0000 
45 Niere 0.0494 

Placenta 0.0909 
Prostata 0.0000 
Sinnesorgane 0.0000:" 



NORMIERTE/ SUEiTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.0340 
Eierstoclfji 0.0000 
Eierstock_t 0.0203 
55 Endokrines_Gewehe 0.0490 

Foetal 0.0297 
Gastrointestinal 0.0000 
Haematopoetisch 0.0000 
Haut-Muskel 0.0000 
60 Hoden 0.0154 

Lunge 0.0082 
Nerven 0.0030 
Prostata 0.0000 
Sinnesorgane 0 . 0000 
65 Uterus n 0.0000 
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DE 198 18 619 A 1 





Elektronischer Northern ftir SEQ. ID. NO: 41 




NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit %Haeufigkeit N/T T/N 


Blase 0.0390 


0.1355 


0.2878 3.4744 


Brust 


0.0269 


0.0564 


0.4764 2.0992 


Duenndarm 


0.1932 


0.0000 


undef 0.0000 


Eierstock 


0.0000 


0.0312 


0.0000 undef 


Endokrines_Gewebe 


0.0017 


0.0025 


0.6792 1.4722 


Gastrointestinal 


0.2203 


. 0.0509 


4.3300 0.2309 


Gehirn 


0.0000 


0.0103 


0.0000 undef 


Haema topoe t is ch 


0.0882 


0.0000 


undef 0.0000 


Haut 


0.0184 


0.0000 


undef 0.0000 


Hepatisch 


0.0000 


0.0518 


0.0000 undef 


Herz 


0.0095 


0.0137 


0.6939 1.4412 


Hoden 


0.0000 


0.0702 


0.0000 undef 


Lunge 


0.0478 


0.1472 


0.3246 3.0809 


Magen-Speiseroehre 


0.0193 


0.0153 


1.2605 0.7933 


Muskel-Skelett 


0.1011 


0.0120 


8.4237 0.1187 


Niere 


0.0054 


0.0548 


0.0991 10.0878 


Pankreas 


0.0099 


0.0552 


0.1795 5.5714 


Penis 


0.0000 


0.2933 


0.0000 undef 


Prostata 


0.0153 


0.0021 


7.1654 0.1396 


Uterus_Endometrium 


0.0068 


0.0000 


undef 0-0000 


U t erus_Myome t r ium 


0.0000 


0.0068 


0.0000 undef 


Oterus_allgemein 


0.0000 


0.0000 


undef undef 


Brust-Hyperplasie 


0.0192 







Prostata-Hyperplasie 0 . 0119 

Samenblase 0.0000 

S inne s or gane 0.0000 

Weisse_Blutkoerperchen 0. 0009 

Zervix 0.0213 



FOETUS 

%Haeufigkeit 

Entwicklung 0.0000 

Gastrointenstinal 0.0000 

Gehirn 0.0000 

Haema topoet isch 0.0079 

Haut 0.0000 

Hepatisch 0.0000 

Herz-Blutgefaesse 0.0000 

Lunge 0.0000 

Nebenniere 0.0000 

Niere 0.0000 

Placenta 0.0000 

Prostata 0.0000 

S innesor gane 0.0000: 



NORMIERTE/SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 

Brust 0.0000 

Eierstockja 0.0000 

Eierstock^t 0.0000 

EndokrinesjGewebe 0.0000 

Foetal 0.0000 

Gastrointestinal 0.0000 

Haematopoetisch 0 . 0000 

Haut-Muskel 0.0000 

Hoden 0.0000 

Lunge 0.0000 

Nerven 0.0000 

Prostata 0.0000 

Sinnesorgane 0.0000 

. Uterus n 0.0000 
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DE 198 18 619 A 1 



Elektronischer Northern fur SEQ. ID. NO: 42 



5 



10 



15 



20 



25 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%fiaeufigkeit N/T 


T/N 


Blase 


0. 


0273 


0. 


0895 


0.3051 


3. 


2777 


Brust 


0. 


0665 


0. 


0808 


0.8230 


1. 


2151 


Duenndarm 


0. 


0429 


0; 


0000 


undef 


0. 


0000 


Eierstock 


0. 


0539 


0. 


0234 


2.3025 


0. 


4343 


EndokrinesjGewebe 


0. 


0630 


0. 


0978 


0.6444 


1. 


5518 


Gastrointestinal 


0. 


0441 


0. 


0324 


1.3608 


0. 


7348 


Gehirn 


0. 


0554 


0. 


1006 


0.5510 


1. 


8149 


Haematopoetisch 


0. 


0454 


0. 


0379 


1.1998 


0. 


8335 


Haut 


0. 


0257 


0. 


0000 


undef 


0. 


0000 


Hepatisch 


0. 


0381 


0. 


0453 


0.8403 


1. 


1900 


Herz 


0. 


0435 


0. 


0825 


0.5268 


1. 


8981 


Hoden 


0. 


0575 


0. 


0000 


undef 


0. 


0000 


Lunge 


0. 


1008 


0. 


0552 


1.8252 


0. 


5479 


Magen-Speiseroehre 


0. 


0580 


0. 


0997 


0.5818 


1. 


7188 


* Muskel-Skelett 


0. 


0976 


0. 


0660* 


1.4797 


0. 


6758 


Niere 


0. 


0516 


0. 


0890 


0.5795 


1. 


7255 


Pankreas 


0. 


0248 


0. 


0773 


0.3205 


3. 


1200 


Penis 


0. 


0599 


0. 


1066 


0.5616 


1. 


7807 


Prostata 


0. 


0567 


0. 


0766 


0.7393 


1. 


3527 


Uterus Endometrium 


0. 


0405 


0. 


1055 


0.3841 


2. 


6035 


Uterus_Myomet rium 


0. 


0534 


0. 


0475 


1.1223 


0. 


8911 


Uterus_allgemein 


0. 


0866 


0. 


0000 


undef 


0. 


0000 


Brust-Hyperplasie 


0. 


0416 













Prostata-Hyperplasie 0 . 0654 

Samenblase 0.0712 

Sinnesorgane 0 . 0823 

Weisse_Blutkoerperchen 0 . 1110 

Zervix 0.0319 



FOETUS 

35 %Haeufigkeit 
Entwicklung 0.0139 
Gastrointenstinal 0.0361 
Qehirn 0.0125 
Haematopoetisch 0.0433 
40 Haut 0.0000 

Hepatisch 0.0000 
Herz-Blutgefaesse 0.0071 
Lunge 0.0253 
Nebenniere 0.0254 
45 Niere 0.0432 

Placenta 0.0364 
Prostata 0.0499 
Sinnesorgane 0.0126 



NORMIERTE / SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 

Brust 0.1020 

Eierstockja 0.1595 

Eierstock^t 0.0709 

Endokr inesjGewebe 0.0000 

Foetal 0.0635 

Gastrointestinal 0.0000 

Haematopoetisch 0.0114 

Haut-Muskel 0.0680 

Hoden 0.0463 

Lunge 0.0328 

Nerven 0.0351 

Prostata 0.0342 

Sinnesorgane 0.0464 

Uterus n 0.0083 
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DE 198 18 619 A 1 





Elektronischer Northern fur SEQ. ED. NO: 43 




NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T T/N 


Blase 


0. 0195 


0.0639 


0.3051 3.2777 


Brust 


n ft^ a ^ 


0.0470 


0.7350 1.3606 


Duenndarm 


0. 0.399 


0.1985 


0.2008 4.9797 


Eierstock 


ft fil ^ft 


0.0676 


0.2214 4.5168 


Endokr ine s_ Gewebe 


0. u^jy 


0.0878 


0.2717 3.6805 


Gastrointestinal 


0. OooZ 


0.1064 


0.8103 1.2341 


Gehirn 


O.UUo/ 


0.0277 


0.2400 4.1669 


Haema t opoe t is ch 


ft ftftQft 


0.0000 


undef 0.0000 


Haut 


0. 0587 


0.0000 


undef 0.0000 


Hepatisch 


0.0190 


0.0323 


0.5882 1.7000 


Herz 


0.0011 


0.0962 


0.0110 90.7941 


Hoden 


0. 0000 


0.0234 


0.0000 undef 


Lunge 


0. UUbZ 


0.0164 


0.3810 2.6245 


Magen-Speiseroehre 


ft riooT 


0.3450 


, 0.1120 8.9248 


• Muskel-Skelett 


ft flftftft 


0.0360 


0.0000 undef 


Niere 




0.1643 


0.4626 2.1617 


Pankreas 


0.0677 


0.0276 


2.4530 0.4077 


Penis 


0.0090 


0.0533 


0.1685 5.9357 


Prostata 


0.0109 


0.0255 


0.4265 2.3446 


Dterus_Endometrium 


0.0270 


0.0000 


undef 0.0000 


UterusJMyometrium 


0.0076 


0.0272 


0.2806 3.5642 


Uterus_allgemein 


0.0000 


0.4771 


0.0000 undef 


Brust-Hyperplasie 


0.0576 







Prostata-Hyperplasie 0.0119 

Samenblase 0.1068 

Sinnesorgane 0 . 0235 

Weisse_Blutkoerperchen 0.0061 

Zervix 0.0319 



FOETOS 

%Haeufigkeit. 

Entwicklung 0.0000 

Gastrointenstinal 0. 0111 . 

Gehirn 0.0813 

Haematopoetisch 0.0000 

Haut 0.0000 

Hepatisch 0.0000 

Herz-Blutgefaesse 0.0000 

Lunge 0.0145 

Nebenniere 0.0000 

Niere 0.0309 

Placenta 0.0121 

Prostata 0.0249 

Sinnesorgane 0 . 0000 



NORMIERTE/SDBTRAHIERTE BIBLIOTBEKEN 
%Haeufigkeit 

Brust 0.0136 

Eierstock_n 0.0000 

Eierstock^t 0.0101 

Endokrines_Gewebe 0 . 0000 

Foetal. 0.0122 

Gastrointestinal 0.4149 

Haematopoetisch 0.0000 

Haut-Muskel 0.0000 

Hoden 0.0154 

Lunge 0.0573 

Nerven 0.0040 

Prostata 0.0068 

Sinnesorgane 0 . 0000 

Uterus n 0.0000 
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DE 198 18 619 A 1 

Elektronischer Northern fur SEQ. ID. NO: 44 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T 


T/N 


Blase 


0. 


0156 


0.0511 


0.3051 


3. 


2777 


Brust 


0. 


0166 


0.0320 


0.5204 


1. 


9216 


Duenndarm 


0. 


0828 


0.0000 


undef 


0. 


0000 


Eierstock 


0. 


0030 


0.0416 


0.0720 


13.8979 


Endokrines_Gewebe 


0. 


0017 


0.0050 


0.3396 


2. 


9444 


Gastrointestinal 


0. 


1226 


0.0278 


4.4178 


0. 


2264 


Gehim 


0. 


0000 


0.0123 


0.0000 


undef 


Haematopoetisch 


0. 


0481 


0.0000 


undef 


0. 


0000 


Haut 


0. 


0294 


0.0000 


undef 


0. 


0000 


Hepatisch 


0. 


0000 


0.1294 


0.0000 


undef 


Herz 


0. 


0032 


0.0137 


0.2313 


4. 


3235 


Hoden 


0. 


0000 


0.1403 


0.0000 


undef 


Lunge 


0. 


0447 


0.1390 


0.3213 


3. 


1128 


Magen-Speiseroehre 


0. 


0097 


0.0000 


undef 


0. 


0000 


" Muskel-Skelett 


0. 


0497 


0.0060 


"6.2810 


0. 


1208 


Niere 


0. 


0054 


0.0685 


0.0793 


12 


.6097 


Pankreas 


0. 


0215 


0.0166 


1.2963 


0. 


7714 


Penis 


0. 


0030 


0.0000 


undef 


0. 


0000 


Prostata 


0. 


0044 


0.0043 


1.0236 


0. 


9769 


Uterus_Endometrium 


0. 


0000 


0.0000 


undef 


undef 


Oterus_Myometrium 


0. 


0000 


0.0000 


undef 


undef 


Uterus__allgemein 


0. 


0000 


0.0000 


undef 


undef 


Brus t-Hyperplasie 


0. 


0352 











Pros ta ta-Hyperplas ie 0.0030 

Samenblase 0.0000 

Sinnesorgane 0.0000 

Weisse_, Blutkoerperchen 0.0000 

Zervix 0.0106 



FOETUS 

%Haeufigkeit. 

0.0000 

0.0000 

0.0000 

0.0079 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Her z-Blu tgef aes se 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 

Brust 0.0136 

Eierstock_n 0.0000 

EierstockM; 0.0000 

Endokrines_Gewebe 0.0000 

Foetal 0.0000 

Gastrointestinal 0.0000 

Haematopoetisch 0.0000 

-.. Haut-Muskel 0.0000 

Hoden 0.0000 

Lunge 0.0000 

Nerven 0.0000 

Prostata 0.0000 

Sinnesorgane 0.0000 

Uterus n 0.0000 
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DE 198 18 619 A 1 



Elektronischer Northern fur SEQ. ID. NO: 45 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T 


T/N 


Blase 


0. 


0117 


0.0383 


0.3051 


3. 


2777 


Brust 


0. 


0179 


0.0207 


0.8661 


1. 


1546 


Duenndarm 


0. 


0245 


0.0165 


1.4830 


0. 


6743 


Eierstock 


0. 


0329 


0.0156 


2.1106 


0. 


4738 


Endokrines_Gewebe 


0. 


0221 


0.0326 


0.6792 


1. 


4722 


Gastrointestinal 


0. 


0153 


0.0139 


1.1045 


0. 


9054 


Gehirn 


0. 


0296 


0.0288 


1.0285 


0. 


9723 


Haematopoetisch 


0. 


0187 


0.0379 


0.4940 


2. 


0241 


Haut 


0. 


0257 


0.0000 


undef 


0. 


0000 


Hepatisch 


0. 


0048 


0.0129 


0.3676 


2. 


7200 


Herz 


0. 


0201 


0.0550 


0.3662 


2. 


7306 


Ho.den 


0. 


0058 


0.0117 


0.4920 


2. 


0326 


Lunge 


0. 


0218 


0.0225 


0.9699 


1. 


0311 


Magen-rSpeiseroehre 


0. 


0387 


0.0383 


-1.0084 


0. 


9916 


■ Muskel-Skelett 


0. 


0171 


0.0240 


0.7139 


1. 


4008 


Niere 


0. 


0190 


0.0068 


2.7756 


0. 


3603 


Pankreas 


0. 


0116 


0.0221 


0.5235 


1. 


9102 


Penis 


0. 


0090 


0.0000 


undef 


0. 


0000 


Prostata 


0. 


0327 


0.0319 


1.0236 


0. 


9769 


Uterus_Endometrium 


0. 


0068 


0.0000 


undef 


0. 


0000 


Uterus_Myometrium 


0. 


0000 


0.0136 


0.0000 


undef 


Dterus_allgemein 


0. 


0560 


0.0000 


undef 


0. 


0000 


Brus t-Hyperplas ie 


0. 


0032 











5 



10 



15 



20 



25 



Prostata-Hyperplasia 0 . 0208 

Samenblase 0.0178 

Sinnesorgane 0.0118 

Weisse_Blutkoerperchen 0.0260 

Zervix 0.0000 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit\ 

0.0000 

0.0139 

0.0313 

0.0039 

0.0000 

0.0000 

0.0071 

0.0217 

0.0000 

0.0000 

0.0424 

0.0249 

0.0000 



35 



45 



Brust 
Eierstockji 
Eierstock^t 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 
Haut-Muskel 
Hoden 
Lunge 
Nerven 
Prostata 
Sinnesorgane 
Uterus n 



NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 

%Haeufigkeit 

0.0408 

0.0000 

0.0405 

0.0245 

0.0087 

0.0000 

0.0057 

0.0097 

0.0231 

0.0082 

0.0251 

0.0205 

0.0000 

0.0125 



50 



55 



60 



65 
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DE 198 18 619 A 1 

Elektronischer Northern fur SEQ. ID. NO: 46 



to 



15 



20 



25 



30 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T 


T/N 


Blase 


0. 


1170 


0.3067 


0. 


3814 


2. 


6222 


Brust 


0. 


3019 


0.2387 


1. 


2646 


0. 


7908 


Duenndarm 


1. 


1559 


0.0000 


undef 


0. 


0000 


Eierstock 


0. 


0120 


0.0676 


0. 


1771 


5. 


6460 


EndokrinesjGewebe 


0. 


0034 


0.0125 


0- 


2717 


3. 


6805 


Gastrointestinal 


1. 


2798 


0.1804 


7. 


0940 


0. 


1410 


Gehirn 


0. 


0007 


0.0380 


0, 


0195 


51 


.3918 


Haematopoetisch 


0, 


4785 


0.0000 


undef 


0. 


0000 


Haut 


0. 


1322 


0.0000 


undef 


0. 


0000 


Hepatisch 


0. 


0285 


0.4594 


0, 


0621 


16 


1.0932 


Herz 


0. 


0138 


0.0275 


0. 


5011 


1. 


9955 


Hoden 


0. 


0000 


0.2456 


0. 


0000 


undef 


Lunge 


0, 


3625 


0.3435 


1. 


0554 


0. 


9475 


Magen-Speiseroehre 


0. 


0483 


0.1533 


0. 


3151 


3. 


1733 


* Muskel-Skelett 


0. 


2124 


0.0480 


4. 


4260 


0. 


2259 


Niere 


0. 


0163 


0.1917 


0. 


0850 


11 


.7691 


Pankreas 


0. 


1074 


0.4528 


0. 


2371 


4. 


2171 


Penis 


0. 


0000 


0.0533 


0. 


0000 


undef 


Prostata 


0. 


0806 


0.0255 


3. 


1562 


0. 


3168 


Uterus_Endometrium 


0, 


0068 


0.0000 


undef 


0. 


0000 


Uterus Myometrium 


0. 


0152 


0.0000 


undef 


0. 


0000 


Uterus_allgemein 


0. 


0102 


o.oooo 


undef 


0. 


0000 


Brus t-Hyperplas ie 


0, 


3805 













Prostata-Hyperplasie 0.0505 
Samenblase 0.0356 
Sinnesorgane 0.5175 
Weisse_Blutkoerperchen 0.0078 
Zervix 0.0213 



35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeuf igkeit. 

0.0000 

0.0000 

0.0000 

0.0039 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0303 

0.0000 

0.0000 



50 



55 



60 



65 



NORMIERTE/ SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.7687 
0.0000 
0.1924 
0.0000 
0.0029 
0.0976 



Eierstockji 
Eierstock*_t 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 0.0000 
Haut-Muskel 0.0000 
Hoden 0.0000 
Lunge 0.0328 
Nerven 0.0020 
Prostata 0.0000 
Sinnesorgane 0.0000 
Uterus n 0.0000 
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Elektronischer Northern fur SEQ. ID. NO: 47 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeuf igkeit N/T 


T/N 


Blase 


0. 


0195 


0.0486 


0.4014 


2. 


4911 


Brust 


0. 


0550 


0.1297 


. 0.4241 


2. 


3580 


Duenndarm 


0. 


0123 


0.1654 


0.0741 


13 


1.4866 


Eierstock 


0. 


0449 


0.0728 


0.6167 


1. 


6214 


Endokrines_Gewebe 


0. 


0119 


0.0075 


1.5849 


0. 


6309 


Gastrointestinal 


0. 


0153 


0.1018 


0.1506 


6. 


6398 


Gehirn 


0. 


0037 


0.0359 


0.1029 


9. 


7228 


Haematopoetisch 


0. 


0134 


0.0000 


undef 


0. 


0000 


Haut 


0. 


0404 


0.0847 


0.4765 


2. 


0985 


Hepatisch 


0. 


0285 


0.0323 


0.8824 


1. 


1333 


Herz 


0. 


0233 


0.0825 


0.2827 


3. 


5374 


Hoden 


0. 


0460 


0.0351 


1.3119 


0. 


7622 


Lunge 


0. 


0551 


0.0491 


1.1219 


0. 


8913 


Magen-Speiseroehre 


0. 


0000 


0.0613 


0.0000 


undef 


' Muskel-Skelett 


0, 


1148 


0.3120 


0.3679 


2. 


7180 


Niere 


0. 


0271 


0.0137 


1.9826 


0. 


5044 


Pankreas 


0. 


0116 


0.0884 


0.1309 


7. 


6408 


Penis 


0. 


0180 


0.0533 


0.3369 


2. 


9678 


Prostata 


0. 


0087 


0.0319 


0.2730 


3. 


6634 


Uterus Endometrium 


0. 


0270 


0.0000 


undef 


0. 


0000 


Uterus_Myometrium 


0. 


0610 


0.1902 


0.3206 


3. 


1187 


Uterus_allgemein 


0. 


0509 


0.0000 


undef 


0. 


0000 



Brust-Hyperplasie 0.0288 

Prostata-Hyperplasie 0.0089 

Samenblase 0.0178 

Sinnesorgane 0. 0118 

Weisse_Blutkoerperchen 0.0000 

Zervix 0.0532 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Haut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeuf igkeit- 
0.1809 
0.1388 
0.0063 
0.0551 
2513 
0000 
0605 
1409 
2535 
1235 
0.0364 
0.0748 
1.3934, 



Brust 
Eierstock_n 
EierstockM; 
Endokrines_Gewei>e 
Foetal 
Gastrointestinal 
Haematopoetisch 
Haut-Muskel 
Hoden 
Lunge 
Nerven 0.0040 
Prostata 0.0137 
Sinnesorgane 0 . 0000 
Uterus n 0.0083 



NORMIERTE / SUBTRAH IERTE BIBLIOTHEKEN 
%Haeufigkeit 
0.0204 
0.0000 
0.0304 
0.0000 
0.0315 
0.0122 
0.0000 
0.0518 
0.0231 
0.0000 
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Elektronischer Northern fur SEQ. ID. NO: 48 



10 



15 



20 



25 



30 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeuf igkeit 


%Haeufigkeit N/T T/N 


Blase 


0.0624 


0 1 ^Rft 


ft A S9ft 9 91 9 1 * 


Brust 


0.0345 




ft cioi; i £**9*7 


Duennda nn. 


0.0491 


ft ni 


9 Qfi^Q ft ^^*79 


Eierstoek 


0.0689 






F.ndokrines Qpwp.hp 


0.0392 


n o?7fi 


1 4909 0 7041 


Gastrointestinal 


0*0460 


n n^n 

U • UD^tJ 


ft 71 ftft 1 Af\QA 


Gehirn 


0.0333 


ft nfi7R 


ft 4QftQ 9 ft**79 


Haemat opoe t is ch 


0.0374 


ft ft"7^(i 


ft 4Q4ft 9 ft941 


Haut 


0.0257 




ft 1 SI fi fi SQS4 


Hepatiscli 


0.1142 


U • UJIO 




Herz 


0.0774 


0.7010 


0 1104 9 0616 


Hoden 


0.2589 


0.1520 


1 7030 0 5872 


Lunge 


0.0540 


0. 0491 


1 1007 0 9085 


Magen— Spei se roehre 


0.1256 


0.2070 


0. 6069 1. 6477 


• " Muskel-Skelett 


0.1542 


0.2100 


0.7343 1.3619 


Niere 


0.0109 


0.0959 


0 1133 8 8268 


Pankreas 


0.0198 


0.1270 


0 1561 6 4071 


Penis 


0.0359 


0 0800 


0 4493 2 2259 


Prostata 


0.0785 


0.0511 


1.5354 0.6513 


Uterus Endometrium 


0.0338 


ft ft^9ft 


ft GLAC\0 1 Rfi91 


Uterus_Myometrium 


0.0457 


0.0679 


0.6734 1.4851 


TTt p T*n °. a 1 1 ripmp i n 


0. 0764 


U. U954 


a ortnc i Ojinrt 

u. huud i.z^yu 


Brust-Hyperplasie 


0.0224 






riOStaLa flype ipiaolc 


n ft£9A 






O CLl 1 LC 1 xu xaoc 


0. 0445 






O J.1JX1 fcio U X. y aXic 


n m i ft 






Weisse Hlutkoerperchen 


n rm9Q 






Zervix 


n n*;'*9 








J; V/Hj J. 














X-jlX W _L_ J^J. U11U 


0. 0139 






OCi. o LIU1U Lcllo L.XX1.CIJ- 


0* 0167 






VSfcJIl-L J- 11 


n nnno 






ClacXllcLUULJUG L.JL ol-.il 








QauL 


n noon 






Hepa t i s ch 


0-0000 






Herz-Blutgefaesse 


0.0249 






Lunge 


0.0108 






Nebenniere 


0.0000 






Niere 


0.0000 






Placenta 


0.3333 






Prostata 


0.1995 






Sinnesorgane 


0.0000. 







50 

NORMIERTE/SUBTRAHIERTE BIBLIOTHEKEN 
%Haeuf igkeit 

Brust 0.1156 

Eierstockji 0.0000 

55 Eierstoc)r_t 0.1873 

Endokr ines_Gewebe 0.0000 

Foetal 0.0181 

Gastrointestinal 0.0000 

Haematopoetisch 0.0057 

Haut-Muskel 0.0486 

Hoden 0.0000 

Lunge 0.0328 

Nerven 0,0020 

Prostata 0.0274 

Sinnesorgane 0.0155 

65 Uterus n 0.0541 
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Elektronischer Northern ffir SEQ. ID, NO: 49 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit %Haeufigkeit N/T 


T/N 


Blase 


0. 


0585 


0.1278 


0.4576 


2. 


1852 


Brust 


0. 


0742 


0-0507 


. 1.4619 


0. 


6841 


Duenndann 


0. 


1134 


0.0000 


undef 


0. 


0000 


Eierstock 


0. 


0659 


0.0390 


1.6885 


0. 


5922 


Endokrines__Gewebe 


0. 


0324 


0.0176 


1.8437 


0. 


5424 


Gastrointestinal 


0. 


0690 


0.0694 


0.9940 


1. 


0060 


Gehim 


0. 


0325 


0.0893 


0.3641 


2. 


7464 


Haematopoetisch 


0. 


0842 


0.0000 


undef 


0. 


0000 


Haut 


0. 


0404 


0.0000 


undef 


0. 


0000 


Hepatisch 


0. 


1855 


0.0647 


2.8677 


0. 


3487 


Herz 


0. 


0435 


0.0962 


0.4516 


2. 


2145 


Hoden 


0. 


0230 


0.0585 


0.3936 


2. 


5408 


Lunge 


0. 


2275 


0.1063 


2.1396 


0. 


4674 


Magen-Speiseroehre 


0. 


0483 


0.1993 


0.2424 


4. 


1252 


'* Muskel-Skelett 


0. 


0857 


0.0720 


1.1898 


0. 


8405 


Niere 


0. 


0706 


0.0274 


2.5774 


0. 


3880 


Pankreas 


0. 


0347 


0.2927 


0.1185 


8. 


4367 


Penis 


0. 


0090 


0.0267 


0.3369 


2. 


9678 


Prostata 


0. 


0458 


0.0319 


1.4331 


0. 


6978 


Uterus_Endometrium 


0, 


0270 


0.0000 


undef 


0. 


0000 


Uterus_Myometrium 


0. 


0610 


0.0204 


2.9927 


0. 


3341 


Oterus__ allgemein 


0. 


0917 


0.0000 


undef 


0. 


0000 


Brust-Hyperplasie 


0. 


0416 











Prostata-Hyperplasie 0.0386 

Samenblase 0.0267 

Sinnesorgane 0.1647 

Weisse_Blutkoerperchen 0.2497 

Zervix 0.0000 



FOETUS 

%Haeufigkeit- 

Entwicklung 0.0000 

Gastrointenstinal 0.0916 

Gehirn 0.0500 

Haematopoetisch 0 . 08 65 

Haut 0.0000 

Hepatisch 0.0520 

Herz-Blutgefaesse 0.0178 

Lunge 0.0434 

Nebenniere 0.0000 

Niere 0.0432 

Placenta 0.1636 

Prostata 0.0000 

Sinnesorgane 0.0000. 



NORMIERTE/SUBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
Brust 0.1156 
Eierstock n 0.0000 
Eierstoclf^t 0.0354 
Endokr ine s_Gewebe 0.0490 
Foetal 0.0571 
Gastrointestinal 0.0854 
Haematopoetisch 0.0057 
Haut-Muskel 0.0097 
Hoden 0.0154 
Lunge 0.0491 
Nerven 0.0562 
Prostata 0.0615 
Sinnesorgane 0.0000 
Uterus n 0.1957 
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Mektronischer Northern fur SEQ. ED. NO: 50 



5 



10 



15 



20 



25 





NORMAL 


TUMOR 


Verhaeltnisse 




%Haeufigkeit 


%Haeufigkeit N/T 


T/N 


Blase 


0. 


0546 


0.1099 


0.4967 


2. 


0135 


Brust 


0. 


0563 


0.0489 


1.1517 


0. 


8683 


Duenndarm 


0. 


1380 


0.0331 


4.1708 


0. 


2398 


Eierstock 


0. 


0599 


0.0650 


0.9210 


1. 


0858 


Endokrines_Gewebe 


0. 


0238 


0.0351 


0.6792 


1. 


4722 


Gastrointestinal 


0. 


1322 


0.1804 


0.7328 


1. 


3647 


Gehirn 


0. 


0229 


0.0452 


0.5072 


1. 


9714 


Haematopoe t i s ch 


0. 


0241 


0.0000 


undef 


0. 


0000 


Haut 


0. 


1689 


0.0000 


undef 


0. 


0000 


Hepatisch 


0. 


0285 


0.0518 


0.5515 


1. 


8133 


Herz 


0. 


1166 


0.1649 


0.7067 


1. 


4150 


Hoden 


0. 


0115 


0.0117 


0.9839 


1. 


0163 


Lunge 


0. 


1070 


0.1329 


0.8050 


1. 


2422 


Magen-Speiseroehxe 


0. 


1450 


0.0613 


•2.3635 


0. 


4231 


Muskel-Skelett 


0. 


0685 


0.0240 


2.8555 


0. 


3502 


Niere 


0. 


0570 


0.0753 


0.7570 


1. 


3210 


Pankreas 


0. 


0165 


0.1491 


0.1108 


9. 


0256 


Penis 


0. 


0779 


0.0267 


2.9202 


0. 


3424 


Prostata 


0. 


0610 


0.0255 


2.3885 


0. 


4187 


Uterus_Endometrium 


0. 


0338 


0.0000 


undef 


0. 


0000 


0terus_Myometrium 


0. 


0991 


0.0340 


2.9179 


0. 


3427 


Uterus_allgemein 


0. 


0509 


0.1908 


0.2669 


3. 


7471 



Brust-Hyperplasie 0. 0064 
Prostata-Hyperplasie O.0386 
Samenblase 0.0801 
Sinnesorgane 0. 0588 
30 Weisse_Blutkoerperchen 0.0616 
Zervix 0.1810 



35 



40 



45 



Entwicklung 
Gastrointenstinal 
Gehirn 
Haematopoetisch 
Ha-ut 
Hepatisch 
Herz-Blutgefaesse 
Lunge 
Nebenniere 
Niere 
Placenta 
Prostata 
Sinnesorgane 



FOETUS 

%Haeufigkeit 

0.0139 

0.0194 

0.0000 

0.0275 

0.0000 

0.0000 

0.0142 

0.0145 

0.0000 

0.0247 

0.0364 

0.0499 

0.0000' 



50 



55 



60 



65 



Brust 
Eierstock^n 
Eierstock__t 
Endokrines_Gewebe 
Foetal 
Gastrointestinal 
Haematopoetisch 0 . 0057 
Haut-Muskel 0.0259 
Hoden 0.0309 
Lunge 0.1802 
Nerven 0.0050 



NORMIERTE/SOBTRAHIERTE BIBLIOTHEKEN 
%Haeufigkeit 
0.0068 
0.0000 
0.0101 
0.0000 
0.0064 
0.0976 



Prostata 
Sinnesorgane 
Uterus n 



0.0274 
0.0000 
0.0125 
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2.2 Fisher-Test 

Um zu entscheiden, ob eine Partial-Sequenz S eines Gens in einer Bibliothek fur Normal-Gewebe significant haufiger 
oder seltener vorkommt als in einer Bibliothek fur entartetes Gewebe, wird Fishers Exakter Test, ein statistisches Stan- 
dardverfahren (Hays, W. L., (1991) Statistics, Harcourt Brace College Publishers, Fort Worth), durchgefuhrt. 

Die Null-Hypothese lautet: die beiden Bibliotheken konnen heziiglich der Haufigkeit zu S homologer Sequenzen nicht 
unterschieden werden. Falls die NuU-Hypothese mit hinreichend hoher Sicherheit abgelehnt werden kann, wird das zu S 
gehorende Gen als interessanter Kandidat fur ein Krebs-Gen akzeptiert, und es wird im nachsten Schritt versucht, eine 
Veriangerung seiner Sequenz zu erreichen. 

Beispiel 3 

Automatische Veriangerung der Partial-Sequenz 
Die automatische Veriangerung der Partial-Sequenz S vollzieht sich in drei Schritten: 

1. Ermittlung aller zu S homologen Sequenzen aus der Gesamtmenge der zur Verfueune stehenden Seauenzen mit 
Hilfe von BLAST 

2. Assemblierung dieser Sequenzen mittels des Standardprogramms GAP4 (Bonfield, J. K., Smith, K. F und Sta- 
den R. (1995), Nucleic Acids Research 23 4992-4999) (Contig-Bildung). 

3. Berechnung einer Konsens-Sequenz C aus den assemblierten Sequenzen. 

Die Konsens-Sequenz C wird im allgemeinen langer sein als die Ausgangssequenz s. Ihr elektronischer Northern-Blot 
wird demzufolge von dem fur S abweichen. Ein emeuter Fisher-Test entscheidet, ob die Altemativ-Hypothese der Ab- 
weichung von einer gleichmaBigen Expression in beiden Bibliotheken aufrechterhalten werden kann. 1st dies der Fall, 
wird versucht, C in gleicher Wcise wie S zu verlangern. Diese Iteration wird mit der jeweils erhaltenen Konsensus-Se- 
quenzen Q (i: Index der Iteration) fortgesetzt, bis die Altemativ-Hypothese verworfen wird (if Hq Exit; Abbruchkrite- 
rium I) oder bis keine automatische Veriangerung mehr moglich ist (while Q > C^; Abbruchkriterium H). 

Im Fall des Abbruchkriteriums II bekommt man mit der nach der letzten Iteration vorliegenden Konsens-Sequenz eine 
komplette oder annahemd komplette Sequenz eines Gens, das mit hoher statistischer Sicherheit mit Krebs in Zusammen- 
hang gebracht werden kann. 

Analog der oben beschriebenen Beispiele konnten die in der TabeWe I beschriebenen Nukleinsaure-Sequenzen aus 
Blasentumorgewebe gefunden werden. 

Femer konnten zu den einzelnen Nukleinsaure-Sequenzen die Peptidsequenzen (ORFs) bestimmt werden, die in der 
Tabelle II aufgelistet sind, wobei wenigen Nukleinsaure-Sequenzen kein Peptid zugeordnet werden kann und einigen 
Nukleinsaure-Sequenzen mehr als ein Peptid zugeordnet werden kann. Wie bereits oben erwahnt, sind sowohl die ermit- 
telten Nukleinsaure-Sequenzen, als auch die den Nukleinsaure-Sequenzen zugeordneten Peptid-Sequenzen Gegenstand 
der vorliegenden Erfindung. 
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Glycoprotein M 


Hyaluroneclln (HN) Ist eln Glykoprotein, dass Hyaluron bindet 
und oft in humanen Tumoren gefunden wlrd 


I unbekanntl 
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das H19 Gen wird nur vom maternalen Chromosom exprimiert 
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65 kD Protein gerichtet ist 
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I 94.051 


| 94.05| 


o 
o 


o 
o 


o 
o 


Sequenz 
ID No.: 




CM 


CO 




in 


CO 




CO 


o> 


o 




CM 


CO 




to 


CO 




00 


09 


o 

CM 


CM 


CM 
CM 


CO 
CM 


CM 


in 

CM 


CD 
CM 


CM 


CO 
CM 


CD 
CM 


o 

CO 
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Chromosomale 
Lokaiisation 


I unbekanntl 


| unbekanntl 








I unbekanntl 


















| unbekannt) 








** 




Lange der 
angemeldeten 
Sequenz in 
Basen 


I 1031 | 


o> 

CO 


in 
CD 


co 

CM 
CO 


N» 

m 


| 1203| 


O 
CM 


CO 


CO 
CM 
CO 


2384 


3 

co 


m 

3 


| 2233] 


co 
^* 

CM 


5 


1644 


I 11331 


o> 

CD 
CO 


T — 

CD 


o 


Lange des 
AusgangsrEST in 
Basen 


o 

t~ 
CM 


00 


CO 

oo 


o> 


CO 
CM 


CO 

o 

CM 


h- 
O 
CM 


m 

CO 


CO 
CM 


CO 


V* 


CO 
O) 


CO 
N. 


CM 


o 

CM 
CM 


o 
o 

CM 


CM 
CO 
CM 


O 
00 

T— 


O 


O 

T— 

CM 


Funktion 


unbekanntl 


mitochondrlales Enzym | 


identisch zur Ig kappa lelchten Kette variable Region D1 1.| 


Identisch zum humanen hsp27| 


identisch zur Ig schweren Kettevarlabien Region V(2-1) (v(h)-iv 

Famille). 


unbekannt] 


identisch zur Immunoglobulin lleichten Kette variable Region 
(lambda-lllb Untergruppe) von IgM Rheuma-bezogener 

Faktoren. 


| identisch zu US-Patent | 


identisch zu einer immunogiobuiin lambda lelchten Kette 

(X57812) 


. Identisch zur humanen hPGI mRNA, die das "bone small 

proteoglycan P kodiert 


| identisch zu patentlerter Sequenz 


| identisch zu patentierter Sequenz 


| identisch zur CD24 "signal transducer mRNA"! 


identisch zur humanen Ig schweren Kette varlablen Region 

(Klon M49) 


| unbekannt 


Identisch zur Ig Alpha 1- Alpha 2m=lmmunogiobulin A1-A2 
lambda hybrid GAU schwere Kette {secreted alpha chain} 


| identisch zu Pro-alpha 2(1) collagen (C0L1 A2) Gen 


| identisch zu humanen NIC (Natural Inhibitor of Collagenase) 


| identisch zur humanen Ferritin L Kette 


identisch zum humanen Calcyclin Gen (auch Projactln- 
Rezeptor assozllertes Protein) 


Wahrscheinlichkeit fur 
eine spezifische 
Expression im 
Tumorgewebe % 


90.26| 


90.26I 


o 
o 


99,76 1 


o 
o 


9.3.951 


' 93.95 


o 
o 

x— 


o 
o 


99.42 


| 99.991 


| 99.77] 


| 99.11] 


97.85 


| 94.61 


o 
o 


| 93.7 


| -99.54 


| 99.28 


97.9 
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TABELLEE 



DNA-Sequenzen 


Peptid-Sequenzen (ORF's) 


Seq. ID. No. 


Seq. ID. No. 


2 


128 






129 






130 




3 


131 






132 






133 




4 


134 






135 






136 




5 


" 137 






138 






139 




7 


140 






141 






142 




8 


143 




9 


144 






.145 






146 




10 


147 




■ 


148 






149 




11 


150 






151 






152 




12 


153 






154 






155 




13 


156 






157 






158 : 




16 


159 






160 






161 
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DNA-Sequenzen 


Peptid^equenzen (ORFs) 


Seq. ID. No. 


Seq. ID. No. 


18 


162 






163 






164 






165 




20 


166 






167 






168 




23 


169 






170 




26 


171 






172 




27 


. 173 




31 


174 




32 


175 






176 






177 




36 


178 






179 






180 




45 


181 






182 






183 





Die erfinderischen Nukleinsaure-Sequenzen Seq. ID No. 1 bis Seq. ID No. 50 der ermittelten Kandidatengene und die 
ermittelten Aminosaure-Sequenzen Seq. ID No. 51-106 werden in dem nachfolgenden Sequenzprotokoll beschrieben. 

Sequenzprotokoll 

(1) ALLGEMEINE INFORMATION: 



(i) ANMELDER: 

(A) NAME: metaGen - Gesellschaftfur Genomforschung mbH 

(B) "STRASSE: Ihnestrasse 63 

(C) STADT: Berlin 

(E) LAND: Deutschland 

(F) POST CODE (ZIP): D-14195 

(G) TELEFON: (030)-841 3 1673 

(H) TELEFAX: (030)-8413 1674 
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(ii) TITEL DER ERFINDUNG: Menschliche Nukleinsaure-Sequenzen aus 

Blasentumorgewebe 

(iii) Anzahl der Sequenzen: 106 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1 .0, Version #1 .25 (EPO) 

(2) INFORMATION UBER SEQ ID NO: 1: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 202 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 1 

tgagagtcat ggacctcctg cacaagaaca tgaaacacct gtggttcttc ctcctcctgg 60 
tggcagctcc _cagatgggtc ctgtcccagg tgcagctaca gcagtggggc gcaggactgtl20 
tgaagccttc ggagaccctg tccctaacat gcgctgtctc cggtgactct tccagtacttl80 
actactggga ttggatccgc ca 202 

(2) INFORMATION UBER SEQ ID NO: 2: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 1926 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 
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(iii) HYPOTHETISCH: NE1N 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 2 

ttgcgatggc tgatggactg tggctctcta accaaaggac cctagcgggc tcaacaattg 60 
tcaagagcag ttggtggttc tgaatacaat cctcagccaa ggatccctcc tgtgttacag 120 
atggatcagc taaaacaagc caacactgaa gacacaaaga atgaggttag gttcattgaa 180 
accagggtaa cacctgtgga tgagctaaac acaaagatga caatgacctt gtaccaggta 240 
tagaagctca gagacatgcc tgcaaaatga aatccctgag gaattttgca gctacccaga 300 
gatacgtggt tcaaattaaa atgtctgacg gatcactcat ttgaggaaca gcacatcagc 360 
ttcgcccttt acgtggacaa taggtttttt actttgacgg tgacaagtct ccacctggtg 420 
ttccagatgg gagtcatatt cccacaataa gcagccctta ctaagccgag agatgtcatt 480 
cctgcaggca ggacctatag gcacgtgaag atttgaatga aagtacagtt ccatttggaa 540 
gcccagacat aggatgggtc agtgggcatg gctctattcc tattctcaaa ccatgccagt 600 
ggcaacctgt gctcagtctg aagacaatgg acccacgtta ggtgtgacac gttcacataa 660 
ctgtgcagca. catgccggga gtgatcagtc agacatttta atttgaacca cgtatctctg 720 
ggtagctaca aaattcctca gggatttcat tttgcaggca tgtctctgag cttctatacc 780 
tgctcaaggt cagtgtcatcr tttgtgttta. gctcatccaa aggtgttacc ctggtttcaa 840 
tgaacctraac ctcattcttt gtgtcttcag tgttggcttg ttttagctga tccatctgta 900 
acacaggagg gatccttggc tgaggattgt atttcagaac caccaactgc tcttgacaat 960 
tgttaacccg ctaggctcct ttggttagag aagccacagt ccttcagcct ccaattggtgl020 
tcagtactta ggaagaccac agctagatgg acaaacagca ttgggaggcc ttagccctgcl080 
tcctctcaat tccatcctgt agagaacagg agtcaggagc cgctggcagg agacagcatgll40 
tcacccagga ctctgccggt gcagaatatg aacaatgcca tgttcttgca gaaaacgcttl200 
agcctgagtt tcataggagg taatcaccag acaactgcag aatgtagaac actgagcaggl260 
acaactgacc tgtctccttc acatagtcca tatcaccaca aatcacacaa caaaaaggagl320 
aagagatatt ttgggttcaa aaaaagtaaa aagataatgt agctgcattt ctttagttatl380 
tttgaagccc caaatatttc ctcatctttt tgttgttgtc atggatggtg gtgacatggal440 
cttgtttata gaggacaggt cagctgtctg gctcagtgat ctacattctg aagttgtctglSOO 
aaaatgtctt catgattaaa ttcagcctaa acgttttgcc gggaacactg cagagacaatl560 
gctgtgagtt tccaacctca gcccatctgc gggcagagaa ggtctagttt gtccatcaccl620 
attatgatat caggactggt tacttggtta aggaggggtc taggagatct gtcccttttal680 
gagacacctt acttataatg aagtacttgg gaaagtggtt ttcaagagta taaatatcctl740 
gtattctaat gatcatcctc taaacatttt atcatttatt aatcctccct gcctgtgtctl800 
attattatat tcatatctct acgctgcaaa ctttctgcct caatgtttac tgtgcctttgl860 
tttttgctag tgtgtgttgt tgaaaaaaaa aacattccct gcctaagtta gttttggcaal920 
agtatt 1926 



(2) INFORMATION UBER SEQ ID NO: 3: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) l^NGE: 762 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 
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(Hi) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 3 

ctccactgca accacccaga gccatggctc cccgaggctg catcgtagct gtctttgcca 60 
ttttctgcat ctccaggctc ctctgctcac acggagcccc agtggccccc atgactccttl20 
acctgatgct gtgccagcca cacaagagat gtggggacaa gttctacgac cccctgcagcl80 
actgttgcta tgatgatgcc gtcgtgccct tggccaggac ccagacgtgt ggaaactgca240 
ccttcagagt ctgctttgag cagtgctgcc cctggacctt catggtgaag ctgataaacc300 
agaactgcga ctcagcccgg acctcggatg acaggctttg tcgcagtgtc agctaatgga360 
acatcagggg aacgatgact- cctggattct ccttcctggg tgggcctgga gaaagaggct420 
ggtgttacct gagatctggg atgctgagtg gctgtttggg ggccagagaa acacacactc480 
aactgcccac ttcattctgt gacctgtctg aggcccaccc tgccgctgcc ctgaggaggc540 
ccacaggtcc ccttctagaa ttctggacag catgagatgc gtgtgctgat gggggcccag600 
ggactctgaa ccctcctgat gacccctatg gccaacatca acccggcacc accccaaggc660 
tggctgggga acccttcacc cttctgtgag attttccatc atctcaagtt ctcttctatc720 
caggagcaaa gcacaggatc ataataaatt tatgtacttt aa 762 

(2) INFORMATION UBER SEQ ID NO: 4: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 918 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 4 

ctcgagccgc tcgagccgat tcggctcgag ccttcccgct ccctgcttgc aaagtggttg 60 
tgccccaagg tccgcctcca ggccacgtgg gtgctgcggg ccaagctttc ccttcctttgl20 
agagaggttt ccgctgtagg agcagagctt ccgggctgcg ctcttcgttg cccagtttccl80 
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gctcagtggt cgcgtctccg ccccccaccc accagtcccc^ ctgcattctc ggcpgggctc24C 
taggcgccat ggctccccgc gggaggaagc gtaaggctga ggccgcggtg gtcgccgtag3C0 
ccgagaagcg agagaagctg gcgaacggcg gggagggaat ggaggaggcg accgttgtta360 
tcgagcattg cactagctga cgcgtctatg ggcgcaacgc cgcggccctg agccaggcgc420 
tgcgcctgga ggccccagag cttccagtaa aggtgaaccc gacgaagccc cggagggcag480 
cttcgaggtg acgctgctgc gcccggacgg cagcagtgcg gaactctgga ctgggattaa540 
gaaggggccc ccacgcaaac tcaaattccc tgagcctcaa gaggtggtgg aagagttgaa600 
gaagtacctg tcgtagggag atttgggtag aagccctcat gctgagcttt gtgtccctgg660, 
tgatgttgga acattaatga tggaacatgg ccaaacttca gtcatgatcc tgaagccatg720 
gtttcttccc tgccagaaat gaaggttcag .ttatgaggca accctctagt aaggcattgt780 
aaaagttact ggatttggtt taataaaagt tgaaataaag taaaagaaaa aaaaaaaaga840 
gaaaaaagaa aaagaaaaaa agaaagaaga aaaaagaaag gagaagcgag agaaagggag900 
gccgcggggc gcggcggc 918 



(2) INFORMATION UBER SEQ ID NO: 5: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 1 146 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 5 

tcagtttagt ggcagggtgg ttttttaatt ttcttctgtg gctggatttt tgttgtttgt . 60 
ttaaataact cttctgggaa gttggtttat aagcctttgc caggtgtaac tgttgtgaaa 120 
tacccaccac taaagttttt taagttccat attttctcca ttttgccttc ttatgtattt 180 
tcaagattat tctgtgcact ttaaatttac ttaacttacc ataaatgcag tgtgactttt 240 
cccacacact ggattgtgag gctcttaact tcttaaaagt ataatggcat cttgtgaatc 300 
ctataagcag tctttatgtc tcttaacatt cacacctact ttttaaaaac aaatattatt 360 
actattttta ttattgtttg* tcctttataa attttcttaa agattaagaa aatttaagac 420 
cccattgagt tactgtaatg caattcaact ttgagttatc ttttaaatat gtcttgtata 480 
gttcatattc atggctgaaa cttgaccaca ctattgctga ttgtatggtt ttcacctgga 540 
caccgtgtag aatgcttgat tacttgtact cttcttatgc taatatgctc tgggctggag 600 
aaatgaaatc ctcaagccat caggatttgc tatttaagtg gcttgacaac tgggccacca 660 
aagaacttga acttcacctt ttaggatttg agctgttctg gaacacattg ctgcactttg 720 
gaaagtcaaa atcaagtgcc agtggcgccc tttccataga gaatttgccc agctttgctt 780 
taaaagatgt cttgtttttt atatacacat aatcaatagg tccaatctgc tctcaaggcc 840 
ttggtcctgg tgggattcct tcaccaatta ctttaattaa aaatggctgc aactgtaaga 900 
acccttgtct gatataittg caactatgct cccatttaca aatgtacctt ctaatgctca 960 
gttgccaggt tccaatgcaa aggtggcgtg gactcccttt gtgtgggtgg ggtttgtgggl020 
tagtggtgaa ggaccgatat cagaaaaatg ccttcaagtg tactaattta ttaataaaca!080 
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ttaggtgttt gttaaaaaaa aaaaaaaaaa aaaaaaatag; ggtgggtggt Qttgyttgtall40 
ttggtt ^ ' 1146 

(2) INFORMATION UBER SEQ ID NO: 6: 

(i) SEQUENZ CHARAKTERISTIK: 
(A) LANGE: 2407 Basenpaare 

1 (B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOlTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 



(iii) HYPOTHETISCH: NEIN 



(Hi) ANTI-SENSE: NEIN 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 6 



gagtgagtga 
ggttgtccag 
ttgaaatcca 
gaacttgcag 
aaaaagataa 
tcccagctct 
cagcagcaag 
gtcatatcag 
gtccagaaag 
agtggattcc 
gtcggcagcc 
ccctttccaa 
tagagacttc 
gatcagccat 
tgactggctg 
ctccttccat 
ggctttcaat 
agtatctgct 
ttttcttccc 
gcactattaa 
ttttaagtaa 
ctgtgaagct 
tacagccaaa 
tctgtaatat 
aattacagta 
aaatataact 
ctaatatgag 
gcgcatttct 
tctaccagcg 
ttgaagctaa 
aatatttctg 



gtgtgttgca 
attgaagtga 
aagagcaaga 
aaatgaaaag 
atgagatgtc 
cctactca^c 
tcaaacagct 
tttacagaat 
tactgaagca 
ttggcaggac 
gctgccattg 
aggtttctga 
aaaccagcag 
gcccagaggt 
agtggcttta 
tcctgtactc 
tccaaaacat 
taaaaccct.t 
atctgcctta 
ctaaaatato> 
attgttgaca 
caggaaatcc 
agaaatgcct 
ttttattggc 
gattatatta 
ttttccttaa 
ctgccaccaa 
gaatctagca 
cccataaatg 
aaaccctgat 
ttcactttac 



tcgaattaag 
ggtctcacgg 
agtaaatgaa 
atactctgag 
gaaggaagtc 
aagctpatcc 
ccagaaccag 
gcatcttctg 
aatccttacc 
actgcccctt 
ttctcattcg 
ggacttctcc 
aggtgaaagt 
ctggtcctga 
tcaccaccga 
gggcagtgcc 
gatttaattt 
catcatgata 
tatatctcat 
aaacttaaaa 
tactgcaaat 
aaacatttgt 
catagttctt 
tcataaagat 
acaaaatact 
agttttcagc 
cacccctaga 
aatcctcctt 
atctctagaa 
atggtaatat 
tttcaggtta 



gactcttgaa 
tgaaaagaaa 
cttctgcaaa 
agctcttcaa 
accaaattga 
aaaaggcaga 
ctggcggaat 
tatgctgtgc 
atgtgtaaaa 
gtcatctgtc 
tggtatgcac 
caggagaaga 
ccctgtcatc 
tgctggcagg 
gtgatgtgct 
attcagcaca 
ctaactaaat 
tcctgtggat 
caccctgctt 
acaaaagcaa 
tttctatgca 
gtttcaacaa 
aacctcaact 
gttttcatat 
ttttaggtag 
tatagcaaaa 
actttcagcc 
ttacccgttg 
ggactgttag 
tatggtgcat 
aaaatgtttc 



gagaagagag 
aggaaaatat 
aattccagca 
aactggagga 
aggaggcctt 
gtcagcagct 
gcaagaaaca 

accagtctca 
tttgtgttag 
tgtggcctag 
ctgcccgcct 
ccttcagatt 
ggggccccct 
gaggcctcct 
ggagagctct 
tagtatggca 
ttaaaaactc 
atcaatattc 
gttgtcctta 
aacttgcctc 
gggacagtaa 
tttgtagaag 
ctgaactcct 
ccatgcttga 
ggtagttatg 
atggtgtctt 
aatgttttga 
taccaatctg 
agcagaggtc 
taacacgctt 



gtccattcag 60 
tcagactctc 120 
agctcaggaa 180 
agataaagat 240 
gaacagcctc 300 
ggaggcgctg 360 
acaccaggag 420 
ggatgaagat 480 
aaagaagtaa 540 
atccagagtt 600 
cgtagttctt 660 
cagaactgct 720 
ccagagctgg 780 
cctccatccc 840 
gcagtgaatg 900 
ttttgccttt 960 
ctagttatgal020 
taatt ccat gl08 0 
agtttgatgall40 
aaagttcttt!200 
ctgctgttatl260 
actgtgtgttl320 
tatttttttcl380 
aaataagtgal440 
gactttttaalSOO 
tatgccagacl560 
cagaattgtal 620 
atgccctgacl680 
tttttcaactl740 
tcggaaaaaal800 
gcaacttcccl860 
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ttatggcatt aatcttgttg agggagagag acagaatcct ggactctcca aa~tatttca!923 
ctgaaagtag ggcctgctct gacagggccq atgtcccaca? -^ggctgcttg gcctcagtygl930 
gtgcttggct gtgctggatg atatgttgat ctgtattgga taaggaccaa tgacagcaaa2040 
gcaaaaatgg ctttaaagct tggtgttact tttcttaagt tgtttaatta tagttaagca2100 
atttcaaaaa tgctccaaag aaatgtgaaa ggaccttttg tcacagcact tcagaaaata2160 
cacaacagcc ccttctgccc ccgcacagaa atgctgcaga gtatataaaa cttgagacat2220 
ttttgtagga tgcctgacga ggtgtagcct tttatcttgt ttccggatgc atatttatta2280 
cgagtactct ggttaaatat tgaaaagtta tatgctgtag tttttagtat tttgtctttg2340 
taatttacag aagttattgg agaaaataaa cttgtttcat tttgcaaaaa aaaaaaaaaa2400 
atgaaaa 2407 

(2) INFORMATION UBER SEQ ID NO: 7: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 1471 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Edrtierung 

hergestellte partielle cDNA 

(Hi) HYPOTHETISCH: NEIN 

(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 7 

ctcgtgcaac ccggcggctc ctgcagcggt ggtcggctgt tgggtgtgga gtttcccagc 60 
gcccctcggg tccgaccctt tgagcgttct gctccggcgc cagctacctc gctcctcggc 120 
gccatgacca caaccaccac cttcaaggga gtcgacccca acagcaggaa tagctcccga 180 
gttttgcggc ctccaggtgg tggatccaat ttttcattag gttttgatga accaacagaa 240 
caacctgtga ggaagaacaa aatggcctct aatatctttg ggacacctga agaaaatcaa 300 
gcttcttggg ccaagtcagc aggtgccaag tctagtggtg gcagggaaga cttggagtca 360 
tctggactgc agagaaggaa ctcctctgaa gtaagctccg gagacttctt agatctgaag 420 
ggagaaggtg atattcatga aaatgtggac acagacttgc caggcagcct ggggcagagt 480 
gaagagaagc ccgtgcctgc tgcgcctgtg cccagcccgg tggccccggc cccagtgcca 540 
tccagaagaa atccccctgg cggcaagtcc agcctcgtct tgggttagct ctgactgtcc 600 
tgaacgctgt cgttctgtct gtttcctcca tgcttgtgaa ctgcacaact tgagcctgac 660 
tgtacatctc ttggatttgfc* ttcattaaaa agaagcactt tatgtactgc tgtctttttt 720 
ttttttcttt tgaagaacag gtttctctct gtccttgact cttgggtctg tgggccatgg 780 
catgagtgtt ttctagtagt agattggagg gaaagctttg tgacacttag tactgtgttt 840 
ttaagaagaa ataatttggt tccagatgtg ttagaggatc ttttgtactg aggtttttaa 900 
cactttactt gggtttacca agcctcaact ggacagacca taaacagtcc acaggcaccg 960 
ttcctgccag gccccaaccc acagggagtc tctccgcaga gccttcttgg tgttgccctal020 
acttgccagt ggcctttgct cagagcctcc tcctgtgaca tgtgaac'aat gaagaggcctl080 
gcgcctcctg ccttgccgcc tgcaaagcaa agaaactgcc ttttattttt taaccttaaall40 
aagtagccag atagtaacaa gactggctgg ctgatgagca aagcctttgc tctcacgcagl200 
aggaaggctt ggatgtacaa tgaaactgcc tggaactaaa agcagtgaag caagggaggcl260 
aatcacactg aagcgggtct tcctccagga acggggtccc acaggcgtgt tgttttaaatl320 
aacctgatgc tgtgtgcatg atgctggtgc ttgaccatga aaggaaagtc tcatccttaal380 
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aatgtgttgt acttcacaat cctggactgt tgcttcaagt aaacaatatc cacattttgal440 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa a * 1471 

(2) INFORMATION UBER SEQ ID NO: 8: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 1732 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(Ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 
hergestellte partlelle cDNA 

(iii) HYPOTHETISCH: NEIN 

(ill) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 8 

gcagaaccta cgcctgacgg gcccggcggc ggctgagccg cgctgcgcag cgacgcggga 60 
atgaagcggg cgctgggcag gcgaaagggc gtgtggttgc gcctgaggaa gatacttttc 120 
tgtgttttgg ggttgtacat tgccattcca tttctcatca aactatgtcc tggaatacag 180 
gccaaactga ttttcttgaa tttcgtaaga gttccctatt tcattgattt gaaaaaacca 240 
caggatcaag gtttgaatca cacgtgtaac tactacctgc agccagagga agacgtgacc 300 
attggagtct ggcacaccgt ccctgcagtc tggtggaaga acgcccaagg caaagaccag 360 
atgtggtatg aggatgcctt ggcttccagc caccctatca ttctgtacct gcatgggaac 420 
gcaggtacca gaggaggcga ccaccgcgtg gagctttaca aggtgctgag ttcccttggt 480 
taccatgtgg tcacctttga ctacagaggt tggggtgact cagtgggaac gccatctgag 540 
cggggcatga cctatgacgc actccacgtt tttgactgga tcaaagcaag aagtggtgac 600 
aaccccgtgt acatctgggg ccactctctg ggcactggcg tggcgacaaa tctggtgcgg 660 
cgcctctgtg agcgagagac gcctccagat gcccttatat tggaatctcc attcactaat 720 
atccgtgaag aagctaagag ccatccattt tcagtgatat atcgatactt ccctgggttt 780 
gactggttct tccttgatcc tattacaagt agtggaatta aatttgcaaa tgatgaaaac 840 
gtgaagcaca tctcctgtcc cctgctcatc ctgcacgctg aggacgaccc ggtggtgccc 900 
ttccagcttg gcagaaagct ctatagcatc gccgcaccag ctcgaagctt ccgagatttc 960 
aaagttcagt ttgtgccctt tcattcagac cttggctaca ggcacaaata catttacaagl020 
agccctgagc tgccacgga,t actgagggaa ttcctgggga agtcggagcc tgagcaccagl080 
cactgagcct ggccgtggga aggaagcatg aagacctctg ccctcctccc gttttcctccll40 
agtcagcagc ccggtatcct> gaagccccgg ggggccggca cctgcaatgc tcaggagcccl200 
agctcgcacc tggagagcac ctcagatccc aggcggggag gcccctgcag gcctgcagtgl260 
cccggaggcc tgagcatggc tgtgtggaaa gcgtgggtgg caggcatgtg gctctccttgl320 
ccgcccct-ca acctgagatc ttgttgggag acttaatggc agcaggcagc catcactgccl380 
tggttgatgc tgcactgagc tggacagggg gagtccgggc aggggactct tggggctcgg!440 
gaccatgctg agctttttgg caccacccac agagaacgtg gggtccaggt tctttctgcalSOO 
ccttcccagc acatgcagaa tgactccagt ggttccatcg tcccctc'ctg ccctgtgtacl560 
ctgcttgcct ttctcagctg ccccacctcc cctgggctgg cccactcacc cacagtggaal620 
gtgcccggga tctgcacttc ctcccctttc acctacctgt acacctaacc tggccttagal680 
ctgagcttta tttaagaata aaatcgtggt ggtggtcaaa aagacactct gc 1732 
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(2) INFORMATION UBER SEQ ID NO: 9: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 989 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 9 



cggctcgagc gtgatcgtcg actcagctga ccctgcggga ccggaaa^ag aaattcccgg 60 
gccctggctt cttggcgcga tgaggttccg gttctgtggt gatctggact gtcccgactgl20 
ggtcctggca gaaatcagca cgctggccaa gatgtcctct gtgaagttgc ggctgctctgl80 
caccaggtac taaaggagct gctgggacag gggattgatt atgagaagat cctgaagctc240 
acggctgacg ccaagtttga gtcaggcgat gtgaaggcca cagtggcagt gctgagtttc300 
atcctctcca gtgcggccaa gcacagtgtc gatggcgaat ccttgtccag tgaactgcag360 
cagctggggc tgcccaaaga gcacgcggcc agcctgtgcc gctgttatga ggagaagcaa420 
agccccttgc agaagcactt gcgggtctgc agcctacgca tgaataggtt ggcaggtgtg480 
ggctggcggg tggactacac cctgagctcc agcctgctgc aatccgtgga agagcccatg540 
gtgcacctgc ggctggaggt ggcagctgcc ccagggaccc cagcccagcc tgttgccatg600 
tccctctcag cagacaagtt ccaggtcctc ctggcagaac tgaagcaggc ccagaccctg660 
atgagctccc tgggctgagg agaagggtgt tccaggcctg tgtggagccg ccctgcccgt720 
atggagtcac gccctctgaa ctgctcttcg ggaggcagcc ctggttctag gatgctgagg780 
ccctggcccg gactctggcc tcccagatcc ccagctgcct cacttctctc ttgagaactt840 
ggctcagggc tcctgaggac ctttcccagc attaccttcc cttcccttga aaggcaattg900 
ttggctgttt tcataagcag gaaaaataaa cagaagtata aaagagaaaa aaaaaaaaaa960 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaa 989 

(2) INFORMATION OBER SEQ ID NO: 10: 

(i) SEQUENZ CHARAKTERISTIK: : 

(A) LANGE: 150 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 



(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 
hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
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(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 10 

attttatgaa gttgaaaaat agctcacttt aaagctagtt ttgaagacgt gcagctgtga 60 
cttgggtctg gttgggggtg ttgtgttttg agtcagccgt tttcactccc actgaggttgl20 
tcagaacatg cagattgctt cgattttctc 150 



(2) INFORMATION UBER SEQ ID NO: 11: 

(I) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 1467 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 11 

cgaggaccgg ccttgcgagc ggcgacgact ataaaatggc gcgtgctgca acccgcgccc 60 
gcttcggaga gagaaatgct ggggtgcagc ttcaagctta ggaccaccca ccatgcctat 120 
ccaggtgctg aagggcctga* ccatcactca ttaagaacag aggaggctgc ctgttactcc 180 
tggtgttgca tccctccaga cactctgctg tttcctgcct aggcgtggct gcagccatgg 240 
ctaggaaagc gctgccaccc acccacctgg gccagagctg gttctgctcc tgctgcaggg 300 
acactgagct ggctatctcg gcgcttcggg caagaactgc aacaggctct cctgggtcct 360 
gcaggtgtac agccgggccc ctgccttgtg cctcagctct cgagagctgc tgctgccggg 420 
tgacctgatc caacctgata aggtgccatc ttcagctacc actgcaaggc cctgagggca 480 
acagcagcac ggcactgccc acccggctgc tgatggcctg gtgccagfctg ggagtcctcc 540 
cggcacttcg aggccactga gccacccttc cagccccagc ccaccatgga caggggtatc 600 
cagcttcctc ctcaacctcg tcctctgccc ctgagccagt gacgcccaag gacatgcctg 660 
ttacccaggt cctgtaccag cactagctgg tcaagggcat gacagtgctg gaggccgtct 720 
tggagatcca ggccatcact ggcagcaggc tgctctccat ggtgccaggg cccgccaggc 780 
caccaggctc atgctgggac ccaacccagt gcacaaggac ttggctgctg agccacacac 840 
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gggctaccaa gggcttcstg caggctaggg gcggagccac .900 
caggcctatg gggaggagct - gtccatacgc caccgtgaga SCO 
acagacaccg cctggcctgg tgctccaggg gtgaagcaggl020 
ctcctggttt gagctgcatt caggaagtgc gggacatggtl080 
ggcactaccc tccctgtgga gctgttcggt gtccgtcgagll40 
tgttcaaggg taccggaaga gaagggtgtc tgcccccaacl200 
gccagatgtc atgagggaag caggccttgt gagtggacacl260 
gagtgatccc ccaggcatcg tgtgccatgt tgcacttctgl320 
ggtaccatgg gtgcccaccc ctccaccaca tggggcccca!380 
ggcaacccca .cacccttgac ataaaagcat cttgaagcttl440 
aaaataa 1467 

(2) INFORMATION OBER SEQ ID NO: 12: 

(i) SEQUENZ CHARACTERISTIC 

(A) LANGE: 895 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte parfielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 12 

ctcgagccgc tcgagaacct cacttcctta ctcctccaaa aagaagtggg gaaagaacca 60 
tcaaaccttt cctcctgact taccaaacca ggaaaacagc aggagagggt ggctcaggacl20 
ttagggacag ggtatagctt agatggtgga aagcaaagga gagcaggaag ttgtaaatcal80 
ctggctaatg agaaaaggag acagctaact ctaggatgaa gctgtgacta ggctggagtt240 
gcttccttga agatgggact ccttgggtat caagacctat gccacatcac actggggcta300 
gggaagtagg tgatgccagc cctcaagtct gtcttcagcc agggacttga gaagttatat360 
tgggcagtgg ctccaatctg tggaccagta tttcagcttt ccctgaagat caggcagggt420 
gccattcatt gtctttctct cctagccccc tcaggaaaga aggactatat ttgtactgta480 
ccctaggggt tctggaaggg aaaacatgga atcaggattc tatagactga taggccctat540 
ccacaagggc catgactggg aaaaggtatg ggagcagaag gagaattggg attttagggt600 
gcagtacgct caccctaaao ttttggtggc ctggggcatg tcttgaggcc cagactgtta660 
agcaggctct gctggcctgt ttactcgtca ccacctctgc acctgctgtc ttgagactcc720. 
atccagcccc aggcacgcca cctgctcctg agcctccact atctccctgt gacgggtgaa780 
cttcgtgtac tgtgtctcgg gtccatatat gaattgtgag cagggttcat ctattttaaa840 
cacagatgtt tacaaaataa agattatttc aaaccaaaaa gaaaaaaaaa aaaaa 895 



(2) INFORMATION UBER SEQ ID NO: 13: 

(i) SEQUENZ CHARAKTERISTIK: 
(A) LANGE: 467 Basenpaare 



ccaggagaag gtggataagt 
ccccgcttcc ctattgtgac 
cctgggcctg gctctcaagg 
ccagaatcct gggggagctg 
aggggaggca aaaagccttg 
ctagccacac cctgacacca 
ctcccctgtg ggtgtcactg 
tgaccatgag tccctggggg 
cccaggcagc agggtgggtg 
aagcactgca ggccaagcag 
ttaaaaaaaa aaaaaaaaaa 
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(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte parh'elle cDNA 

(iii) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 1 3 

acccagcagc cctcgcgcgg tccggcacag cggacaccag gactccaaaa tggcgtcagt 60 
tgtaccagtg aaggacaaga aacttctgga ggtcaaactg ggggagctgc caagctggatl20 
cttgatgcgg gacttcagtc ctagtggcat tttcggagcg tttcaaagag gttactaccgl80 
gtactacaac aagtacatca atgtgaagaa ggggagcatc tcggggatta ccatggtgct240 
ggcatgctac gtgctcttta gctactcctt ttcctacaag catctcaagc acgagcggct300 
ccgcaaatac cactgaagag gacacactct gcaccccccc accccacgac cttggcccga360 
gcccctccgt gaggaacaca atctcaatcg ttgctgaatc ctttcatatc ctaataggaa420 
ttaacctcca aataaaacat gactggtacg tgtaaaaaaa aaaaaaa 467 



(2) INFORMATION UBER SEQ ID NO: 14: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 511 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 14 
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actagttcta gatcgcgagc 
tctttttttt ttttttgtgg 
gggaggagag cgtgtgcggc 
ctgcctctgg cttctcaggc 
agctgcagcc catcctcccg 
tcagagacaa cttcccaaag 
agactctgta cctattttgt 
gctggaataa agcatgtgga 
aaaaaaaaaa aaaaaaaaaa 



ggccgccctt tttgttcaaa 
atggggactt gtgaattttt 
tccagcccag cccgctgctc 
ctctgctctc cgacctctct 
gcgccctcct agtctgtcct 
cacaaagcag tttttccccc 
atgtgtataa taatttgaga 
aatgacccca aacataaaaa 
aaaaaaaaaa a 



gtctattttt attctctttt 60 
ctaaaggtgc tatttaacatl20 
actttccacc ctctctccacl80 
cctctgaaac cctcctccac240 
gcgtcctctg tccccgggtt300 
taggggtggg aggaagcaaa360 
tgtttttaat tattttgatt420 
aaaaaaaaaa aaaaaaaaaa480 

511 



(2) INFORMATION USER SEQ ID NO: 15: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 1899 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 15 

tctccaccct ccccttcctt ctttcttttt ccccttcttg cacgtggatc actcaggcct 60 

cagaatgagg ctgctttatt. ggaagctatt ctgacatcac tttccagact gtctcactgt 120 

cttgggacca ggcatgggag cjcaggggtgg gaatcttctt gtgattgtgg gtggtggctg 180 

gaggagtgga gtggtgggag gcggctcagt cgcgggcact cctgcgactg gcggatgcgg 240 

tccggatgga ataagccttc aggagcccag gacccgcact gctggagaag ctcagggcat 300 

tgctgcccat ggttcccccg agggtcagcc caatgccacc gccactgcta ctgccaccag 360 

tggaattcat cacagagata ttcacggctc ccactccatc tccagccaac cggctctcct 420 

cgccctccag cagcttgcgg taggtggcga tctcgatgtc cagggccagc ttcacgctca 480 

tgagttcctg gtactcacgc agctgccgtg ccatatcctg cttggcccgc tgcagggcgg 540 

cttccagctc ctcctgcttg. gcacgagcat ccttgagcgc cagctcccca cgctcctcag 600 

cctcggcaat ggcggcctcc aacttggcac gctggttctt gatgttgtcg atctcagcct 660 

gcagcctctg gatggcccgg ttcatctctg aaatctcatt ccgggtattc cggaggtcgt 720 

ccccatgctt cccagcctgg gcctggaggg tctcaaactt ggtctggtac caggcttcag 780 

cctcagcccg gctgcatttg gccatctcct catactgcgc cttgacctca gcgatgatgc 840 

cgtccaggtc cagggagcga ctgttgtcca tggacagcac cacagatgtg tcggagatct 900 

gggactgcag ctctgtcaac tccgtctcat tgagggtcct gaggaagttg atctcatcat 960 

tcagggcatc caccttggcc tccagcjtcca ccttgctcat gtaggcagca tccacatcctl020 

tcttcagcac cacaaactca. ttctcagcag ctgtgcggcg gttaatttca tcttcgtact!080 

tattcttgaa gtcctccacc acatcctgcc atctccgcag ctccgcctcc aggcggccccll40 

catccacctg cagtgcctca agctgacccc gaaggccagc aatctgggcc tcaaagatgtl200 

ctgggaggcg gctgctcttg gccgacttct gctcctgcag cagcgtccac ttggtctccal260 
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gcagcttgtt ctgctgctcc agaaaccgca ccttvjtcgat .gaaguagg^a aacttgttgt3 320 
tgagggtctt gatctgctcg ctctcctcct ggcgbacccg"ctggagggay gggtcggcgtl380 
ccagccgcag cggggccagc aggctctggt taatggtgac ctcgcggatg ccggcgcccal440 
ccgggccccc ataggcagag cgcacggcca cgcgcggccg tgaggcgccg aggccgtagalSOG 
ggctgctgct gccaaggccg ccggggcgag cggagctcag gcgcacctgg gcgccgcggcl560 
ccgagaaggc ggctgagcgc gaggtgaata ccggggagct gaagtggatg gacatggtggl620 
ctgggccggg atggacctag cggcgggcga ggaggagcgc gcactcgctg acctcggggal680 
cactccgcac cttttatccg cgggagccgg tgctgggctt ccacaggtag gggcggggctl740 
ggccgcgggc accgtttctc tgctgccagg cccctcctgc gcgtccgtcc gccctctgccl800 
cgcccgcccc gccgaagccc aggctttcag tccaagcagg gatggtccgg agtaggcaggl860 
agcgccatcc ctagacggcc gcagagaaca gcgggggac 1899 

(2) INFORMATION UBER SEQ ID NO: 16: 

(I) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 758 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblieaing und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 16 

cggctcgagc ggctcgaggg gcaagaagaa patcacgtgc tgtgacaccg acttgtgcaa 60 

cgccagcggg gcccatgccc tgcagccggc tgccgccatc cttgcgctgc tccctgcactl20 

cggcctgctg ctctggggac ccggccagct ataggctctg gggggccccg ctgcagcccal80 

cactgggtgt ggtgccccag gcctctgtgc cactcctcac agacctggcc cagtgggagc240 

ctgtcctggt tcctgaggca catcctaacg caagtctgac catgtatgtc tgcacccctg300 

tgccccaccc tgaccctccc atggccctct ccaggactcc cacccggcag atcagctcta360 

gtgacacaga tccgcctgca gatggcccct ccaaccctct ctgctgctgt ttccatggcc420 

cagcattctc cacccttaac cctgtgctca ggcacctctt cccccaggaa gccttccctg480 

cccaccccat ctatgactt;g agccaggtct ggtccgtggt gtcccccgca cccagcaggg540 

gacaggcact caggagggcc cagtaaaggc tgagatgaag tggactgagt agaactggag600 

gacaagagtc gacgtgagtt> cctgggagtc tccagagatg gggcctggag gcctggagga660 

a gggg cca 9g cctcacattc gtggggctcc ctgaatggca gcctgagcac agcgtaggcc720 

cttaataaac acctgttgga taagccaaaa aaaaaaaa 758 



(2) INFORMATION UBER SEQ ID NO: 17: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 302 Basenpaare 

(B) TYP: Nukleinsaure 
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(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editieaing 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHRE1BUNG: SEQ ID NO: 17 

cggctcgacg gtctcgagat caagcatgaa ttgcaagcaa actgctacga ggaggtcaag 60 
gaccgctgca ccctggcaga gaagctgggg ggcagtgccg tcatctccct ggagggcaagl20 
cctttgtgag cccctttctg cgcccccttg cctgggagca tctgggcagg ccccaacaccl80 
ttgccctttg ggggtttgca gggctcgccc cctttcctgg ccagaaccgg gagggggctg240 
gggggggatt cccaggcagg gggggagggg ccaattccct tttcaacccc caggttgggc300 
ca 302 



(2) INFORMATION UBER SEQ ID NO: 18: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 824 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editieaing 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: - 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 18 

ggcgaggaca gaggaggcgc gtccggcctt cctgaacacc ttaggctggt ggggctgcgg 60 
caagaagcgg gtctgtttct ttacttcctc cacggagtcg gcacactatg gctgcccctgl20 



73 



DE 198 18 619 A 1 



ggctcccaga acccacaaca tgaaagaaat 
aatccggaca caaaaccctc tagcttggaa 
ctacctgact caggaatcgg ctctggaagg 
caacatcaaa gacaccatcg gaacagcagc 
ccatcttcat ggccaccccc tgcggcggac 
agctgagctc ctccagcggg atgacgccgt 
ccttctgtct ctttgtttct gagctttcct 
ctccacgact ctgtttcccc cgtcccttct 
ggttggagtt gtggagacgg ccttgagtct 
ttggcaagtg cctgtgcagg gcccggccgc 
ggctgtgtgc ccgaaggcct caccctggcc 
cgaacatcaa gggaggcaag cctttcaagg 



ggtgccaccc agctcaagnc tgggcctttg3 80 
atgaatatgc tgczictttac aaccactgca2£0 
tgaagctaga ggaaccagac ctcatcagcc300 
gcccgcagca cccaccccgc accggcgact360 
ggttgaccac cagccaccac atcatcccag420 
ccccaccacc tccctcttct tctttttcat4B0 
gtctttcctt ttttctgaga gattcaaagc540 
gaatttaatt tgcactaagt catttgcact600 
cagtacgagt gtgcgtgagt gtgagccacc660 
cctccatctg ggccgggtga ctggggcgcc720 
cttcggcctt agtctgggaa ggttccgaac780 
catttccatt aatt 824 



(2) INFORMATION UBER SEQ ID NO: 1 9: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 2190 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 



(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 19 



tttttttttt tttttttttt tttttttttt tttttttttt tttcatgact tttaatgctt 60 
tattgggatt gcaagcgtta caaggttaaa gacaaaaccc aagcatggga ttttgccgga 120 
aatattagcg ttaaaggagc tgagttgagt caaacacggg ccgcaaggtg gaccgaggcg 180 
..gcaggcacag gtgacattca gtgtttggcg tgggggtctt caggtgatgg cagaggaggg 240 
gacccaagag ggggcccccc actgaagaca ttggggacac ggggaggaga caagatggag 300 
agccacgact aggcacggag gtcagacagg cagcccgggc caggatggtt agtggcccag 360 
gggagagctg. caaacctggg gacgcaaggg gctggtcggc aagtgccccc gggaacaccc 420 
actccggcga ggcagaatatf aacactgggt gggtgggtgt cctgacgaat gggcaggtaa 480 
tttggggtgc ctcgaagcgt tttggatctc aggccaatgt gggttccaca attgtgacaa 540 
tttggctctt tgggcttctg tccaatgttc cgaatggccc actcacaggg cgcttgccga 600 
gggaccctfct gcgacgctcg agctcgagcc gaaatgaggg aacccccaaa tttcatgtca 660 
attgatctat tccccctctt tgtttcttgg ggcagttttt tttttacccc tccttagctt 720 
tatgcgctca gaaaccaaat taaacccccc ccccatgtaa caggggggca gtgacaaaag 780 
caagaacgca cgaagccagc ctggagacca ccacgtcctg ccccccgcca tttatcgccc 840 
tgattggatt ttgtttttca tctgtccctg ttgcttgggt . tgagttgagg gtggagcctc 900 
ctggggggca ctggccactg agcccccttg gagaagtcag aggggagtgg agaaggccac 960 
tgtccggcct ggcttctggg gacagtggct ggtccccaga agtcctgagg gcggagggggl020 
gggttgggca gggtctcctc aggtgtcagg agggtgctcg gaggccacag gagggggctcl080 
ctggctggcc tgaggctggc cggaggggaa ggggctagca ggtgtgtaaa cagagggttcll40 
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catcaggctg 

ggtgacatct 

gggacctctt 

gcccacagag 

cccacgggtc 

ctcggcctct 

gcagcccgac 

gaccccgggg 

ctgtgagtgg. 

tggaacccac 

tcgtcaggac 

gggcacttgc 

ccatcctggc 

tcctccccgt 

atcacctgaa 

ccttgcggcc 

ccatgcttgg 

tcatgaaatt 



gggcagggtg 
tgcccgcccc 
gggggcaagg 
caggagagct 
tggtgatgcc 
gggaggttta 
tagcagtcta 
gtcgtccatg 
gccattcgga 
attggcctga 
acccacccac 
cgaccagccc 
ccgggctgcc 
gtccccaatg 
gacccccacg 
cgtgtttgac 
gttttgtctt 
caaaaaaaaa 



gccgccttcc 
tcagcaccct 
tggggtgagg 
gccaggtctg 
atagcagcca 
cctcgccccc 
gaggcctgag 
ccagtccgcc 
acattggaca 
gatccaaaac 
ccagtgttat 
cttgcgtccc 
tgtctgacct 
tcttcagtgg 
ccaaacactg 
tcaactcagc 
taaccttgta 
aaaaaaaaaa 



gcacHCtfcga 
gcctcgcctc 
ccggggagta 
cccatcgacc 
ccaccgcggc 
acttgtgccc 
gcttctgggt 
tcagtcgcag 
gaagcccaaa 
gcttcgaggc 
attctgcctc 
caggtttgca 
ccgtgcctag 
ggggccccct 
aatgtcacct 
tcctttaacg 
acgcttgcaa 



ggaaccctf.c 
caggaggtcc 
gggaggtcag 
aggttgcttg 
gcctagggct 
ccagctcagc 
cctggtgacg 
agggtccctc 
gagccaaatt 
accccaaatt 
gccggagtgg 
gctctcccct 
tcgtggctct 
cttgggtccc 
gtgcctgccg 
ctaatatttc 
tcccaataaa 



eccctccctc3 200 
gaagctctgtl^tiO 
gcgggtctgal320 
ggccccggag 13 8 0 
gcggcaggga!440 
ccccctgcacl500 
gggctggcatl560 
ggcaagcgccl620 
gtcacaattgl680 
acctgcccatl740 
gtgttcccggl800 
gggccactaal860 
ccatcttgtcl920 
ctcctctgccl980 
cctcggtcca2040 
cggcaaaatc2100 
gcattaaaag2160 
2190 



(2) INFORMATION OBER SEQ ID NO: 20: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 2565 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MGLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 20 

ctcccccacc tgtggcccgc aagccgtctg tgggagtccc ggaccccgcc tcccccagtt 60 
accctcgagc tgagcccctt actgctcctc ccaccaatgg gctccctcac acccaggaca 120 
ggactaagag ggagctggccp gagaatggag gtgtcctgca gctggtgggc ccagaggaga 180 
agatgggcct cccgggctca gactcacaga aagagctggc ctgaccacca ggcacctcac 240 
tggcactgct gacccatccc agaaacacaa tctcagggac ccgagcagct ccaaggacga 300 
gaggatacag cagacacaac ctaatagaga gggcgcctgc agccttaacc tccacggcct 360 
tcgatactta tgcaagcctg gtgttgctcc tgtcctcaga gtcatcctgc gctcatgcct 420 
tttcccgaat gggttcacct ctggcagttg ccgcttcagt cttggcctta gcctcatctt 480 
gaagtgggta gctggcggga gagggtggct gcgccccctg ctggccctga ggctgcagag 540 
ttgggagcag gacacctcac ctgagtttca ttttttttca tgtccaaacc atgcacatac 600 
tatagtccag aatcaaagca cttttgaaaa gtggctgcat ggccatcctc cagggcccag 660 
gaagttgcat tccaagggcc tgtttacatg gcagcagaat ccatccccgg cagtcagccc 720 
atagcttggg accagtctgt gccctcctgc ccagtccagt ttactcctct tggttcctga 780 
aggtggccaa gtcattgtgt tcccacaggc ttctctaggc tgggggcagg tgtggggctg 840 
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tggaattcca aagcacaaaa ggtgcagagg ggartggcct- tcctgtcccrL caactcaeca 900 

accaccctcc tgccttccag ttctgccagg tgctccatgtr -tg^ggacaag taggagactg 960 

ccagggccca aagaaatggg tgagcagtag agtcatctcg gggcacttgg cagtgtcaagl020 

cacctgcccc ttgcctcctt gaccacactg gggtgggtgg gcccccagca cttcagaggclO80 

aggagccttt gggctgagca agcactgagg aggtggatgg aagggagcat ctggagggggll40 

ggagcttcct tgagcagtgg gcccaggcct ggccctccac acttcattct ctgacctttcl200 

tctctcctca tttcggtgca tgtcctttct gcagctgcct ttcagcacag gtggttccacl260 

tgggggcagc taacgctgag tgacaaggat gggaagccac aggtgcattt tactcaagtcl320 

ttctctagtc aatgaggggc acccagtgct tctagggcag gctgggtggt ggtcccctagl380 

gtatcagcct ctcttactgt actctccggg aatgttaacc tttctatttt cagcctgtgcl440 

cacctgtcta ggcaagctgg cttccccatt ggcccctgtg ggtccacagc agcgtggctglSOO 

ccccccaggg ccaccgcttc tttcttgatc ctctttcctt aacagtgact tgggcttgagl560 

tctggcaagg aaccttgctt ttagcttcac caccaaggag agaggttgac atgacctcccl620 

cgccccctca ccaaggctgg gaacagaggg gatgtggtga gagccaggtt cctctggcccl680 

tctccagggt gttttccact agtcactact gtcttctcct tgtagctaat caatcaatat!740 

tcttcccttg cctgtgggca gtggagagtg ctgctgggtg tacgctgcac ctgcccactgl800 

agttggggaa agaggataat cagtgagcac tgttctgctc agagctcctg atctaccccal860 

ccccctagga tccaggactg ggtcaaagct gcatgaaacc aggccctggc agcaacctggl920 

gaatggctgg aggtgggaga gaacctgact tctctttccc tctccctcct ccaacattacl980 

tggaactcta tcctgttagg atcttctgag cttgtttccc tgctgggtgg gacagaggac2040 

aaaggagaag ggagggtcta gaagaggcag cccttctttg tcctctgggg taaatgagct2100 

tgacctagag taaatggaga gaccaaaagc ctctgatttt taatttccat aaaatgttag2160 

aagtatatat atacatatat atatttcttt aaatttttga gtctttgata tgtctaaaaa2220 

tccattccct ctgccctgaa gcctgagtga gacacatgaa gaaaactgtg tttcatttaa2280 

agatgttaat taaatgattg aaacttgaaa aaggctactg cttcttaatg ttggggggac2340 

agggcagtgg tctgggccca catttagaag ggaaaatgtt ttgcctgctg cacacattgg2400 

acccaagtat gggcctcttc tgcctagtac tgccaaaggg actgttaagg tgtcttgtcc2460 

atcttctacc ccccaccccc cattacaggg taaagggaac cccagactag gtgaggggcc2520 

agcagctgcc tcacacttgt gttctctcct gagatggtcc agctt 2565 



(2) INFORMATION UBER SEQ ID NO: 21: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 461 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 21 

gttcctccct agcggctcgc tcagaagcag agttctgggg tgtctccacc atggcctgga 60 
cccctctctg gctcactctc ctcactcttt gcataggttc tgtggtttct tctgagctgal20 
ctcaggaccc tgctgtgtct gtggccttgg gacagacagt caggatcaca tgccaaggagl80 
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acagcctcag aagctattat gcaagctggt accagcagaa gccagoacag gcccctgtac240 
ttgtcatcta tggtaaaaac aaccggccct cagggatcco ^gaccgat^c tctggctcca300 
gctcaggaaa cacagcttcc ttgaccatca ctggggctca ggcggaagat gaggctgact360 
attacttgta aactccccgg gacagcaagt gggtaaccaa tgtgggtatt ccggcgggag420 
ggacccaagc ttgacccgtt cttaaggtca gcccaaaggg c 461 

(2) INFORMATION UBER SEQ ID NO: 22: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 2096 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partlelle cDNA 

(iii) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 22 

atcccggtca cttttggtta caggacgtgg cagctggttg gacgagggga gctggtgggc 60 
agggtttgat cccagggcct gggcaacgga ggtgtagctg gcagcagcgg gcaggtgagg 120 
accccatctg ccgggcaggt gagtcccttc cctccccagg cctcgcttcc ccagccttct 180 
gaaagaagga ggtttagggg atcgagggct ggcggggaga agcagacacc ctcccagcag 240 
aggggcagga tgggggcagg agagttagca aaggtgacat cttctcgggg ggagccgaga 300 
ctgcgcaagg ctggggggtt atgggcccgt tccaggcaga aagagcaaga gggcagggag 360 
ggagcacagg ggtggccagc gtagggtcca gcacgtgggg tggtacccca ggcctgggtc 420 
agacagggac atggcagggg acacaggaca gaggggtccc cagctgccac ctcacccacc 480 
gcaattcatt tagtagcagg cacaggggca gctccggcac ggctttctca ggcctatgcc 540 
ggagcctcga gggctggaga gcgggaagac aggcagtgct cggggagttg cagcaggacg 600 
tcaccaggag ggcgaacggc cacgggaggg gggccccggg acattgcgca caaaggaggc 660 
tgcaggggct cggcctgcgg gcgccgg.tcc cacgaggcac tgcggcccag ggtctggtgc 720 
ggagagggcc cacagtggac ttggtgacgc tgtatgccct caccgctcag cccctggggc 780 
tggcttggca gacagtacag catccagggg agtcaagggc atggggcgag accagactag 840 
gcgaggcggg cggggcggaxj tgaatgagct ctcaggaggg aggatggtgc aggcaggggt 900 
gaggagcgca gggggcggcg agcgggaggc actggcctcc agagcccgtg gccaaggcgg 960 
gcctcgcggg cggcgacgga> gccgggatcg gtgcctcagc gttcgggctg gagacgaggcl020 
caggtctcca gctggggtgg acgtgcccac cagctgccga aggcaagacg ccaggtccgg!080 
tggacgtgac aagcaggaca tgacatggtc cggtgtgacg gcgaggacag aggaggcgcgll40 
tccggccttc ctgaacacct taggctggtg gggctgcggc aagaagcggg tctgtttcttl200 
tacttcctcc acggagtcgg cacactatgg ctgccctctg ggctcccaga acccacaacal260 
tgaaagaaat ggtgctaccc agctcaagcc tgggcctttg aatccggaca caaaaccctcl320 
tagcttggaa atgaatatgc tgcactttac aaccactgca ctacctgact caggaatcggl380 
ctctggaagg tgaagctaga ggaaccagac ctcatcagcc caacatcaaa gacaccatcgl440 
gaacagcagc gcccgcagca cccaccccgc accggcgact ccatcttcat ggccacccccl500 
tgcggcggac ggttgaccac cagccaccac atcatcccag agctgagctc ctccagcgggl560 
atgacgccgt ccccaccacc tccctcttct tctttttcat ccttctgtct ctttgtttctl620 
gagctttcct gtctttcctt ttttctgaga gattcaaagc ctccacgact ctgtttccccl680 
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cgtcccttct gaatttaatt tgcactaagt 
ccttgagtct cagtacgagt gtgcgtgagt 
gcccggccgc cctccatctg ggccgggtga 
ccctgccctc gcctagtctg gaagctccga 
cattacgccc catctcgctc tgtgcccctc 
tcatcatcaa taaacactgt tacagcaaaa 
aaagaaaaag aaaaggaaaa aaagagaggg 



catctgcact ggttggagtt gtggag£.c\jql740 
gtgagccacc ttggcaagtg cctgtgcaggI300 
ctgggcgccg gctgtgtgcc cgaggcctcal860 
ccgacatcac ggagcagcct tcaagcattcl920 
cccaccaggg cttcagcagg agccctggacl980 
aaaaaaaaga aaaaaaaaag aaaagaaaaa2040 
aggaagaagg agaaaaggga gtgtgg 2096 



(2) INFORMATION ClBER SEQ ID NO: 23: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 1348 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 23 

ctctcgcgag gccccagaga gcaggcgctg ggcagtgtgg aggtcgttgg agtcacttcc 60 
gcgtcajccag ctcctgtgcc tgccagtcgg tgcccctccc gctccagcca tgctctccgc 120 
cctcgcccgg cctgccagcg ctgctctccg ccgcagcttc agcacctcgg cccagaacaa 180 
tgctaaagta gctgtgctag gggcctctgg aggcatcggg cagccacttt cacttctcct 240 
gaagaacagc cccttggtga gccgcctgac cctctatgat atcgcgcaca cacccggagt 300 
ggccgcagat ctgagccaca tcgagaccaa agccgctgtg aaaggctacc tcggacctga 360 
acagctgcct gactgcctga aaggttgtga tgtggtagtt attccggctg gagtccccag 420 
aaagccaggc atgacccggg acgacctgtt caacaccaat gccacgattg tggccaccct 480 
gaccgctgcc tgtgcccagc actgcccgga agccatgatc tgcgtcattg ccaatccggt 540 
taattccacc atccccatca cagcagaagt tttcaagaag catggagtgt acaaccccaa 600 
caaaatcttc ggcgtgacga ccctggacat cgtcagagcc aacacctttg ttgcagagct 660 
gaagggtttg gatccagctc gagtcaacgt ccctgtcatt ggtggccatg ctgggaagac 720 
catcatcccc ctgatctctcf agtgcacccc caaggtggac tttccccagg accagctgac 780 
agcactcact gggcggatcc aggaggccgg cacggaggtg gtcaaggcta aagccggagc 840 
aggctctgcc accctctcca tggcgtatgc cggcgcccgc tttgtcttct cccttgtgga 900 
tgcaatgaat ggaaaggaag gtgttgtgga atgttccttc gttaagtcac aggaaacgga 960 
atgtacctac ttctccacac cgctgctgct tgggaaaaag ggcatcgaga agaacctgggl020 
catcggcaaa gtctcctctt ttgaggagaa gatgatctcg gatgccatcc ccgagctgaal080 
ggcctccatc aagaaggggg aagatttcgt gaagaccctg aagtgagccg ctgtgacgggll40 
tggccagttt ccttaattta tgaaggcatc atgtcactgc aaagccgttg cagataaactl200 
ttgtatttta atttgctttg gtgatgatta ctgtattgac atcatcatgc cttccaaattl260 
gtgggtggct ctgtgggcgc atcaataaaa gccgtccttg attttaaaaa aaaaaaaaaal320 
aaaaaaaaaa aaaaaaaaaa aaaaaaaa 1348 
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(2) INFORMATION UBER SEQ ID NO: 24 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 358 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: aus einzefnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 24 

aatgggggac cggagaagaa gtacaagagc acgggtcagg ggcgggactc cgacggctcc 60 
ttcttcc'tct acagcaggct aaccgtggac aagagcaggt ggcaggaggg gaatgtcttcl20 
tcatgctccg tgatgcatga ggctctgcac aaccactaca cgcagaagag cctctccctgl80 
tctccgggta aatgagtgcg acggccggca agcccocgct ccccgggctc tcgcggtcgo240 
acgaggatgc ttggcacgta ccccgtgtac atacttccca ggcacccagc atggaaataa300 
agcacccagc gctgccctgg ggcccctgcg aaaaaaaaga aaaagaatcg aaaagggg 358 



(2) INFORMATION UBER SEQ ID NO: 25: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 89 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 
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(A) BIBLIOTHEK: cDNA library 
(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 25 

gcccctagcc cctggcagac atagctgctt cagtgcccct tttcctctgc tggctagatg60 
gatgttgatg cactggaggt acttttagc 89 

(2) INFORMATION UBER SEQ ID NO: 26: 

(i) SEQUENZ CHARACTERISTIC 

(A) LANGE: 1 632 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 26 

gacactggtt ggttctgata agaggcaggg gaggagaaag ccgaggaaga gggagttgcg 60 
gaagaggagg gagttaacaa gttctcttat ccaccatcac accgggagtg ttgtccagcc 120 
gtggaggagg aggacgatga agaagctgta aagaaagaag ctcacagaac ctctacttct 180 
gccttgrtctc caggatccaa gcccagcact tgggtgtctt gcccagggga ggaagagaat 240 
caagccacgg aggataaaag aacagaaaga agtaaaggag ccaggaagac ctccgtgtcc 300 
ccccgatctt caggctccga ccccaggtcc tgggagtatc gttcaggaga ggcgtccgag 360 
gagaaggagg aaaaggcaca caaagaaact gggaaaggag aagctgcccc agggccgcaa 420 
tcctcagccc cagcccagag gccccagctc aagtcctggt ggtgccaacc cagtgatgaa 480 
gaggagggtg aggtcaaggc tttgggggca gctgagaagg atggagaagc tgagtgtcct 540 
ccctgcatcc ccccaccaag tgccttcctg aaggcctggg tgtattggcc aggagaggac 600 
acagaggaag aggaagatga ggaagaagat gaggacagtg actctggatc agatgaggaa 660 
gagggagaag ctgaggcttc ctcttccact cctgctacag gtgtcttctt gaagtcctgg 720 
gtctatcagc caggagagga cacacagtga tacaggatca gccgaggatg aaagagaagc 780 
tgagacttct gcttccacac cccctgcaag tgctttcttg aaggcctggg tgtatcggcc 840 
aggagaggac actggatagt- gaggataagg aagatgattc agaagcagcc ttaggagaag 900 
ctgagtcaga cccacatccc tcccacccgg accagagggc ccacttcagg ggctggggat 960 
atcgacctgg aaaagagaca gaggaagagg aagctgctga ggactgggga gaagctgagcl020 
cctgcccctt ccgagtggcc atctatgtac ctggagagaa gccaccgcct ccctgggctcl080 
ctcctaggct gcccctccga ctgcaaaggc ggctcaagcg cccagaaacc cctacrtcatgll40 
atccggaccc tgagactccc ctaaaggcca gaaaggtgcg cttctccgag aaggtcactgl200 
tccatttcct ggctgtctgg gcagggccgg cccaggccgc ccgccagggc ccctgggagcl260 
agcttgctcg ggatcgcagc cgcttcgcac gccgcatcac ccaggcccag gaggagctgal320 
gcccctgcct cacccctgct gcccgggcca gagcctgggc acgcctcagg aacccaccttl380 
tagcccccat ccctgccctc acccagacct tgccttcctc ctctgtccct tcgtccccagl440 
tccagaccac gcccttgagc caagctgtgg ccacaccttc ccgctcgtct gctgctgcaglSOO 
cggctgccct ggacctcagt gggaggcgtg gctgagacca actggtttgc ctataatttal560 
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ttaactattt attttttcta agtgtgggtt tatataagga ataaagcctt ttgatttgtal6Z0 
acgaaaaaaa aa 1632 



(2) INFORMATION OBER SEQ ID NO: 27: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 2972 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOG1E: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte parBelle cDNA 

(Hi) HYPOTHET1SCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 27 

ccaggacgag cacctcatta cattcttcgt gcctgtcttt gagccgctgc cccctcagta 60 
cttcatccga gtggtgtctg accgctggct ctcttgtgag acccagctgc ctgtctcctt 120 
ccggcacctg atcttgccgg agaagtaccc ccctccaacc gaacttttgg acctgcagcc 180 
cttgcccgtg tctgctctga gaaacagtgc ctttgagagt ctttaccaag ataaatttcc 240 
tttcttcaat cccatccaga cccaggtgtt taacactgta tacaacagtg acgacaacgt 300 
gtttgtgggg gcccccacgg gcagcgggaa gactatttgt gcagagtttg ccatcctgcg 360 
aatgctgctg cagagctcgg aggggcgctg tgtgtacatc acccccatgg aggccctggc 420 
agagcaggta tacatggact ggtacgagaa gttccaggac aggctcaaca agaaggtggt 480 
actcctgaca ggcgagacca gcacagacct gaagctgctg ggcaaaggga acattatcat 540 
cagcacccct gagaagtggg acatactttc ccggcgatgg aagcagcgca agaacgtgca 600 
gaacatcaac ctcttcgtgg tggatgaggt ccaccttatc gggggcgaga atgggcctgt 660 
cttagaagtg atctgctccc gaatgcgcta catctcctcc cagattgagc ggcccattcg 720 
cattgtggca ctcagctctt cgctctccaa tgccaaggat gtggcccact ggctgggctg 780 
cagtgccacc tccaccttca acttccatcc caatgtgcgt cccgtcccct tggagctgca 840 
catccagggc ttcaacatca gccatacaca aacccgcqtg ctctccatgg ccaagcctgt 900 
gtaccatgct atcaccaagc actcgcccaa gaagcctgtc attgtctttg tgccgtctcg 960 
caagcagacc cgcctcactrg ccattgacat cctcaccacc tgtgcagcag acatccaacgl020 
gcagaggttc ttgcactgca ccgagaagga tctgattccg tacctggaga agctaagtgal080 
cagcacgctc aaggaaacgc tgctaaatgg ggtgggctac ctgcatgagg ggctcagcccll40 
catggagcga cgcctggtgg agcagctctt cagctcaggg gctatccagg tggtggtggcl200 
ttctcggagt ctctgctggg gcatgaacgt ggctgcccac ctggtaatca tcatggatacl260 
ccagtactfac aatggcaaga tccacgccta tgtggattac cccatctatg acgtgcttcal320 
gatggtgggc cacgccaacc gccctttgca ggacgatgag gggcgctgtg tcatcatgtgl380 
tcagggctcc aagaaggatt tcttcaagaa gttcttatat gagccatjtgc cagtagaatcl440 
tcacctggac cactgtatgc atgaccactt caatgctgag atcgtcacca agaccattgalSO.O 
gaacaagcag gatgctgtgg actacctcac ctggaccttt ctgtaccgcc gcatgacacal560 
gaaccccaat tactacaacc tgcagggcat ctcccatcgt cacttgtcgg accacttgtcl620 
agagctggtg gagcagaccc tgagtgacct ggagcagtcc aagtgcatca gcatcgaggal680 
cgagatggac gtggcgcctc tgaacctagg catgatcgcc gcctactatt acatcaactal740 
caccaccatt gagctcttca gcatgtccct caatgccaag accaaggtgc gagggcttatl800 
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cgagatcatc tccaatgcag cagagtatga gaacattccc atccggcacc atgaagaCaal860 
tctcctgagg cagttggctc agaaggtccc ccacscigctg aataacccts. agttcaal;gal320 
tccgcacgtc aagaccaacc tgctcctgca ggctcacttg tctcgcatgc agctgagtgcl980 
tgagttgcag tcagatacgg aggaaatcct tagtaaggca. atccggctca tccaggcctg2040 
cgtggatgtc ctttccagca atgggtggct cagccctgct ctggcagcta tggaactggc2100 
ccagatggtc acccaagcca tgtggtccaa ggactcatac ctgaagcagc tgccacactt2160 
cacctctgag catatcaaac gttgcacaga caagggagtg gagagtgttt tcgacatcat2220 
ggagatggag gatgaagaac ggaacgcgtt gcttcagctg actgacagcc agattgcaga2280 
tgtggctcgc ttttgtaacc gctaccctaa tatcgaacta tcttatgagg tggtagataa2340 
ggacagcatc cgcagtggcg ggccagttgt ggtgctggtg cagctggagc gaga ggagga2 400 
agtcacaggc cctgtcattg cgcctctctt cccgcagaaa cgtgaagagg gctggtgggt2460 
ggtgattgga gatgccaagt ccaatagcct catctccatc aagaggctga ccttgcagca2520 
gaaggccaag gtgaagttgg actttgtggc cccagccact ggtgcccaca actacactct2580 
gtacttcatg agtgacgctt acatgggatg tgaccaggag tacaaattca gcgtggatgt2640 
gaaagaagct gagacagaca gtgattcaga ttgagtcctg aggcatttac ttttgggtaa2700 
aggagagttg agcctgaatt aggaatgtgt acattgtagg aatcctggtt gtggggacca2760 
ggtctgtggg cctcaggtct ggccagccag ggctggtgct gtccccgcct acctccactt2820 
cctttccctt gctcactctg gatccagtga cagcaggtgt catgggtcaa gcataaatca2880 
tatatagcat tttcaggcat gttcctggta gttcttttga gtctgacatt c.taataaaat2940 
aatttgtaga aaaaaaacca aaaaaaaaaa aa 2972 

(2) INFORMATION UBER SEQ ID NO: 28: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 496 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(in) ANTI-SENSE: NEIN 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 28 



ctcgagccga agagtcctgg acctcctgtg 
tctcctggtg gcagctccca 1 gatgggtcct 
aggactggtg aagccttcgg agaccctgtc 
cagtagttac tactggagct ggatccggca 
gtatatctat tacagtggga gcaccaacta 
atcagtagac acgtccaaga accagttctc 
cacggccgtg tattactgtg cgagacaggg 
ccagggaacc ctggtcaccg tctcctgagc 
ggcaccctgc tccaag 



caiagaacatg aaacatctgt ggttcttcct 60 
gtcccaggtg cagctgcagg agtcgggcccl20 
cctcacctgc actgtctctg gtggctccatl80 
gcccccaggg aagggactgg agtggattgg240 
caacccctcc ctcaagagtc gagtcaccat300 
cctgaagctg agctctgtga ccgctgcgga360 
tatagcagtg gaccagcttg actactgggg420 
ctgcaccaag gggccatcgg tcttccccct480 

496 



(2) INFORMATION UBER SEQ ID NO: 29: 
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(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 397 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 29 

gaggtcctgg acctcctgtg caagaacatg aaacacctgt ggttcttcct cctcctggtg 60 
gcagctccca gatgggtcct gtcccaggtg- cagctgcagg agtcgggccc aggactggtgl20 
aagccttcgg agaccctgtc cctcacctgc actgtctctg gtggctccat cagtagttacl80 
tactggagct ggatccggca gcccgccggg aagggactgg agtggattgg gcgtatctat240 
accagtggga gcaccaacta caacccctcc ctcaagagtc gagtcaccat gtcagtagac300 
acgtccaaga accagttctc cctgaagctg agctctgtga ccgccgcgga cacggccgtg360 
tattactgtg cgagagcaaa acgcagctgg acctcag 397 



(2) INFORMATION UBER SEQ ID NO: 30: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 772 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 
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(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO:' 36 

gggactcctc agttcacctt ctcacaatga ggctccctgc tcagctcctg gggctgctaa 60 
tgctctgggt ctctggatcc agtggggata ttgtgatgac tcagtctcca ctctccctgcl20 
ccgtcacccc tggagagccg gcctccatct cctgcaggtc tagtcagagc ctcctgcatal80 
gtaatggata caactatttg gattggtacc tgcagaagcc agggcagtct ccacagctcc240 
tgatctattt gggttctaat cgggcctccg gggtccctga caggttcagt ggcagtggat300 
caggcacaga ttttacactg aaaatcagca gagtggaggc tgaggatgtt ggggtttatt360 
actgcatgca agctctacaa actcctctca ctttcggcgg agggaccaag gtggagatca420 
aacgaactgt ggctgcacca tctgtcttca tcttcccgcc atctgatgag cagttgaaat480 
ctggaactgc ctctgttgtg tgpctgctga ataacttcta tcccagagag gccaaagtac540 
agtggaaggt ggataacgcc ctccaatcgg gtaactccca ggagagtgtc acagagcagg600 
acagcaagga cagcacctac agcctcagca gcaccctgac gctgagcaaa gcagactacg660 
agaaacacaa agtctacgcc tgcgaagtca cccatcaggg cctgagcttg gcccgttaag720 
aaagggcttt caacaggggg aagtttttag aggggagatg tggccccacc tt 772 

(2) INFORMATION UBER SEQ ID NO: 31: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 1031 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 31 

ggggacggaa gcgcagagca cggaccccgc cccctcgcgg ccccgctcgt gacgtcgcgg 60 
ggggcgccgg cctccgcccg gccccgaggg gqtctccccg gaggctcagc cccctctgct 120 
ccccatgggc aactgccagg cagggcacaa cctgcacctg tgtctggccc accacccacc 180 
tctggtctgt gcqactttga tgctgctgct ccttggcctc tctggcctgg gccttggcag 240 
cttcctcctc acccacagga ctggcctgcg cagcctgaca tcccccagga ctgggtctct 300 
tttttgagat cttttggcca gctgaccctg tgtcccagga atgggacagt cacagggaag 360 
tggcgagggt ctcacgtcgt gggcttgctg accaccttga acttcggaga cggtccagac 420 
aggaacaaga cccggacatt ccaggccaca gtcctgggaa gtcagatggg attgaaagga 480 
tcttctgcag gacaactggt ccttatcaca gccagggtga ccacagaaag gactgcagga 540 
acctgcctat attttagtgc tgttccagga atcctaccct ccagccagcc acccatatcc 600 
tgctcagagg agggggctgg aaatgccacc ctgagcccta gaatgggtga ggaatgtgtt 660 
agtgtctgga gccatgaagg ccttgtgctg accaagctgc tcacctcgga ggagctggct 720 
ctgtgtggct ccaggctgct ggtcttgggc tccttcctgc ttctcttctg tggccttctc 780 
tgctgtgtca ctgctatgtg cttccacccg cgccgggagt cccactggtc tagaacccgg 840 
ctctgagggc actggcctag ttcccgactt gtttctcagg tgtgaatcaa cttcttgggc 900 
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cttggctctg agttggaaaa ggttttagaa aaacrtgaaga . gctggaatgt ggijggaaaat 960 
aaaaagcttt tttgcccaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaal020 
aaaaaaaaaa a 1031 



(2) INFORMATION UBER SEQ ID NO: 32: 



(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 739 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(il) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 
hergestellte partielle cDNA 

(Hi) HYPOTHETISCH: NEIN 

(iii) ANTI-SENSE: NEIN 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBUOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 32 



cggctcgagc cccgctcagt 
ccggggaagg tcagcgccgt 
caaaaggtgt tgcggcttta 
agagacaaat accgatactt 
gaaaaggata tggcgaaggc 
cgtcagcatc cacagccata 
tacgattgct acaaggtccc 
atgtatcctg attactttgc 
gaacgagagg ttaagcagct 
ttgccccctg cccgaaagga 
ccccgggagc ggcccatgta 
atgttacaga acatgcactt 
aaaaaaaaaa aaaaaaaaa 



cacccgcagc aggcgtgcag 
aatggcgttc ttggcgtcgg 
taagcgggcg ctacgccacc 
tgcttgtttg atgagagccc 
cacccagctg ctgaaggagg 
catcttccct gactctcctg 
agaatggtgc ttagatgact 
caagagagaa cagtggaaga 
gcaggaggaa acgccacctg 
aggtgatttg cccccactgt 
gaaagagaga gacctcatct 
gccctaataa aaaatcagtg 



tttcccggct ctccgcgcgg 60 
gaccctacct gacccatcagl20 
tcgagtcgtg gtgcgtccagl80 
ggtttgaaga acataagaat240 
ccgaggaaga attctggtac300 
ggggcacctc ctatgagaga360 
ggcatccttc tgagaaggca420 
aactgcggag ggaaagctgg480 
gtggtccttt aactgaagct540 
ggtggtatat tgtgaccaga600 
ttcatgcttg caagtgaaat660 
aaatggaaaa aaaaaaaaaa720 

739 



(2) INFORMATION UBER SEQ ID NO: 33: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 651 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 
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(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 33 

cggctcgagc ctcagttcac cttctcacca tgaggctccc tgctcagctc ctggggctgc 60 
taatgctctg ggtccctgga tccagtgagg atattgtgat gacccagact ccactctcccl20 
tgcccgtcac ccctggagag ccggcctcca tctcctgcag gtctagtcag agcctcttggl80 
atagtgatga tggaaacacc tatttggact ggtacctgca gaagccaggg cagtctccac240 
agctcctgat ctatacgctt tcctatcggg cctctggagt cccagacagg ttcagtggca300 
gtgggtcagg cactgatttc acactgaaaa tcagcagggt ggaggctgag gatgttggag360 
tttattactg catgcaacgt atagaatttc cttacacttt tggccagggg accaagctgg420 
agatcaaacg aactgtggct gcaccatctg tcttca.tctt cccgccatct ggatgagcag480 
ttgaaatctg gaacttgcct ctgttgttgt gcctgcttga ataactttct attcccagag540 
aggggcaaag taacagtgga aggttggatt aacgccctgc aattcgggta actgcccagg600 
gagtagtttt cacagggcag gggcagcaag gacagcacct acagtcttag t 651 

(2) INFORMATION OBER SEQ ID NO: 34: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 823 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 34 

ccgcgtcgac aaattttttt aaagatcatc gatgaagaga gaaaatgcgc ttttctacag 60 
agtccccttc ccacccacag ccccatcccc agataagcgg ggagttccct ggcgcggtgcl20 
cagtttctag ccgctgagtg ggcgtgtgcg cggctccaag tgcgcctgcg tactgctcacl80 
tccccagctc cgcgccctgc tccgttcctc ccaaaactct gaatcgaaga actttccgga240 
agtttctgag agcccagacc ggcgggcacg cgcccatccc caaccccctc tgttaatccc300 
taccagcctg cagtcctggc tgcttccaag caggaggtgg ggcctctggc ctagcggggc360 
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cgaaaggcag tgcccctccc ccgcagtctg atttccotct- tccccccaac ggcaagcacgl20 
aggagcggca ggacgagcat ggctacatct cccggtgctt cacgcggaaa tacacgctgc480 
cccccggtgt ggaccccacc caagtttcct cctccctgtc ccctgagggc acactgaccg540 
tggaggcccc catgcccaag ctagccacgc agtccaacga gatcaccatc ccagtcacct600 
tcgagtcgcg ggcccagctt gggggcccag aagctgcaaa atccgatgag actgccgcca660 
agtaaagcct tagcccggat gcccacccct gctgccgcca ctggctgtgc ctcccccgcc720 
acctgtgtgt tctfcttgata catttatctt ctgtttttct caaataaagt tcaaagcaac780 
cacctggtca aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa 823 



(2) INFORMATION UBER SEQ ID NO: 35: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 457 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 35 

cataagggaa atgctttctg agagtcatgg atctcatgtg caagaaaatg aagcacctgt 60 
ggttcttcct cctgctggtg gcggctccca gatgggtcct gtcccagctg cagctgcaggl20 
agtcgggccc aggactggtg aagccttcgg agaccctgtc cctcacctgc actgtctctg!80 
gtggctccat cagcagtagt agt tact act ggggctggat ccgccagccc ccagggaagg240 
ggctggagtg gattgggagt atctattata gtgggagcac ctactacaac ccgtccctca300 
agagtcgagt caccatatcc gtagacacgt ccaagaagta cttctccctg aagctgagct360 
ctgtgaccgc cgcagacacg gctgtgtatt actgtgcgag acatgactgg tattacgata420 
ttttgactgg ttatgcgaaa cccggcacag gttcgac 457 

(2) INFORMATION UBER SEQ ID NO: 36: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 1203 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
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(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 36 

gtcgggggcg cctgcgcagt. cgctcttcct caggcggcgg ccatggcggg acaggaggat 60 
ccggtgcagc gggagattca ccaggactgg gctaaccggg agtacattga gataatcacc 120 
agcagcatca agaaaatcgc agactttctc aactcgttcg atatgtcttg tcgttcaaga 180 
cttgcaacac taaacgagaa attgacagcc cttga.acgga gaatagagta cattgaagct 240 
cgggtgacaa aaggtgagac. actcacctag aacagtgccg tgctgctgct gggaagttgc 300 
tttacacaac acaggccaca tgggaaaggc cccagcagcc ttcagctcct tcctttctcc 360 
ttaaagagca acagggctta ttcttgtttt tcttttttca aaagtgtggc ctttgggctc 420 
tgccatctgg ggtgtggtgt ggtatgtggg aagaagttca gaggaaccgt tggaaacgac 480 
gttaggcatt ttaccttttc agtaacattt tatacatcta cttgtcaatg tatttgagac 540 
attcacagcc aaaagcctgg gactctttgt gaaggtcctc ctcacctcta tctttctttc 600 
tctctctctc aaactttcct taaagttctc attgcctttg cactgcttct gtgaacagtc 660 
tttgtctcct ccccaccttt ggtgggaagt gcggggcagt cctggtcaag acactcatgc 720 
cctggcaatg.tggctgccag agaatgttgt tgctaaccca ccagtttctt gttgatttgg 780 
agaggtcaag gccaggcccc cacttggctt gaagggacat tttcagactt ttctttctgt 840 
cacttggagt gtctatgcct ctcatatttc cctaataaac tcctcaactt tttatctgac 900 
tgctgtgatt atggtgggga gaggagctag agatgggttc acttattgca cagaaatgta 960 
atacatggcg ttattattct aacataaaac tttcagatgt agctgtttga ttcaaagcctl020 
aggtgcttac cagcccaagt ccccatgttt ggactttcag ctgactagct catcttgggal080 
atcatttggt cattcagcac atttaccaag tatttactat gtaggcatgt taaactccaall40 
taaaacatac agcattgaat cagaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa!200 



(2) INFORMATION UBER SEQ ID NO: 37: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 207 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEK0LTYP: aus einzelnen ESTs durch Assemblierung und Edrtierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(Hi) ANTJ-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 
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(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 37 

cggctcgagg ccgccctcgg tgtcagtgtc cccaggacag acggccagga tcacctgctc 60 
tggacatgca ttgccaaagc aatgatgctt attggtacca gtcagaggcc agggccaggcl20 
ccctgtgctt ggtggatccc ttgaaagaac attggaggag. ggcccttcag ggcatgccctl80 
ggagacggat tgctctgggc ttccaac 207 

(2) INFORMATION UBER SEQ ID NO: 38: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 346 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 38 

cggctcgagc ggctcgaggc cgggatggtg ggtgctacgc cccttgggta ctggggccag 60 
ggaaccctgg tcaccgtctc ctcagcctcc accaagggcc catcgggctt cccccgggcal20 
ccctcctcca agagcacctc tgggggcaca gcggccctgg gctgcctggt caaggactacl80 
ttccccgaac cggtgacggt gtcgtggaac tcaggcgcct gaccagcggc gtgcacacct240 
tcccggctgt ctacagtctc aggactctac tcctcagcag cgtggtgacg tgccctccag300 
cagttgggca ccagacctac atctgcaagt gaatcgaagc cagcaa 346 



(2) INFORMATION UBER SEQ ID NO: 39: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 926 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
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(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 39 

cggctctaag gaagcagcac tggtggtgcc tcagccatgg cctggaccgt tctcctcctc 60 
ggcctcctct ctcactgcac aggctctgtg acctcctatg tgctgactca gccaccctcgl20 
gtgtcagtgg ccccaggaca gacggccagg attacctgtg ggggaaacaa cattggaagtl80 
aaaagtgtgc actggtacca gcagaagcca ggccaggccc ctgtgctggt cgtctatgat240 
gatagcgacc ggccctcagg gatccctgag cgattctctg gctccaactc tgggaacacg300 
gccaccctga ccatcagcag ggtcgaagcc ggggatgagg ccgactatta ctgtcaggtg360 
tgggatagta jgtagtgatca ttgggtgttc ggcggaggga ccaagctgac cgtcctaggt420 
cagcccaagg. ctgccccctc ggtcactctg ttcccgccct cctctgagga gcttcaagcc480 
aacaaggcca cactggtgtg tctcataagt gacttctacc cgggagccgt gacagtggcc540 
tggaaggcag atagcagccc cgtcaaggcg ggagtggaga ccaccacacc ctccaaacaa600 
agcaacaaca agtacgcggc cagcagctat ctgagcctga cgcctgagca gtggaagtcc660 
cacagaagct acagctgcca ggtcacgcat gaagggagca ccgtggagaa gacagtggcc720 
cctacagaat gttcataggt tctcaaccct caccccccac cacgggagac tagagctgca780 
ggatcccagg ggaggggtct ctcctcqcac cccaaggcat caagcccttc tccctgcact840 
caataaaccc tcaataaata ttctcattgt caatcaggaa aaaaaaaaaa aaaaaaaaaa900 
aaaaaaaaaa aaaaaaaaaa aaaaaa 926 

(2) INFORMATION UBER SEQ ID NO: 40: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 2384 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MQLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 40 

gcctcccgcc cgccgcctct gtctccctct ctccacaaac tgcccaggag tgagtagctg 60 
ctttcggtcc gccggacaca ccggacagat agacgtgcgg acggcccacc accccagccc 120 
gccaactagt cagcctgcgc ctggcgcctc ccctctccag gtccatccgc catgtggccc 180 
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ctgtggcgcc 
ttctgggact 
gctgacacct 
cctttcggct 
gtgcccaaag 
gagctccgca 
aacaagatct 
tacatctcca 
ctccgcatcc 
aacatgaact 
gccttcgatg 
cccaaagacc 
atcgaactgg 
cagatcagga 
ttggacaaca 
gtggtctatc 
ggcttcgggg 
tactgggagg 
ggcaactaca 
ggaacacagc 
tgcgtccaac 
ctctccctgg 
ttggcctcag 
tttttctcgt 
tgggtcagca 
cgcacctgtc 
cctccccgac 
acccctgctc 
cctctctctc 
tgtgtgtgtc 
ccatctctcc 
ttgagctggg 
agggacagcg 
cggaggaggt 
agtcccagac 
tctcccctta 
tttccattaa 



tcgtgtctct 
tcaccctgga 
cgggcgtcct 
gccactgcca 
agatctcccc 
aggatgactt 
ccaagatcca 
agaaccacct 
acgacaaccg 
gcatcgagat 
gcctgaagct 
tccctgagac 
aggacctgct 
tgatcgagaa 
acaagttggc 
tgcactccaa 
tgaagcgggc 
tgcagccggc 
aaaagtagag 
cagacatcct 
ccagcccccc 
ctcccaaggg 
agctgcccct 
tcactcccaa 
gccaggaggc 
cagacacaca 
tctgcggctc 
cactggccct 
tctctctctc 
ttgtgcttcc 
gaacctggct 
actgctttct 
gtctccccag 
tgggaggtgg 
accggttttc 
tccttctggt 
agaaacaccg 



gctggccctg 
cgatgggcca 
ggacccggac 
cctgcgggtg 
tgacaccacg 
caagggtctc 
tgagaaggcc 
ggtggagatc 
catccgcaag 
gggcgggaac 
caactacctg 
cctgaatgaa 
tcgctactcc 
cgggagcctg 
cagggtgccc 
caacatcacc 
ctactacaac 
cactttccgc 
gcagctgcag 
gatggggagg 
acctcgggtc 
tgcaggtggg 
gctctcccac 
acccaagtgt 
ggtccataag 
tgttctgttc 
ccctcagccc 
tcgaccagtc 
tttctgtgtg 
tcagaccttt 
tcgcctgtcc' 
gtctgtccgg 
cctgccctgc 
aggcccagca 
ctagaagccc 
ccagcgcaag 
tgcaacgtga 



agccaggccc 
ttcatgatga 
tctgtcacac 
gttcagtgct 
ctgctggacc 
cagcacctct 
ttcagcccac 
ccgcccaacc 
gtgcccaagg 
ccactggaga 
cgcatctcag 
ctccacctag 
aagctgtaca 
agcttcctgc 
tcagggctcc 
aaagtgggtg 
ggcatcagcc 
tgcgtcactg 
ccaccgcggg 
cagagccagg 
cctgacccca 
cgcaaggccc 
cacagccacc 
ccaaggctcc 
aatggggaca 
ctcctcctca 
ccttgcaagt 
ctcccttctg 
tgtgtgtgtg 
ctcgcttctg 
ctttcactcc 
cctgcaccca 
tcaggccttg 
tcccgcgcag 
ctcaccccca 
gaggggctgc 
aaaaaaaaaa 



tgcccttt^a 
ncgatga'gga 
ccacctacag 
ccgacctggg 
tgcagaacaa 
acgccctcgt 
tgcggaagct 
tacccagctc 
gagtgttcag 
acagtggctt 
aggccaagct 
accacaacaa 
ggctgggcct 
ccaccctccg 
cagacctcaa 
tcaacgactt 
tcttcaacaa 
accgcctggc 
gcctcagtgg 
aagctaagcc 
gctcgatgcc 
ggcccccatc 
cagaggcacc 
agtcctagga 
gtgggctctg 
tgcatttcca 
tcatggcctg 
ttctctcttt 
tgtgtgtgtg 
agcttggtgg 
acaccctctg 
gcccctgccc 
cccccaaacc 
atgacaccat 
ctggcccact 
ttctgaggtc 
aaaa 



giiagagaggc 2 30 
"agcttcgggc 300 
cgccatgtgt 360 
tctgaagtct 420 
cgacatctcc 480 
cctggtgaac 540 
gcagaagctc 600 
cctggtggag 660 
tgggctccgg 720 
tgaacctgga 780 
gactggcatc 840 
aatccaggcc 900 
aggccacaac 960 
ggagctccacl020 
gctcctccagl080 
ctgtcccatgll40 
ccccgtgcccl200 
catccagtttl260 
gggtctctggl320 
agggcccagc 13 8 0 
ccatcaccgcl440 
acatgttcccl500 
ccatgaagctl560 
gaacagtcccl620 
ccagggctgcl680 
gcctttcaacl740 
t ccctcccagl8 00 
ccccgtccttl860 
tgtgtgtgtgl920 
cct gt t ccct 1980 
gccttctgcc2040 
acaaaacccc2100 
tgtactgtcc2160 
caaccgccag2220 
ggtggctagg2280 
ggtggctgtc2340 
2384 



(2) INFORMATION UBER SEQ ID NO: 41: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 334 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editieaing 

hergestellte partielle cDNA 

(Hi) HYPOTHETISCH: NEIN 



(iii) ANTI-SENSE: NEIN 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 
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(A) BIBLIOTHEK: cDNA library 
(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 41 



ctcgagccga attcggctcg agaggagccc agccctggga ttttcaggtg ttttcatttg 60 
gtgatcagga ctgaacagag agaactcacc atggagtttg ggctgagctg gctttttcttl20 
gtggctattt taaaaggtgt ccagtgtgag gtgcagctgt tggagtctgg gggaggcttgl80 
gtacagcctg gggggtccct gagactctcc tgtgcagcct ctggattcac ctttagcagc240 
tatgccatga gctgggtccg ccaggctcca .gggaaggggc tggagtgggt ctcaggtatt300 
agtggtagtg gtgtgatagt acacactacg caga 334 

(2) INFORMATION UBER SEQ ID NO: 42: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 845 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 

(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 42 

gcgttccctc cgccgagcta cttctttctt tccttttttt tttttttctg gctaacagaa 60 
ttttattgtt aaatcacaga aactttagtg caaaacaaaa atcacgaagt ccatttaatal20 
gcaacttcat gtcctgctgg ctttgcttgc tgtctcctgg caaccagaag tggacagaagl80 
cgtgggtgcc caagtgggcc acagacagct tccaaccccc acaccccagc atccaatcca240 
cacccagcag acccttcggc atgccgccct ctaccaggaa gccagaggcc taggagctcg300 
ccatccataf ttatttgaaa aggtcaaaag gagcatctat gagacaaggg aggggtgcag360 
gctgaagcag cgcctcaaca gccagggaca tgtaggcaac acgagcaggc acagcgcggc420 
caccactgtc cacacgctca cacaagccag gcccgcaggg ccttcggaga gctagcaggt480 
tacattcagg cagatggccc tcttcccacc caaacccaca gaaccccaaa caaggcatca540 
ccaggaaaga cacgggaaag ccaaatcaca gttgaaccag ggacagagaa cccttggccc600 
cactgatgtc ccaagccacc agcagctgct tccaaaatcc ctatgctatt acagtgggaa660 
ttacatcatt taaaaagcct gattattccc aggcttctaa tctttcatat aaaactgcct720 
ttgttttgct cctttgttca actcagaggc ccagcaaagc gggcagggtc cctgatcagg780 
gcaggagccc acctcagaag cccatgccgc accagtgccc aagcacatgt cagtgctcag840 
aacaa 945 



(2) INFORMATION UBER SEQ ID NO: 43: 
(i) SEQUENZ CHARAKTERISTIK: 
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(A) LANGE: 2233 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: aus einzelnen ESTs durch Assemblierung und Editierung 
hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(Hi) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 43 



gaattcagaa gttaatgatg ttgggtaaga 
tatcacctac tfctaatttta tatgagagta 
cactgctcct acccacgcag atttattcca 
cctcccagag tacttccaac tctgggttgg 
cggctggtgg tgccctgcag tcaacagcca 
atctctactc ttaagagact caggccaaga 
cccaatccaa atggcgtctg gaagtccaat 
ctactaattc cacacctttt attgacacag 
ttgaagaaca tgtgagaggt ttgactagat 
ttcatgtaca agatgaagga gaggcaacat 
tgtggcttga gaaatatgga cacttaatac 
tgggattgtg gaatggagat tcagttttca 
acaggtaata taaaaagctt ccatgattct 
gtgttactgt aattcctcaa cgtattgttt 
tagatgaagt tttacattgt tgagctattg 
tcctgaggct ttggatttga cattgcattt 
gggcaatgat gaatgagaat ctacccccag 
ccatattgag tcaaatggta ggcatttcct 
cattcattta gctaaacgga ttccaaagag 
aatgcttttt attattatta ttttttagac 
agtggtgcga tctcagatca gtgtaccatt 
tcagcctccc aagtagctgg gattacaggc 
attttagtag agacagggtt tcaccatgtt 
ggtgatccac ccgcctcggc ctcccaaagt 
cagccatcaa aatgcttttt atttctgcat 
tgatctgttt tgaaggcaaa attgcaaatc 
gtcaaaacta taaatcaagfc atttgggaag 
cacaaacttt tatactcttt ctgtatatac 
cagaatagcc acatttagaa cactttttgt 
ctggtcctaa gcctaaaagt gggcttgatt 
acacataaac ctttttaaaa atagacactc 
ctgatgctta gatgttccag taatctaata 
tttttttcca tctttagaaa actacatggg 
tgtgtgtgtg aatgaacact cttgctttat 
tatattgtgt ttgtgtattt acgctttgat 
attgaacaca aactgtaaat aaaaagaaat. 
gagagaggaa aaggggagga agaggagggg 
aggaggtggg ggg 



gaacaatggt aagagagcaa tctaagaata 60 
catggaggta gctgtgatgt ggaaatgtag 120 
gtgaaacaac aactggaact tcaagtaact 180 
ccccaaatcc aactaatgcc accaccaagg 240 
gtctcttcgt ggtctcactc tctcttctgc 300 
aacgtcttct aaatttcccc atcttctaaa 360 
gtggcaagga aaaacaggtc ttcatcgaat 420 
aaaatgttga gaatcccaaa tttgattgat 480 
gatggatgcc aatattaaat ctgctggagt 540 
ccaaaatagt taagacatga tttccttgaa 600 
taccttgaaa ataagaatag aaataaagga 660 
tttggttcat taattctata aggccataaa 720 
atttatatgt acatgagaag gaacttccag 780 
cgacagcact aatttaatgc cgatatactc 840 
ctgttctctt gggaactgaa ctcactttcc 900 
gaccttttat gtagtaattg acatgtgcca 960 
atccaagcat cctgagcaac tcttgattatl020 
atcacctgtt tccattcaac aagagcacta!080 
tagaattgca ttgaccgcga ctaatttcaa!140 
agtctcactt tgtcgcccag gccggagtgcl200 
tgcctcccgg gctcaagcga ttctcctgccl260 
acctgccacc atgcccggct aatttttgtal320 
gcccaggctg gtttcgaact cctgacctcal380 
gctgggatta caggcttgag cccccgcgccl440 
atgttgaata ctttttacaa tttaaaaaaalSOO 
ttgaaattaa gaaggcaaaa atgtaaaggal560 
tgaagactgg aagctaattt gcattaaattl620 
attttttttc tttaaaaaac aactatggatl680 
tatcagtcaa tatttttaga tagttagaacl740 
ctgcagtaaa tcttttacaa ctgcctcgacl800 
cccgaagtct tttgttcgca tggtcacacal860 
tggccacagt agtcttgatg accaaagtccl920 
aacaaacaga tcgaacagtt ttgaagctacl980 
tccagaatgc tgtacatcta ttttggattg2040 
tcatagtaac ttcttatgga attgatttgc2100 
ggctgaaaga gcaaaaaaaa aggagggcag2160 
ggaaagagaa gggagagaga aggaggggga2220 

2233 



93 



DE 198 18 619 A 1 



(2) INFORMATION OBER SEQ ID NO: 44: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 243 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 44 

ggagcccagc actagaagtc ggcggtgttt ccattcggtg atcagcactg aacacagagg 60 

actcaccatg gagtttgggc tgagctgggt tttcctcgtt gctcttttaa gaggtgtccal20 

gtgtcaggtg cactggtgga gcggggagcg ggtcagcagg agtcctgaat cctgtgacgcl80 

tgatcagtcc tatatcagat ggcgcagctc agcagggtga tggggtatga atgataacat240 

aca 243 



(2) INFORMATION UBER SEQ ID NO: 45: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 817 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 
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(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 45 



gttttttttt 
aagggtggag 
gagatgggag 
caagggtggg 
ggagcgggga 
aacttggcag 
gtagctgact 
cgtgaaggga 
tatcaacatg 
acagcacagg 
cggaaagcgg 
cggcctcaac 
ttcgcgggag 
ctgtagctag 



tttttttttg 
agacggaaca 
aagggattca 
acgcgacctg 
aacgcgccgg 
aaagagaagc 
tgctcgtaaa 
acgtcctcaa 
atcactcccc 
agcgacgcca 
tgggagaaag 
catcccatcc 
acggcgqcgt 
ggcgtgagta 



aagagagcag 
gccccccagc 
gtctctcgcc 
ggtgacacgg 
ggccctagcg 
ctccgaggag 
ggttgtatat 
tcaacacagc 
aggcgctgag 
taaagaaggg 
cccaggatgc 

gggggcggca 

ctgaaaccaa_ 
ttggaagagc 



attctcttta 
ctcagccctc 
cgggaaaccc 
tgcagggagt 
caccatgtat 
gaggtaaagg 
gttctggggg 
ggaatggaca 
gacgatgccg 
agtcggggat 
cctcgcaggg 
ggcggaaaag 
aactgctcct 
gagggcc 



ttgagatacg 
tccacggggg 
agtcccacag 
ctttaaatag 
tccttgcgct 
cctgcagcga 
ccattctcaa 
ttgaaaaata 
caggcggcca 
cgccgaggtg 
gggcagaggg 
gctgggctcc 
ggggaaacct 



ggacacagcg 60 
ccggatgccal20 
agggcgccggl 8 0 
aggaggggct240 
tattgagccg300 
tgaaacagtt360 
aatctttctc420 
ttccgagcat 4 8 0 
gcttcggccc540 
caagcgggct600 
ggcgtggccc660 
tctcaggact720 
tccttgacct780 
817 



(2) INFORMATION UBER SEQ ID NO: 46: 



(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 1644 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblieaing und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 



(Hi) ANTI-SENSE: NEIN 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 46 



gttccggctc acatgggaaa tactttctga 
aacacctgtg gttcttcctc ctgctggtgg 
agctgcagga gtcgggccca ggactggtga 
ctgtctctgg tggctccate agcagtggtg 
cagggaaggg cctggagtgg attgggtaca 
cgtccctcaa gagtcgagtt accatatcag 
agctgagctc tgtgactgcc gcggacacgg 
cctacggtga ctcgagatac tactactacg 
caccgtctcc tcagcatccc cgaccagccc 
ccagccagat gggaacgtgg tcatcgcctg 
actcagtgtg acctggagcg aaagggacag 
caggatgcct ccggggacct gtacaccacg 
tgcctagccg gcaagtccgt gacatgccac 
gtgactgtgc cctgcccagt tccctcaact 
accccatctc cctcatgctg ccacccccga 



gagtcctgga cctcctgtgc aagaacatga 60 
cagctcccag atgggtcctg tcccaggtgc 120 
agccttcaca gaccctgtcc ctcacctgca 180 
gttactactg gagctggatc cgccagcacc 240 
tctattacag tgggagcacc tactacaacc 300 
tagacacgtc taagaaccag ttctccctga 360 
ccgtgtatta ctgtgcgaga gagcatctct 420 
gtatggacgt ctggggccaa gggacccggt 480 
caaggtcttc ccgctgagcc tctgcagcac 540 
cctggtccag ggcttcttcc cccaggagcc 600 
ggcgtgaccg ccagaaactt cccacccagc 660 
agcagccagc tgaccctgcc ggccacacag 720 
gtgaagcact acacgaatcc cagccaggat 780 
ccacctaccc catctccctc aactccacct 840 
ctgtcactgc accgaccggc cctcgaggac 900 
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ctgctcttag gttcagaagc gaacctcacg tgcacactqa cc^gcctgag acja"tgcctca 96Q 
ggtgtcacct tcacctggac gccctcaagt gggaaga^cg ctgctcaagg accacctgagl020 
cgtgacctct gtggctgcta cagctftgtcc agtgtcctgc cgggctgtgc cgagccatggl080 
aaccatggga agaccttcac ttgcactgct gcctaccccg agtccaagac cccgctaaccll40 
gccaccctct caaaatccgg aaacacattc cggcccgagg tccacctgct gccgccgccgl200 
tcggaggagc tggccctgaa cgagctggtg acgctgacgt gcctggcacg cggcttcagcl2 60 
cccaaggacg tgctggttcg ctggctgcag gggtcacagg agctgccccg cgagaagtacl320 
ctgacttggg catcccggca ggagcccagc cagggcacca ccaccttcgc tgtgaccagcl380 
atactgcgcg tggcagccga ggactggaag aagggggaca ccttctcctg catggtgggcl440 
cacgaggccc tgccgctggc cttcacacag aagaccatcg accgcttggc gggtaaaccclSQO 
acccatgtca atgtgtctgt tgtcatggcg gaggtggacg gcacctgcta ctgagccgccl560 
cgcctgtccc cacccctgaa taaactccat gctcccccaa gcaaaaaaaa aaaaaaaaaal620 
aaaaaaaaaa aaaaaaaaaa aaaa * 1644 



(2) INFORMATION OBER SEQ ID NO: 47: 



(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 1133 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 



(II) MOLEKULTYP: aus elnzelnen ESTs durch Assemblierung und Editierung 
hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 

(Hi) ANTI-SENSE: NEIN 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 47 



atttatctgg gacagacatc ttcagaatga 
aaaggttcag atattatcag attcagaaat 
tttaaacaga aaaagacaaa tgaatgggga 
ccatcacgcc tgcccttcct tgatattgca 
ttctttgtgg acattggccc agtctgtttc 
gaaagaaatt tgaaaaaact ttctctttgc 
ctgaatcctt ccatttcttc tgcacatcta 
gaaggattga tcagagcattf gtgcaataca 
aaaatttgaa tttttttttc aacactctta 
agaaaaccaa aataaaaatt gaaaaataaa 
ttgaatatdt tccacagagg gaagtttaaa 
aaaacacttt cccatgagtg tgatccacat 
tgaggtcctt gttttgtttt gttcataata 
cttgaagaat gttgatggtg ctagaagaat 
atcgtgtggt gtatttttta aaaaatttga 
aattaaaagt atgcagatta tttgcccaaa 
cagtctcatt ttcatcttct tccatggttc 
aaaattaaat tgtacctatt ttgtatatgt 
aaataaagca tgtttggttt tccaaaagaa 



cacatgccaa acagtggttc ttattaaatc 60 
agtgatgctt tgtgtatcta ttttcttctc 120 
aagacaatca ttgaatacaa aacaaataag 180 
cctttggaca tcggtggtgc tgaccaggaa 240 
aaataaatga actcaatcta aattaaaaaa 300 
catttcttct tcttcttttt taactgaaag 360 
cttgcttaaa ttgtgggcaa aagagaaaaa 420 
gtttcattaa ctccttcccc cgctccccca 480 
cacctgttat ggaaaatgtc aacctttgta 540 
aaccataaac atttgcacca cttgtggctt 600 
acccaaactt ccaaaggttt aaactacctc 660 
tgttaggtgc tgacctagac agagatgaac 720 
caaaggtgct aattaatagt atttcagata 780 
ttgagaagaa atactcctgt attgagttgt 840 
tttagcattc atattttcca tcttattccc 900 
tcttcttcag attcagcatt tgttctttgc 960 
cacagaagct ttgtttcttg ggcaagcagal020 
gagatgttta aataaattgt gaaaaaaatgl080 
aaaaaaaaaa aaaaaagtcg acc 1133 
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(2) INFORMATION UBER SEQ ID NO: 48: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 969 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assembllerung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 48 



gaggaggagg gtgtatctcc tttcgtcgga ccgccccttg gcttctgcac tgatggtggg 60 
tggatgagta atgcatccag gaagcctgga ggcctgtggt ttccgcaccc gctgccacccl20 
ccgcccctag cgtggacatt tatcctctag cgctcaggcc ctgccgccat cgccgcagatl80 
ccagcgccca gagagacacc agagaaccca ccatggcccc ctttgagccc ctggcttctg240 
gcatcctgtt gttgctgtgg ctgatagccc ccagcagggc ctgcacctgt gtcccacccc300 
acccacagac ggccttctgc aattccgacc tcgtcatcag ggccaagttc gtggggacac360 
cagaagtcaa ccagaccacc ttataccagc gttatgagat caagatgacc aagatgtata420 
aagggttcca agccttaggg gatgccgctg acatccggtt cgtctacacc cccgccatgg480 
agagtgtctg cggatacttc cacaggtccc acaaccgcag cgaggagttt ctcattgctg540 
gaaaactgca ggatggactc ttgcacatca ctacctgcag tttcgtggct ccctggaaca600 
gcctgagctt agctcagcgc cggggcttca ccaagaccta cactgttggc tgtgaggaat660 
gcacagtgtt tccctgttta tccatcccct gcaaactgca gagtggcact cattgcttgt720 
ggacggacca gctcctccaa ggctctgaaa agggcttcca gtcccgtcac cttgcctgcc780 
tgcctcggga gccagggctg tgcacctggc agtccctgcg gtcccagata gcctgaatcc840 
tgcccggagt ggaagctgaa gcctgcacag tgtccaccct gttcccactc ccatctttct900 
tccggacaat gaaataaaga gttaccaccc agcaaaaaaa aaaaaaaaaa acaagtcgtc960 
gcgtgctgt • g 69 



(2) INFORMATION DEER SEQ ID NO: 49: 

(i) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 617 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 
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(iii) HYPOTHETISCH: NEIN 

(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(Vii) SONSTIGE HERKUNFT: 
(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 49 



cctacaccta ccctcccttt gggtttctat tcggaccgcg atgatttgct ttggaaggct 60 
taaccccctt cttccccaaa cttgcccccg gagaaccccc agccttacga ccctcctcctl20 
gaagatgcaa aaccagcttg ccggccgcgc tctcttccag gacatcaaga agccagctgal80 
agatgagtgg ggtaaaaccc cagacgccat gaaagctgcc atggccctgg agaaaaagct240 
gaaccagggc cttttggatc ttcatgccct gggttctgcc cgcacggacc cccatctctg300 
tgacttcctg gagactcact tcctagatga ggaagtgaag cttatcaaga agatgggtga360 
ccacctgacc aacctccaca ggctgggtgg cccggaggct gggctgggcg agtatctctt420 
cgaaaggctc actctcaagc acgactaaga gccttctgag cccagcgact tctgaagggc480 
cccttgcaaa gtaatagggc ttctgcctaa gcctctccct ccagccaata ggcagctttc540 
ttaactatcc taacaagcct tggaccaaat ggaaataaag ctttttgatg caaaaaaaga600 
ggagggggga aaaaagc 617 

(2) INFORMATION UBER SEQ ID NO: 50: 

(I) SEQUENZ CHARAKTERISTIK: 

(A) LANGE: 704 Basenpaare 

(B) TYP: Nukleinsaure 

(C) STRANG: elnzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: aus einzelnen ESTs durch Assemblierung und Editierung 

hergestellte partielle cDNA 

(iii) HYPOTHETISCH: NEIN 
(iii) ANTI-SENSE: NEIN 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
(C) ORGAN: 

(vii) SONSTIGE HERKUNFT: 

(A) BIBLIOTHEK: cDNA library 

(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO: 50 

ggggagacto gtcaccaggc gtgcagtggg cactgctggg ctcccccatc ccgtcctaac 60 

ccggaacagc cccgggcagg aggcgtggaa agtcgagggg gtaaaccgcg aatgtgcgttl20 

gtgtaagcca cggcgcaggg tggggcgcgg gcgggacttg ggcgggcggg gtgggcttggl80 

ccgagctggc ctccggggca ccgaccgcta taaggccagt cggactgcga cacagcccat240 

cccctcgacc gctcgcgtcg catttggccg cctccctacc gctccaagcc cagccctcag300 

ccatggcatg ccccctggat caggccattg gcctcctcgt ggccatcttc cacaagtact360 
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ccggcaggga gggtgacaag cacaccctga gcaagaagga gct'gaaogag ctgatccaga42C 
aggagctcac cattggctcg aagctgcagg atgctgaaat tgcaaggctg atggaagact480 
tggaccggaa caaggaccag gaggtgaact tccaggagta tgtcaccttc ctgggggcct540 
tggctttgat ctacaatgaa gccctcaagg gctgaaaata aatagggaag atggagacac600 
cctctggggg tcctctctga gtcaaatcca gtggtgggta attgtacaat aaattttttt660 
tggtcaaatt taaaaaaaaa aaaaaaagag aaaaaagggt gage 704 

(2) INFORMATION UBER SEQ ID NO: 51: 

(A) LANGE: 95 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 51: 

PCSSQFHPVE NRSQEPIAGD SMSPRTLPVQ NMNNAMFLQK TLSLSFIGGN HQTTAECRTL60 
SRTTDLSPSH SPYHHKSHNK KEKRYFGFKK SKKIM 95 

(2) INFORMATION OBER SEQ ID NO: 52: 

(A) LANGE: 76 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 52: 

LPRDTWFKLK CLTDHSRHVL .pSYVNVSHLT HVHCLQTEHR LPLflWFENRN RAMPTDPSYV60 
WASKWNCTFI QIFTCL 76 

(2) INFORMATION UBER SEQ ID NO: 53: 

(A) LANGE: 90 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 
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(Hi) HYP0THET1SCH: ja 



(vi) HERKUNFT: 

(A) ORGAN1SMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 53: 

RVNNCQEQLV VLKYNPQPRI PPVLQMDQLK QANTEDTKNE VRFIETRVTP LDELNTKMTL60 
TLSRYRSSET CLQNEIPEEF CSYPEIRGSN 90 

(2) INFORMATION UBER SEQ ID NO: 54: 

(A) U^NGE: 117 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 
(Hi) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 54: 

PLQPPRAMAP RGCIVAVFAI FCISRLLCSH GAPVAPMTPY LMLCQPHKRC GDKFYDPLQH 60 
CCYDDAWPL ARTQTCGNCT FRVCFEQCCP WTFMVKLINQ NCDSARTSDD RLCRSVS 117 

(2) INFORMATION UBER SEQ ID NO: 55: 

(A) LANGE: 103 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 
(Hi) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 55: 

RVPQPALGWC RVDVGHRGHQ EGSESLGPHQ HTHLMLSRIL EGDLWASSGQ RQGGPQTGHR 60 
MKWAVECVFL WPPNSHSASQ ISGNTSLFLQ AHPGRRIQES SFP ■ 103 

(2) INFORMATION UBER SEQ ID NO: 56: 
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(A) LANGE: 81 Aminosauren - 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 56: 

RCSFHTSGSW PRARRHHHSN SAAGGRRTCP HISCVAGTAS GKESWGPLGL RVSRGAWRCR60 
KWQRQLRCSL GEPWLWWAV E 81 

(2) INFORMATION OBER SEQ ID NO: 57: 

(A) LANGE: 125 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 57: 

RAARADSARA FPLPACKVW PQGPPPGHVG AAGQAFPSFE RGFRCRSRAS GLRSSLPSFR 60 
SWASPPPTH QSRCILGRAL GAMAPRGRKR KAEAAWAVA EKREKLANGG EGMEEAT WI 120 
EHCTS 125 

(2) INFORMATION UBER SEQ ID NO: . 58: 

(A) LANGE: 1 19 Aminosauren 

(B) TYP: Protein . 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(Ii) MOLEKULTYP: ORF 
(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
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(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 58:' 

QRSPPPFPPR RSPASLASRL RRPPRPQPYA SSRGEPWRLE PGRECSGTGG WGAETRPLSG 60 
NWATKSAARK LCSYSGNLSQ RKGKLGPQHP RGLEADLGAQ PLCKQGAGRL EPNRLERLE 119 

(2) INFORMATION OBER SEQ ID NO: 59: 

(A) LANGE: 128 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLQGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 59: 

TRQLVQCSIT TVASSIPSPP FASFSRFSAT ATTAASALRF LPRGAMAPRA RPRMQRDWWV 60 
GGGDATTERK LGNEERSPEA LLLQRKPLSK EGKAWPAAPT WPGGGPWGTT TLQAGSGKAR1 2 0 
AESARAAR 128 

(2) INFORMATION UBER SEQ ID NO: 60: 

(A) LANGE: 127 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 60: 

VTVMQFNFEL SFKYVLYSSY- SWLKLDHTIA DCMVFTWTPC RMLDYLYSSY AbJMLWAGEMK 60 
SSSHQDLLFK WLDNWATKEL ELHLLGFELF WHTLLHFGKS KSSASGALSI ENLPS FALECD 120 
VLFFIYT 127 

(2) INFORMATION UBER SEQ ID NO: 61: 

(A) LANGE: 111 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 
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(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 61 : 

SIGPICSQGL GPGGIPSPIT LIKNGCNCKN PCLIYLQLCS HLQMYLLMLS CQVPMQRWRG 60 
LPLCGWGLWV WKDRYQKNA FKCTNLLINI RCLLKKKKKK KBCRVGGVGCI G 111 

(2) INFORMATION UBER SEQ ID NO: 62: 



(A) LANGE: 68 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 62: 

YRSFTTTHKP HPHKGSPRHL CIGTWQLSIR RYICKWEHSC KY1RQGFLQL QPFLIKVIGE60 
GIPPGPRP 68 

(2) INFORMATION UBER SEQ ID NO: 63: 

(A) LANGE: 195 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 



(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 63: 

LVQPGGSCSG GRLLGVEFPS APRVRPFERS APAPATSLLG AMTTTTTFKG VDPNSRNSSR 60 
VLRPPGGGSN FSLGFDEPTE QPVRKNKMAS NIFGTPEENQ ASWAKSAGAK SSGGREDLES 120 
SGLQRRNSSE ASSGDFLDLK GEGDIHENVD TDLPGSLGQS EEKPVPAAPV PSPVAPAPVP1 8 0 
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SRRNPPGGKS SLVLG 

(2) INFORMATION UBER SEQ ID NO: 64: 

(A) LANGE: 164 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: ORF 



(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 64: 

VSQSFPSNLL LENTHAMAHR PKSQGQRETC SSKEKKKRQQ YIKCFFLMKQ IQEMYSQAQV 60 
VQFTSMEETD RTTAFRTVRA NPRRGWTCRQ GDFFWMALGP GPPGWAQAQQ ARASLHSAPG120 
CLASLCPHFH EYHLLPSDLR SLRSLLQRSS FSAVQMTPSL PCHH 164 

(2) INFORMATION UBER SEQ ID NO: 65: 

(A) LANGE: 106 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 65: 

FQAVSLYIQA FLCVRAKALL ISQPVLLLSG YFLRLKNKRQ FLCFAGGKAG GAGLFIVHMS 60 
QEEALSKGHW QVRATPRRLC GETPCGLGPG RNGACGLFMV CPVEAW 106 

(2) INFORMATION UBER SEQ ID NO: 66: 

(A) LANGE: 349 Aminosauren 

(B) TYP: Protein 

(C) "STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 
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(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xl) SEQUENZ-BESCHREIBUNG: SEQ ID NO 66: 

AALRS DAGMK RALGRRKGVW LRLRKILFCV LGLYIAIPFL IKLCPGIQAK LIFLNFVRVP 60 
YFIDLKKPQD QGLNRTCNYY LQPEEDVTIG VWHTVPAVWW KNAQGKDQMW YEDALASSHP120 
IILYLHGNAG TRGGDHRVEL YKVLSSLGYH WTFDYRGWG DSVGTPSERG MT YDALHVFD1 8 0 
WIKARSGDNP VYIWGHSLGT GVATNLVRRL CERETPPDAL ILESPFTNIR EEAKSHPFSV2 4 0 
IYRYFPGFDW FFLDPITSSG IKFANDENVK HISCPLLILH AEDDPWPFQ LGRKLYSIAA3 00 
PARSFRDFKV QFVPFHSDLG YRHKYIYKSP ELPRILREFL GKSEPEHQH 349 

(2) INFORMATION UBER SEQ ID NO: 67: 

(A) LANGE: 191 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einze! 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: ORF 
(Hi) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 67: 

SGLSRLGPGR NQHAGQDVLC EVAAALHQVL KELLGQGIDY EKILKLTADA KFESGDVKAT 60 
VAVLSFILSS AAKHSVDGES LSSELQQLGL PKERAASLCR CYEEKQSPLQ KHLRVCSLRM120 
NRLAGVGWRV DYTLSSSLLQ SVEEPMVHLR LEVAAAPGTP AQPVAMSLSA DKFQVLLAEL1 8 0 
KQAQTLMSSL G 191 

(2) INFORMATION UBER SEQ ID NO: 68: 

(A) LANGE: 164 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 68: 

FFFFFFFFFF FFFFSLLYFC LFFLLMKTAN NCLSREGKVM LGKVLRSPEP SSQERSEAAG 60 
DLGGQSPGQG LSILEPGLPP EEQFRGRDSI RAGRLHTGLE HPSPQPRELI RVWACFSSAR1 2 0 
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RTWNLSAERD MATGWAGVPG AAATSSRRCT MGSSTDC&RL: ELRV 164 

(2) INFORMATION UBER SEQ ID NO: 69: 

(A) LANGE: 155 Aminosauren 

(B) TYP: Protein 

(C) STRANG: elnzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: ORF 
iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 69: 

NQGCLPKSSS EGVTPYGQGG STQAWNTLLL SPGSSSGSGP ASVLPGGPGT CLLRGTWQQA 60 
GLGSLGQLPP PAAGAPWALP RIAAGWSSGC SPPASPHLPT YSCVGCRPAS ASARGFASPH120 
NSGTGWPRAL WAAPAAAVHW TRIRHRHCAW PHWRG 155 

(2) INFORMATION UBER SEQ ID NO: 70: 

(A) LANGE: 35 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 
ii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 70: 

RRAAVTWVWL GVLCFESAVF TPTEWRTCR LLRFS 35 

(2) INFORMATION UBER SEQ ID NO: 71 : 

(A) LANGE: 32 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: ORF 

(iii) HYPOTHETISCH: ja 
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(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 71: 

KRLTQNTTPP TRPKSQLHVF KTSFKVSYFS TS 

(2) INFORMATION UBER SEQ ID NO: 72: 

(A) LANGE: 37 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 72: 

ENRSNLHVLT TSVGVKTADS KHNTPNQTQV TAARLQN 

(2) INFORMATION UBER SEQ ID NO: 73: 

(A) LANGE: 121 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 

j 

(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 73: 

LVKGMTVLEA VLEIQAITGS RLLSMVPGPA RPPGSCWDPT QCTRTWLLSH TPRRRWISGL 60 
PRASCRLGEE PPPLPYCDQA' YGEELSIRHR ETWAWLSRTD TAWPGAPGVK QARILGELLL12 0 
V 121 

(2) INFORMATION UBER SEQ ID NO: 74: 



(A) LANGE: 115 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 
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(ii) MOLEKULTYP: ORF 
(Hi) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 74: 

QACPWASLAQ GQRTRLRRKL DTPVHGGLGL EGWLSGLEVP GGLPAGTRPS AAGWAVPCCC 60 
CPQGLAWAE DGTLSGWIRS PGSSSSRELR HKAGARLYTC RTQESLLQFL PEAPR 115 

(2) INFORMATION UBER SEQ ID NO: 75: 

(A) LANGE: 1 17 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: ORF 
(Hi) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 75: 

RWHLIRLDQV TRQQQLSRAE AQGRGPAVHL QDPGEPVAVL ARSAEIASSV SLQQEQNQLW 60 
PRWVGGSAFL AMAAATPRQE TAECLEGCNT RSNRQPPLIfL MSDGQALQHL DRHGGWS 117 

(2) INFORMATION OBER SEQ ID NO: 76: 

(A) LANGE: 66 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 
(Hi) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 76: 

PPQERRTIFV LYPRGSGREN MESGFYRLIG PIHKGHDWEK VWEQKENWDF RVQYAHPKLL60 
VAWGMS 66 

(2) INFORMATION UBER SEQ ID NO: 77: 
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(A) LANGE: 81 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzei 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 77: 

ALSTRAMTGK RYGSRRRIGI LGCSTLTLNF WWPGACLEAQ TVKQALLACL LVTTSAPAVL60 
RLHPAPGTPP APEPPLSPCD G 81 

(2) INFORMATION UBER SEQ ID NO: 78: 

(A) LANGE: 104 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzei 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 78: 

TLLTIHIWTR DTVHEVHPSQ GDSGGSGAGG VPGAGWSLKT AGAEWTSKQ ASRACLTVWA 60 
SRHAPGHQKF RVSVLHPKIP ILLLLPYLFP VMALVDRAYQ SIES 104 

(2) INFORMATION UBER SEQ ID NO: .79: 

(A) LANGE: 104 Aminosauren 

(B) TYP: Protein . 

(C) STRANG: einzei 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 
i) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
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(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 79: 

PSSPRAVRHS GHQDSKMASV VPVKDKKLLE VKLGELPSWI LMRDFSPSGI FGAFQRGYYR 60 
YYNKYINVKK GSISGITMVL ACYVLFSYSF SYKHLKHERL RKYH 104 

(2) INFORMATION UBER SEQ ID NO: 80: 

(A) LANGE: 82 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 80: 

RRGASRGLPW CWHATCSLAT PFPTSISSTS GSANTTEEDT LCTPPPHDLG PSPSVRNTIS60 
IVAESFHILI GINLQIKHDW YV 82 

(2) INFORMATION UBER SEQ ID NO: 81: 

(A) LANGE: 115 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 81: 

KDSATIEIVF LTEGLGPRSW GGGVQSVSSS WFAEPLVLE MLVGKGVAKE HVACQHHGNP 60 
RDAPLLHIDV LVWPWTSn KRSENATRTE VPHQDPAWQL PQFDLQKFLV LHWYN 115 

(2) INFORMATION UBER SEQ ID NO: 82: 

(A) LANGE: 187 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 
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b) eine allelische Variation der unter a) genannten Nukleinsaure-Sequenzen 
oder 

c) eine Nukleinsaure-Sequenz, die komplementar zu den unter a) oder b) genannten Nukleinsaure-Sequenzen 
ist. 

2. Eine Nukleinsaure-Sequenz gemaB einer der Sequenzen Seq. ID Nos. 1-50, oder eine komplementare oder alle- 5 
lische Variante da von. 

3. Nukleinsaure-Sequenz Seq. ID No. 1 bis Seq. ID No. 50, dadurch gekennzeichnet, daB sie in Blasentumorge- 
webe erhoht exprimiert sind. 

4. BAG, PAC und Cosmid-Klone, enthaltend funktionelle Gene und ihre chromosomale Lokalisation, entsprechend 
den Sequenzen Seq. ID. No. 1 bis Seq. ID No. 50, zur Verwendung als Vehikel zum Gentransfer. 10 

5. Eine Nukleinsaure-Sequenz gemaB den Anspriichen 1 bis 4, dadurch gekennzeichnet, daB sie eine 90%ige Ho- 
mologie zu einer hurnanen Nukleinsaure-Sequenz aufweist. 

6. Eine Nukleinsaure-Sequenz gemaB den Anspriichen 1 bis 4, dadurch gekennzeichnet, daB sie eine 95%ige Ho- 
mologie zu einer hurnanen Nukleinsaure-Sequenz aufweist. 

7. Eine Nukleinsaure-Sequenz, umfassend einen Teil der in den Anspriichen 1 bis 6 genannten Nukleinsaure-Se- 15 
quenzen, in solch einer ausreichenden GroBe, daB sie mil den Sequenzen gemaB den Anspriichen 1 bis 6 h ybridisie- 
ren. 

8. Ein Nukleinsaure-Sequenz gemaB den Anspriichen 1 bis 7, dadurch gekennzeichnet, daB die GroBe des Frag- 
ments eine Lange von mindestens 50 bis 4500 bp aufweist. 

9. Eine Nukleinsaure-Sequenz gemaB den Anspriichen 1 bis 7, dadurch gekennzeichnet, daB die GroBe des Frag- 20 
ments eine Lange von mindestens 50 bis 4000 bp aufweist. 

10. Eine Nukleinsaure-Sequenz gemaB einern der Anspriiche 1 bis 9, die mindestens eine Teilsequenz eines biolo- 
gisch aktiven Polypeptids kodiert. 

1 1 . Eine Expressionskassette, umfassend ein Nukleinsaure-Fragment oder eine Sequenz gemaB einem der Ansprii- 
che 1 bis 9, zusammen mit mindestens einer Kontroll- oder regulatorischen Sequenz. 25 

12. Eine Expressionskassette, umfassend ein Nukleinsaure-Fragment oder eine Sequenz gemaB Anspruch 11, wo- 
rin die Kontroll- oder regulatorische Sequenz ein geeigneter Promotor ist. 

13. Eine Expressionskassette gemaB einem der Anspriiche 11 und 12, dadurch gekennzeichnet, daB die auf der Kas- 
sette befindlichen DNA-Sequenzen ein 

Fusionsprotein kodieren, das ein bekanntes Protein und ein biologisch aktives Polypeptid-Fragment umfaBt. 30 

14. Verwendung der Nukleinsaure-Sequenzen gemaB den Anspriichen 1 bis 10 zur Herstellung von \follangen-Ge- 
nen. 

15. Ein DNA-Fragmcnt, umfassend ein Gen, das aus der Verwendung gemaB Anspruch 14 erhaltlich ist. 

16. Wirtszelle, enthaltend als heterologen Teil ihrer exprimierbaren genetischen Information ein Nukleinsaure- 
Fragment gemaB einem der Anspriiche 1 bis 10. 35 

17. Wirtszelle gemaB Anspruch 16, dadurch gekennzeichnet, daB es ein prokaryontisches oder eukaryontischc Zell- 
system ist. 

18. Wirtszelle gemaB einem der Anspriiche 16 oder 17, dadurch gekennzeichnet, daB das prokaryontische Zellsy- 
stem E. coli und das eukaryontische Zellsystem ein tierisches, humanes oder Hefe-Zellsystem ist 

19. Ein Verfahren zur Herstellung eines Polypeptids oder eines Fragments, dadurch gekennzeichnet, daB die Wirts- 40 
zellen gemaB den Anspriichen 16 bis 18 kultiviert werden. 

20. Ein Antikorper, der gegen ein Polypeptid oder ein Fragment gerichtet ist, welches von den Nukleinsauren der 
Sequenzen Seq. ID No. 1 bis Seq. ID No. 50 kodiert wird, das gemaB Anspruch 19 erhaltlich ist. 

21. Ein Antikorper gemaB Anspruch 20, dadurch gekennzeichnet, daB er monoklonal ist. 

22. Ein Antikorper gemaB Anspruch 20 dadurch gekennzeichnet, daB er ein Phage-Di splay- Antikorper ist. 45 

23. Polypeptid-Teilsequenzen, gemaB den Sequenzen Seq. ID Nos. ORF 51-106. 

24. Polypeptid-Teilsequenzen gemaB Anspruch 22, mit mindestens 80%iger Homologie zu diesen Sequenzen. 

25. Ein aus einem Phage-Display hervorgegangenen Polypeptid, welches an die Polypeptid-Teilsequenzen gemaB 
Anspruch 24 binden kann. 

26. Polypeptid-Teilsequenzen gemaB Anspruch 22, mit mindestens 90%iger Homologie zu diesen Sequenzen. 50 

27. Verwendung der Polypeptid-Teilsequenzen gemaB den Sequenzen Seq. ID No. 51-106, als Tools zum Auffin- 
den von WirkstofFen gegen den Blasentumor. 

28. Verwendung der Nukleinsaure-Sequenzen gemaB den Sequenzen Seq. ID No. 1 bis Seq. ID No. 50 zur Expres- 
sion von Polypeptide^ die als Tools zum Auffinden von Wirkstoffen gegen den Blasentumor verwendet werden 
konnen. 55 

29. Verwendung der Nukleinsaure-Sequenzen Seq. ID No. 1 bis Seq. ID No. 50 in sense oder antisense Form. 

30. Verwendung der Polypeptid-Teilsequenzen Seq. ID No. 51-106 als Arzneimittel in der Gentherapie zur Be- 
handlung des Blasentumors. 

31. Verwendung der Polypeptid-Teilsequenzen Seq. ID No. 51-106, zur Herstellung eines Arzneimittels zur Be- 
handlung gegen den Blasentumor. 60 

32. ArzneimitteL, enthaltend mindestens eine Polypeptid-Teilsequenz Seq. ID No. 51-106. 

33. Eine Nukleinsaure-Sequenz gemaB den Anspriichen 1 bis 10, dadurch gekennzeichnet, daB es eine genomische 
Sequenz ist. 

34. Eine Nukleinsaure-Sequenz gemaB den Anspriichen 1 bis 10, dadurch gekennzeichnet, daB es eine mRNA-Se- 
quenz ist. 65 

35. Genomische Gene, ihre Promotoren, Enhancer, Silencer, Exonstruktur, Intronstruktur und deren SpleiBvarian- 
ten, erhaltlich aus den cDNAs der Sequenzen Seq. ID No. 1 bis Seq. ID No. 50. 

36. Verwendung der genomischen Gene gemaB Anspruch 33, zusammen mit geeigneten regulativen Elementen. 
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37. Verwendung gemaB Anspruch 34, dadurch gekennzeichnet, daB das regulative Element ein geeigneter Promo- 
tor und/oder Enhancer ist. 

38. Eine Nukleinsaure-Sequenz gemaB den Anspriichen 1 bis 7, dadurch gekennzeichnet, daB die GroBe des Frag- 
ments eine Lange von mindestens 300 bis 3500 bp aufweist 

Hierzu lOSeite(n) Zeichnungen 
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Partielle \ 
cDNA > 
Sequenz z.B. 
ESTo.Contig 

. s 7 





GCCTCAAGTTATC 




WHILE G > Q , 






r 



Elektronischer Northern-Blot 



Fishers Exakter Test 



IF Ho 



► EXIT 



Automatische Verlangerung 




ATGTCCTA GCCTCAAGTTATC AGATGCAA. 
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HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 82: 

ARAARGARRT SRAVTPTCAT PAGPMPCSRL PPSLRCSLHS ACCSGDPASY RLWGAPLQPT 60 
LGWPQASVP LLTDIAQWEP VLVPEAHPNA SLTMYVCTPV PHPDPPMALS RTPTRQISSS120 
DTDPPADGPS NPLCCCFHGP AFSTLNPVLR HLFPQEAFPA HPIYDLSQVW SWS PAPSRG 180 
QALRRAQ 187 

(2) INFORMATION UBER SEQ ID NO: 83: 

(A) LANGE: 241 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 



(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 83: 

FFFFGLSNRC LLRAYAVLRL PFREPHECEA WPLPPGLQAP SLETPRNSRR LLSSSSTQST 60 
SSQPLLGPPE CLSPAGCGGH HGPDLAQVID GVGREGFLGE EVPEHRVKGG EC WAMETAAE 120 
RVGGAICRRI CVTRADLPGG SPGEGHGRVR VGHRGADIHG QTCVRMCLRN QDRLPLGQVC180 
EEWHRGLGHH TQCGLQRGPP EPIAGRVPRA AGRVQGAAQG WRQPAAGHGP RWRCTSRCHS2 4 0 
T 241 

(2) INFORMATION UBER SEQ ID NO: 84: 

(A) LANGE: 113 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 84: 



in 
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MGWAGKASWG KRCLSTGLRV ENAGPWKQQQ RGLEGPSACG 3VSLELICRV GVLERAMGGS 60 
GWGTGVQTYM VRLALGCASG TRTGSHWARS VRSGTEAWGT" TPSVGOSGAP QSL 113 

(2) INFORMATION OBER SEQ ID NO: 85: 

(A) L^NGE: 107 Aminosauren 

(B) TYP: Protein 

(C) STRANG: elnzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 
(Hi) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 85: 

AFLSFLFSER FKASTTLFPP SLLNLICTKS FALVGWETA LSLSTSVREC EPPWQVPVQG 60 
PAALHLGRVT GAPAVCPKAS PWPFGLSLGR FRTEHQGRQA FQGISIN 107 

(2) INFORMATION UBER SEQ ID NO: 86: 

(A) LANGE: 107 Aminosauren 
(B.)TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 
(Hi) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 86: 

LRNRLWKVKL EEPDLISPTS KTPSEQQRPQ HPPRTGDSIF MATPCGGRLT TSHHIIPELS 60 
SSSGMTPSPP PPSSSFSSFC LFVSELSCLS FFLRDSKPPR LCFPRPF 107 

(2) INFORMATION UBER SEQ ID NO: 87: 

(A) LANGE: 115 Aminosauren 

(B) TYP: Protein 

(C) "STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 
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(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 87: 

IQKGRGKQSR GGFESLRKKE RQESSETKRQ KDEKEEEGGG GDGVIPLEEL SSGMMWWLW 60 
NRPPQGVAMK MESPVRGGCC GRCCSDGVFD VGLMRSGSSS FTFQSRFLSQ WQWL 115 

(2) INFORMATION UBER SEQ ID NO: 88: 

(A) LANGE: 124 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: ORF 



(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 88: 

CSVRNLPRLR PKGQGEAFGH TAGAPVTRPR WRAAGPCTGT CQGGSHSRTL VLRLKAVSTT 60 
PTSANDLVQI KFRRDGGNRV VEALNLSEKR KDRKAQKQRD RRMKKKKREV VGTASSRWRS120 
SALG 124 

(2) INFORMATION UBER SEQ ID NO: 89: 

(A) LANGE: 198 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 



(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 

(vi) HERKUNFT: . 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 89: 

EGAGGEWRCP AAGGPRGEDG PPGLRLTERA GLTTRHLTGT ADPSQKHNLR DPSSSKDERI 60 
QQTQPNREGA CSLNLHGLRY LCKPGVAPVL RVILRSCLFP NGFTSGSCRF SLGLSLILKW120 
VAGGRGWLRP LLALRLQSWE QDTSPEFHFF SCPNHAHTIV QNQSTFEKWL HGHPPGPRKL1 8 0 
HSKGLFTWQQ NPSPAVSP 198 

(2) INFORMATION UBER SEQ ID NO: 90: 
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(A) LANGE: 124 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 90: 

LPPVEPPVLK GSCRKDMHRN EERERSENEV WRARPGPTAQ GSSPPPDAPF HPPPQCLLSP 60 
KAPASEVLGA HPPQCGQGGK GQVLDTAKCP EMTLLLTHFF GPWQSPTCPQ HGAPGRTGRQ120 
EGGW 124 

(2) INFORMATION LIBER SEQ ID NO: 91: 

(A) LANGE: 147 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 91: 

NSGEVSCSQL CSLRASRGRS HPLPPATHFK MRLRPRLKRQ LPEVNPFGKR HERRMTLRTG 60 
ATPGLHKYRR PWRLRLQAPS LLGCVCCILS SLELLGSLRL CFWDGSAVPV RCLWRPALS120 
VSLSPGGPSS PLGPPAAGHL HSPPAPS 147 

(2) INFORMATION UBER SEQ ID NO: 92: 



(A) LANGE: 374 Aminosauren 

(B) TYP: Protein ' 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(II) MOLEKULTYP: ORF 
(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
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(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 92: 

SREAPESRRW AVWRSLESLP RHQLLCLPVG APPAPAMLSA LARPASAALR RSFSTSAQNN 60 
AKVAVLGASG GIGQPLSLLL KNSPLVSRLT LYDIAHTPGV AADLSHIETK AAVKG YLGPE1 2 0 
QLPDCLKGCD VWIPAGVPR KPGMTRDDLF NTNATIVATL TAACAQHCPE AMICVIANPV1 8 0 
NSTIPITAEV FKKHGVYNPN KIFGVTTLDI VRANTFVAEL KGLDPARVNV PVIGGHAGKT2 4 0 
IIPLISQCTP KVDFPQDQLT ALTGRIQEAG TEWKAKAGA GSATLSMAYA GARFVFSLVD300 
AMNGKEGWE CSFVKSQETE CTYFSTPLLL GKKGIEKNLG IGBCVSSFEEK MISDAIPELK360 
ASIKKGEDFV KTLK 374 

(2) INFORMATION OBER SEQ ID NO: 93: 

(A) LANGE: 238 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 93: 

LNEGTFHNTF LSIHCIHKGE DKAGAGIRHG EGGRACSGFS LDHLRAGLLD PPSECCQLVL 60 
GKVHLGGALR DQGDDGLPSM ATNDRDVDSS WIQTLQLCNK GVGSDDVQGR HAEDFVGWH120 
SMLLENFCCD GDGGINRIGN DADHGFRAVL GTGSGQGGHN RGIGVEQWP GHAWLSGDSS180 
RNNYHITTFQ AVRQLFRSEV AFHSGFGLDV AQICGHSGCV RDIIEGQAAH QGAVLQEK 238 

(2) INFORMATION UBER SEQ ID NO: 94: 

(A) LANGE: 242 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 94: 

EAGEEKAEEE GVAEEEGVNK FSYPPSHREC CPAVEEEDDE EAVKKEAHRT STSALSPGSK 60 
PSTWVSCPGE EENQATEDKR TERSKGARKT SVSPRSSGSD PRSWEYRSGE ASEEKEEKAH12 0 
KETGKGEAAP GPQSSAPAQR PQLKSWWCQP SDEEEGEVKA LGAAEKDGEA ECPPCIPPPS180 
AFLKAWVYWP GEDTEEEEDE EEDEDSDSGS DEEEGEAEAS S STPATGVFL KSWVYQPGED2 4 0 
TQ 242 
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(2) INFORMATION UBER SEQ ID NO: 95: 

(A) LANGE: 237 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 

20 : 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 95: 

RPGCIGQERT LDSEDKEDDS EAALGEAESD PHPSHPDQRA HFRGWGYRPG KETEEEEAAE 60 
25 DWGEAEPCPF RVAIYVPGEK PPPPWAPPRL PLRLQRRLKR PETPTHDPDP ETPLKARKVR120 
FSEKVTVHFL AVWAGPAQAA RQGPWEQLAR DRSRFARRIT QAQEELSPCL T PAARARAWA1 8 0 
RLRNPPLAPI PALTQTLPSS SVPSSPVQTT PLSQAVATPS RSSAAAAAAL DLSGRRG 237 

(2) INFORMATION UBER SEQ ID NO: 96: 

(A) LANGE: 890 Aminosauren 

(B) TYP: Protein 

35 (C) STRANG: einzel 

(D)TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

40 

(Hi) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



55 



so (xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 96: 

QDEHLITFFV PVFEPLPPQY FIRWSDRWL SCETQLPVSF RHLILPEKYP PPTELLDLQP 60 
LPVSALRNSA FESLYQDKFP FFNPIQTQVF NTVYNSDDNV FVGAPTGSGK TICAEFAILR120 
MLLQSSEGRC VYITPMEALA EQVYMDWYEK FQDRLNKKW LLTGETSTDL KLLGKGN 1 1 1 1 8 0 
STPEKWDILS RRWKQRKNVQ NINLFWDEV HLIGGENGPV LEVICSRMRY ISSQIERPIR240 
IVALSSSLSN AKDVAHPJLGC SATSTFNFHP NVRPVPLELH IQGFNISHTQ TRLLSMAKPV3 0 0 
YHAITKHSPK KPVIVFVPSR KQTRLTAIDI LTTCAADIQR QRFLHCTEKD LIPYLEKLSD360 
STLKETLLNG VGYLHEGLSP MERRLVEQLF SSGAIQVWA SRSLCWGMNV AAHLVIIMDT420 
QYYNGKIHAY VDYPIYDVLQ MVGHANRPLQ DDEGRCVIMC QGSKKDFFKK FLYEPLP VES 480 
60 HLDHCMHDHF NAEIVTKTIE NKQDAVDYLT WTFLYRRMTQ NPNYYNLQGI SHRHLSDHLS5 4 0 
ELVEQTLSDL EQSKCISIED EMDVAPLNLG MIAAYYYINY TTIELFSMSL KAKTKVRGLI 600 
EIISNAAEYE NIPIRHHEDN LLRQLAQKVP HPCLNNPKFND PHVKTNLLLQ AHLSRMQLS A6 60 
ELQSDTEEIL SKAIRLIQAC VDVLSSNGWL SPALAAMELA QMVTQAMWSK DSYLKQLPHF720 
TSEHIKRCTD KGVESVFDIM EMEDEERNAL LQLTDSQIAD VARFCNRYPN IELSYEWDK7 8 0 
65 DSIRSGGPW VLVQLEREEE VTGPVIAPLF PQKREEGWWV VIGDAKSNSL ISIKRLTLQQ840 
KAKVJCLDFVA PATGAHNYTL YFMSDAYMGC DQEYKFSVDV KEAETDSDSD 890 

(2) INFORMATION UBER SEQ ID NO: 97: 
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(A) LANGE: 281 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKClLTYP: ORF 
(Hi) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 97: 

GDGSAEHGPR PLAAPLVTSR GAPASARPRG ALPGGSAPSA PHGQLPGRAQ PAPVSGPPPT 60 
SGLCHFDPAA PWPLWPGPWQ LPPHPQDWPA QPDIPQDWVS FLRSFGQLTL CPRNGTVTGK120 
WRGSHWGLL TTLNFGDGPD RNKTRTFQAT VLGSQMGLKG SSAGQLVLIT ARVTTERTAG180 
TCLYFSAVPG ILPSSQPPIS CSEEGAGNAT LSPRMGEECV SVWSHEGLVL TKLLTSEELA24 0 
LCGSRLLVLG SFLLLFCGLL CCVTAMCFHP RRESHWSRTR L 281 

(2) INFORMATION UBER SEQ ID NO: 98: 

(A) LANGE: 206 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 
(Hi) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 98: 

RLEPRSVTRS RRAVSRLSAR PGKVSAVMAF LASGPYLTHQ QKVLRLYKRA LRHLESWCVQ 60 
RDKYRYFACL MRARFEEHKN EKDMAKATQL LKEAEEEFWY RQHPQPYIFP DS PGGTS YER12 0 
YDCYKVPEWC LDDWHPSEKA MYPDYFAKRE QWKKLRRESW EREVKQLQEE T PPGGPLTEA1 8 0 
LPPARKEGDL PPLWWYIVTR PRERPM 206 

(2) INFORMATION UBER SEQ ID NO: 99: 

(A) LANGE: 139 Aminosauren 

(B) -TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 
(Hi) HYPOTHETISCH: ja 
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(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 99: 

PLVPSFPSAV SSTVLSWQSN QDTLPSQKDA SHLSTILGPC SNRISHRRCP QESQGRCMAV 60 
DADGTRILPR PPSAAGWPSP YPFHSYVLQT GLSSNKQSIG ICLSGRTTTR GGVAPAYKAA120 
TPFADGSGRV PTPRTPLRR 139 

(2) INFORMATION UBER SEQ ID NO: 100: 

(A) LANGE: 79 Aminosauren 

(B) TYP: Protein 

(C) STRANG: elnzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 100: 

APFWDLVAIV SLIGGAPRRV REDVWLWMLT VPEFFLGLLQ QLGGLRHILF ILMFFKPGSH60 
QTSKVSVFVS LDAPRLEVA 79 

(2) INFORMATION UBER SEQ ID NO: 101: 

(A) LANGE: 89 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 101: 

VGGACAVALP QAAAMAGQED PVQREIHQDW ANREYIEIIT SSIKKIADFL NSFDMSCRSR60 
LATLNEKLTA LERRIEYIEA RVTKGETLT "89 

(2) INFORMATION 0BER SEQ ID NO: 102: 
(A) LANGE: 88 Aminosauren 
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(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 102: 

NSAVLLLGSC FTQHRPHGKG PSSLQLLPFS LKSNRAYSCF SFFKSVAFGL CHLGCGWCG 60 
KKFRGTVGND VRHFTFSVTF YTSTCQCI 88 

(2) INFORMATION UBER SEQ ID NO: 103: 

(A) LANGE: 89 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKOLTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 103: 

HFIHLLVNVF ETFTAKSLGL FVKVLLTSIF LSLSLKLSLK FSLPLHCFCE QSLSPPHLWW60 
EVRGSPGQDT HALAMWLPEN WANPPVSC 89 

(2) INFORMATION OBER SEQ ID NO: 104: 

(A) LANGE: 240 Aminosauren . 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 
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(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 104: 

REQILFIEIR DTAKGGETEQ PPSLSPLHGG RMPEMGEGIQ SLARETQSHR GRRQGWDATW 60 
VTRCRESLNR GGAGAGKRAG ALAHHVFLAL IEPNLAEREA SEEEVKACSD ETWADLLVK1 2 0 
WYVLGAILK IFLREGNVLN QHSGMDIEKY SEHYQHDHSP GAEDDAAGGQ LRPTAQERRH 180 
KEGSRGSPRC KRARKAVGES PGCPRRGAEG AWPRPQPSHP GAAGGKGWAP LRT FAGDGAV2 4 0 

(2) INFORMATION UBER SEQ ID NO: 105: 

(A) LANGE: 136 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vl) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 105: 

RLYMFWGPFS KSFSVKGTSS INTAEWTLKN IPSIINMITP QALRTMPQAA SFGPQHRSDA 60 
IKKGVGDRRG ASGLGKRWEK AQDALAGGQR GRGPGLNHPI RGRQAEKAGL LSGLSRETAP120 
SETKTAPGET FLDLCS 136 

(2) INFORMATION UBER SEQ ID NO: 106: 

(A) LANGE: 173 Aminosauren 

(B) TYP: Protein 

(C) STRANG: einzel 

(D) TOPOLOGIE: linear 

(ii) MOLEKULTYP: ORF 

(iii) HYPOTHETISCH: ja 



(vi) HERKUNFT: 

(A) ORGANISMUS: MENSCH 



(xi) SEQUENZ-BESCHREIBUNG: SEQ ID NO 106: 

LQRSRKVSPG AVLVSDGAV& RESPERSPAF SACRPRMGWL RPGPRPLCPP ARASWAFSHR 60 
FPSPLAPRRS PTPFFMASLL CCGPKLAACG IVLSAWGVIM LIMLGIFFNV HSAVLIEDVP120 
FTEKDFENGP QNIYNLYEQV SYNCFIAAGL YLLLGGFSFC QVRLNKRKEY MVR 173 



Patentanspriiche 

1. Eine Nukleinsaure-Sequenz, die ein Genpnxinkt oder ein Teil davon kodiert, nmfassend 

a) eine Nukleinsaure-Sequenz, ausgewahlt aus der Gruppe Seq. ID No. 2-5, 7-13, 16, 18, 20, 23, 26-27, 
31-32, 36, 45. 
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DECLARATION OF AUDREY D, GODDARD, Ph.D UNDER 37 CF.R. § 1,132 



Assistant Commissioner of Patents 
Washington, D.C. 20231 

Sir: 

I, Audrey D. Goddard, PLD. do hereby declare and say as follows: 

1 . I am a Senior Clinical Scientist at the Experimental Medicine/BioOncology, Medical 
Affairs Department of Genentech, Inc., South San Francisco, California 94080. 

2. Between 1 993 and 200 1 , 1 headed the DNA Sequencing Laboratory at the Molecular 
Biology Department of Genentech, Inc. During this time, my responsibilities included the 
identification and characterization of genes contributing to the oncogenic process, and determination 
of the chromosomal localization of novel genes. 

3 . My scientific Curriculum Vitae, including my list of publications, is attached to and 
' forms part of this Declaration (Exhibit A). 
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4. I am familiar with a variety of techniques known in the art for detecting and 
quantifying the amplification of oncogenes in cancer, including the quantitative TaqMan PCR (i.e., 
"gene amplification") assay described in the above captioned patent application, 

5. The TaqMan PCR assay is described, for example, in the following scientific 
publications: Higuchi et al, Biotechnology 10:413-417 (1992) (Exhibit B); Livak et al, PCR 
Methods Appl., 4:357-362 (1995) (Exhibit C) and Heid et al, Genome Res. 6:986-994 (1996) 
(Exhibit D). Briefly, the assay is based on the principle that successful PCR yields a fluorescent 
signal due to Taq DNA polymerase-mediated exonuclease digestion of a fluorescently labeled 
oligonucleotide that is homologous to a sequence between two PCR primers! The extent of 
digestion depends directly on the amount of PCR, and can be quantified accurately by measuring the 
increment in fluorescence that results from decreased energy transfer. This is an extremely sensitive 
technique, which allows detection in the exponential phase of the PCR reaction and, as a result, 
leads to accurate determination of gene copy number. 

6. The quantitative fluorescent TaqMan PCR assay has been extensively and 
successfully used to characterize genes involved in cancer development and progression. 
Amplification of protooncogenes has been studied in a variety of human tumors, and is widely 
considered as having etiological, diagnostic and prognostic significance. This use of the quantitative . 
TaqMan PCR assay is exemplified by the following scientific publications: Pennica et al, Proc. 
Natl. Acad. Sci. USA 95(25).i4717-14722 (1998) (Exhibit E); Pitti et al, Nature 
396(67 12);699-703 (1998) (Exhibit F) and Bieche et al, Int. J. Cancer 78:661-666 (1998) (Exhibit 
G), the first two of which I am co-author. In particular, Pennica et al have used the quantitative 
TaqMan PCR assay to study relative gene amplification of WISP and c-myc in various cell lines, 
colorectal tumors and normal mucosa. Pitti et al studied the genomic amplification of a decoy 
receptor for Fas ligand in lung and colon cancer, using the quantitative TaqMan PCR assay. Bieche 
et al used the assay to study gene amplification in breast cancer. 
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7. It is my personal experience that the quantitative TaqMan PCR technique is 
technically sensitive enough to detect at least a 2-fold increase in gene copy number relative to 
control. It is further my considered scientific opinion that an at least 2-fold increase in gene copy 
number in a tumor tissue sample relative to a normal (i:e., non-tumor) sample is significant and 
useful in that the detected increase in gene copy number in the tumor sample relative to the normal 
sample serves as a basis for using relative gene copy number as quantitated by the TaqMan PCR 
technique as a diagnostic marker for the presence or absence of tumor in a tissue sample of unknown 
pathology. Accordingly, a gene identified as being amplified at least 2-fold by the quantitative 
TaqMan PCR assay in a rumor sample relative to a normal sample is useful as a marker for the 
diagnosis of cancer, for monitoring cancer development and/or for measuring the efficacy of cancer 
therapy. 

8. I declare further that all statements made herein of my own knowledge are true and 
that all statements made on information and belief are believed to be true. I declare that these 
statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code, and that such willful false statements may jeopardize the validity of the application or any 
patent issuing thereoa 



Date 



Audrey D. Goddard, Ph.D. 



AUDREY D. GODDARD, Ph.D. 



Genentech, Inc. 
1 DNA Way 

South San Francisco, CA, 94080 

650.225.6429 

goddarda@gene.com 



110 Congo St. 

San Francisco, CA f 94131 

415.841.9154 

415.819.2247 (mobile) 

agoddard@pacbell.net 



PROFESSIONAL EXPERIENCE 

Genentech, Inc. 1993-present 
South San Francisco, CA 

2001 - present Senior Clinical Scientist 

Experimental Medicine / BioOncology, Medical Affairs 

Responsibilities: 

• Companion diagnostic oncology products 

• Acquisition of clinical samples from Genentech's clinical trials for translational research 

• Translational research using clinical specimen and data for drug development and 
diagnostics 

• Member of Development Science Review Committee, Diagnostic Oversight Team, 21 CFR 
Part 1 1 Subteam 

Interests: 

• Ethical and legal implications of experiments with clinical specimens and data 

• Application of pharmacogenomics in clinical trials 



1998 - 2001 Senior Scientist 

Head of the DNA Sequencing Laboratory, Molecular Biology Department, Research 
Responsibilities: ' . ■ 

• Management of a laboratory of up to nineteen -including postdoctoral fellow, associate 
scientist, senior research associate and research assistants/associate levels 

• Management of a $750K budget 

• DNA sequencing core facility supporting a 350+ person research facility. 

• DNA sequencing for high throughput gene discovery, - ESTs, cDNAs t and constructs 

• Genomic sequence analysis and gene identification 

• DNA sequence and primary protein analysis 

Research: 

• Chromosomal localization of novel genes 

• Identification and characterization of genes contributing to the oncogenic process 

• Identification and characterization of genes contributing to inflammatory diseases 

• Design and development of schemes for high throughput genomic DNA sequence analysis 

• Candidate gene prediction and evaluation 
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1993-1998 



Scientist 



Head of the DNA Sequencing Laboratory, Molecular Biology Department, Research 
Responsibilities 

• DNA sequencing core facility supporting a 350+ person research facility 

• Assumed responsibility for a pre-existing team of five technicians and expanded the group 
into fifteen, introducing a level of middle management and additional areas of research 

• Participated in the development of the basic plan for high throughput secreted protein 
discovery program - sequencing strategies, data analysis and tracking, database design 

• High throughput EST and cDNA sequencing for new gene identification. 

• Design and implementation of analysis tools required for high throughput gene identification. 

• Chromosomal localization of genes encoding novel secreted proteins. 

Research: 

• Genomic sequence scanning for new gene discovery. 

• Development of signal peptide selection methods. 

• Evaluation of candidate disease genes. 

• Growth hormone receptor gene SNPs in children with Idiopathic short stature 

Imperial Cancer Research Fund 1989-1992 
London, UK with Dr. Ellen Solomon 

6/89-12/92 Postdoctoral Fellow 

• Cloning and characterization of the genes fused at the acute promyelocyte leukemia 
translocation breakpoints on chromosomes 17 and 15. 

• Prepared a successfully funded European Union multi-center grant application 

McMaster University 1983 
Hamilton, Ontario, Canada with Dr. G. D. Sweeney 

5/83 - 8/83: NSERC Summer Student 

• In vitro metabolism of p-naphthoflavone in C57BI/6J and DBA mice 



EDUCATION 



Ph.D. 

"Phenotypic and genotypic effects of mutations in 
the human retinoblastoma gene." 
Supervisor: Dr. R. A. Phillips 



University of Toronto 
Toronto, Ontario, Canada. 
Department of Medical 
Biophysics. 



1989 



Honours B.Sc 

"The in vitro metabolism of the cytochrome P-448 
inducer p-naphthoflavone in C57BL/6J mice." 
Supervisor: Dr. G. D. Sweeney 



McMaster University, 
Hamilton, Ontario, Canada. 
Department of Biochemistry 



1983 
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ACADEMIC AWARDS 



Imperial Cancer Research Fund Postdoctoral Fellowship 1989-1992 

Medical Research Council Studentship 1983-1988 

NSERC Undergraduate Summer Research Award 1983 

Society of Chemical Industry Merit Award (Hons. Biochem.) 1983 

Dr. Harry Lyman Hooker Scholarship 1981-1983 

J.L.W. Gill Scholarship 1981-1982 

Business and Professional Women's Club Scholarship 1 980-1 981 

Wyerhauser Foundation Scholarship 1979-1980 



INVITED PRESENTATIONS 

Genentech's gene discovery pipeline: High throughput identification, cloning and 
characterization of novel, genes. Functional Genomics: From Genome to Function Litchfield 
Park, A2, USA. October 2000 

High throughput identification, cloning and characterization of novel genes. G2K:Back to 
Science, Advances in Genome Biology and Technology I. Marco Island, FL, USA February 
2000 

Quality control in DNA Sequencing: The use of Phred and Phrap. Bay Area Sequencing 
Users Meeting, Berkeley, CA, USA. April 1999 

High throughput secreted protein identification and cloning. Tenth International Genome 
Sequencing and Analysis Conference, Miami, FL, USA. September 1998 

The evolution of DNA sequencing: The Genentech perspective. Bay Area Sequencing Users 
Meeting, Berkeley, CA, USA. May 1998 

Partial Growth Hormone Insensitivity: The role of GH-receptor mutations in Idiopathic Short 
Stature. Tenth Annual National Cooperative Growth Study Investigators Meeting San 
Francisco, CA, USA. October, 1996 

Growth hormone (GH) receptor defects are present in selected children with non-GH-deficient 
short stature: A molecular basis for partial GH-insensitivity. 76 th Annual Meeting of The 
Endocrine Society/Anaheim, CA, USA. June 1994 

A previously uncharacterized gene, myl, is fused to the retinoic acid receptor alpha gene in 
acute promyelocyte leukemia. XV International Association for Comparative Research on 
Leukemia and Related Disease, Padua, Italy. October 1991 
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PATENTS 

Goddard A, Godowski PJ, Gurney AL NL2 Tie ligand homoiogue polypeptide. Patent 
Number: 6,455,496. Date of Patent: Sept. 24, 2002. 

Goddard A, Godowski PJ and Gurney AL. NL3 Tie ligand homoiogue nucleic acids. Patent 
Number: 6,426,218. Date of Patent: July 30, 2002. 

Godowski P, Gurney A, Hillan KJ, Botstein D, Goddard A, Roy M, Ferrara N t Tumas D, 
Schwall R. NL4 Tie ligand homoiogue nucleic acid. Patent Number: 6,4137,770. Date of 
Patent: July 2, 2002. 

Ashkenazi A, Fong S, Goddard A, Gurney AL, Napier MA, Tumas D, Wood Wl. Nucleic acid 
encoding A-33 related antigen poly peptides. Patent Number: 6,410,708. Date of Patent:: 
Jun. 25, 2002. 

Botstein DA, Cohen RL, Goddard AD, Gurney AL, Hillan KJ, Lawrence DA, Levine AJ, 
Pennica D, Roy MA and Wood Wl. WISP polypeptides and nucleic acids encoding same. 
Patent Number 6,387,657. Date of Patent: May 14, 2002. 

Goddard A, Godowski PJ and Gurney AL. Tie ligands. Patent Number: 6,372,491. Date of 
Patent: April 16, 2002. 

Godowski PJ, Gurney AL, Goddard A and Hillan K. TIE ligand homoiogue antibody. Patent 
Number: 6,350,450. Date of Patent: Feb. 26, 2002. 

Fong S, Ferrara N, Goddard A, Godowski PJ, Gurney AL, Hillan K and Williams PM. Tie 
receptor tyrosine kinase ligand homologues. Patent Number: 6,348,351. Date of Patent: 
Feb. 19,2002. 

Goddard A, Godowski PJ and Gurney AL: Ligand homologues. Patent Number: 6,348,350. 
Date of Patent: Feb. 19, 2002. 

Attie KM, Carlsson LMS, Gesundheit N and Goddard A. Treatment of partial growth 
hormone insensitivity syndrome. Patent Number: 6,207,640. Date of Patent: March 27, 
2001. 

Fong S, Ferrara N, Goddard A, Godowski PJ, Gurney AL, Hillan K and Williams PM. Nucleic 
acids encoding NL-3. Patent Number: 6,074,873. Date of Patent: June 13, 2000 

Attie K, Carlsson LMS, Gesunheit N and Goddard A. Treatment of partial growth hormone 
insensitivity syndrome. Patent Number: 5,824,642. Date of Patent: October 20, 1998 

Attie K, Carlsson LMS, Gesunheit N and Goddard A. Treatment of partial growth hormone 
insensitivity syndrome. Patent Number: 5,646,1 13. Date of Patent: July 8, 1997 

Multiple additional provisional applications filed 
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PUBLICATIONS 

Seshasayee D, Dowd P, Gu Q, Erickson S, Goddard AD Comparative sequence analysis of 
the HER2 locus in mouse and man. Manuscript in preparation. 

Abuzzahab MJ, Goddard A, Grigorescu F, Lautier C, Smith RJ and Chernausek SD. Human 
IGF-1 receptor mutations resulting in pre- and post-natal growth retardation. Manuscript in 
preparation. 

Aggarwal S, Xie, M-H, Foster J, Frantz G, Stinson J, Corpuz RT, Simmons L, Hillan K, 
Yansura DG, Vandlen RL, Goddard AD and Gurney AL. FHFR, a novel receptor for the 
fibroblast growth factors. Manuscript submitted. 

Adams SH, Chui C, Schilbach SL, Yu XX, Goddard AD f Grimaldi JC f Lee J, Dowd P, Colman 
S\, Lewin DA. (2001) BFIT, a unique acyl-CoA thioesterase induced in thermogenic brown 
adipose tissue: Cloning, organization of the human gene, and assessment of a potential link 
to obesity. Biochemical Journal 360: 135-142. 

Lee J. Ho WH. Maruoka M. Corpuz RT. Baldwin DT. Foster JS. Goddard AD: Yansura DG. 
Vandlen RL. Wood Wl. Gurney AL. (2001) IL-17E, a novel proinflammatory ligand for the IL- .. 
1 7 receptor homolog IL-1 7Rh1 . Journal of Biological Chemistry 276(2): 1 660-1 664. 

Xie M-H, Aggarwal S, Ho W-H, Foster J, Zhang Z, Stinson J, Wood Wl, Goddard AD and 
Gurney AL. (2000) Interleukin (IL)-22, a novel human cytokine that signals through the 
interferon-receptor related proteins CRF2 T 4 and IL-22R. Journal of Biological Chemistry 275: 
31335-31339. 

Weiss GA, Watana.be CK, Zhong A, Goddard A and Sidhu SS. (2000) Rapid mapping of 
protein functional epitopes by combinatorial alanine scanning. Proc. Natl. Acad. Sci. USA 97: 
8950-8954. 

Guo S, Yamaguchi Y, Schilbach S, Wada T.;Lee J, Goddard A, French D , Handa H, 
Rosenthal A. (2000) A regulator of transcriptional elongation controls vertebrate neuronal . 
development. Nature 408: 366-369. 

Yan M ( Wang L-C, Hymowitz SG, Schilbach S, Lee J, Goddard A, de Vos AM, Gao WQ, Dixit 
VM. (2000) Two-amino. acid molecular switch in an epithelial morphogen that regulates 
binding to two distinct receptors. Science 290: 523-527. 

Sehl PD, Tai JTN, Hillan KJ, Brown LA, Goddard A, Yang R, Jin H and Lowe DG. (2000) 
Application of cDNA microarrays in determining molecular phenotype in cardiac growth, 
development, and response to injury. Circulation 101: 1990-1999. 

Guo S, Brush J, Teraoka H, Goddard A, Wilson SW, Mullins MC and Rosenthal A. (1999) 
Development of noradrenergic neurons in the zebrafish hindbrain requires BMP, FGF8, and 
the homeodomain protein soulless/Phox2A. Neuron 24: 555-566. 

Stone D, Murone, M, Luoh, S, Ye W, Armanini P, Gurney A, Phillips HS, Brush, J, Goddard 
A, de Sauvage FJ and Rosenthal A. (1999) Characterization of the human suppressor of 
fused; a negative regulator of the zinc-finger transcription factor Gli. J. Cell.ScL 112: 4437- 
4448. 

Xie M-H, Holcomb I, Deuel B, Dowd P, Huang A, Vagts A, Foster J, Liang J, Brush J, Gu Q, 
Hillan K, Goddard A and Gurney, A.L. (1999) FGF-19, a novel fibroblast growth factor with 
unique specificity for FGFR4. Cytokine 11: 729-735. 
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Yan M, Lee J t Schilbach S, Goddard A and Dixit V. (1999) mE10, a novel caspase 
recruitment domain-containing proapoptotic molecule. J. Biol. Chem. 274(15): 10287-10292. 

Gurney AL, Marsters SA, Huang RM, Pitti RM, Mark DT, Baldwin DT, Gray AM, Dowd P, 
Brush J, Heldens S, Schow P, Goddard AD, Wood W I, Baker KP, Godowski PJ and 
Ashkenazi A. (1999) Identification of a new member of the tumor necrosis factor family and its 
receptor, a human ortholog of mouse GITR. Current Biology 9(4): 215-218. 

Ridgway JBB, Ng E, Kern JA ,Lee J, Brush J, Goddard A and Carter P. (1999) Identification 
of a human anti-CD55 single-chain Fv by subtractive panning of a phage library using tumor 
and nontumor cell lines. Cancer Research 59: 2718-2723. 

Pitti RM, Marsters SA, Lawrence DA, Roy M, Kischkel FC, Dowd P, Huang A, Donahue CJ, 
Sherwood SW, Baldwin DT, Godowski PJ, Wood Wl, Gurney AL, Hillan KJ, Cohen RL, 
Goddard AD, Botstein D and Ashkenazi A. (1998) Genomic amplification of a decoy receptor 
for Fas ligand in lung and. colon cancer. Nature 396(6712): 699-703. 

Pennica D, Swanson TA, Welsh JW, Roy MA, Lawrence DA, Lee J, Brush J, Taneyhill LA, 
Deuel B, Lew M, Watanabe C, Cohen RL, Melhem MF, Finley GG, Quirke P, Goddard AD, 
Hillan KJ, Gurney AL, Botstein D and Levine AJ. (1998) WISP genes are members of the 
connective tissue growth factor family that are up-regulated in wnt-1 -transformed cells and 
aberrantly expressed in human colon tumors. Proc. Natl. Acad. Set. USA. 95(25): 14717- 
14722. 

Yang RB, Mark MR, Gray A, Huang A, Xie MH, Zhang M, Goddard A, Wood Wl, Gurney AL 
and Godowski PJ. (1998) Toll-like receptor-2 mediates lipopolysaccharide-induced cellular 
signalling. Nature 395(6699): 284-288. 

Merchant AM, Zhu Z, Yuan JQ, Goddard A, Adams CW, Presta LG and Carter P. (1998) An 
efficienUroute to human bispecific IgG. Nature Biotechnology 16(7): 677-681 . 

Marsters SA, Sheridan JP, Pitti RM, Brush J, Goddard A and Ashkenazi A. (1998) 
Identification of a ligand for the death-domain-containing receptor Apo3. Current Biology 8(9): 
525-528. 

Xie J, Murone M, Luoh SM, Ryan A, Gu Q, Zhang C, Bonifas JM, Lam CW, Hynes M, 
Goddard A, Rosenthal A, Epstein EH Jr. and de Sauvage FJ. (1998) Activating Smoothened 
mutations in sporadic basal-cell carcinoma. Nature. 391(6662): 90-92. 

Marsters SA, Sheridan JP, Pitti RM, Huang A, Skubatch M, Baldwin D, Yuan J, Gurney A, 
Goddard AD, Godowski P and Ashkenazi A. (1997) A novel receptor for Apo2L/TRAIL 
contains a truncated death domain. Current Biology. 7(12): 1003-1006. 

Hynes M, Stone DM, Dowd M, Pitts-Meek S, Goddard A, Gurney A and Rosenthal A. (1997) 
Control of cell pattern in the neural tube by the zinc finger transcription factor Gli-1. Neuron 
19:15-26. 

Sheridan JP, Marsters SA, Pitti RM, Gurney A., Skubatch M, Baldwin D, Ramakrishnan L, 
Gray CL, Baker K, Wood Wl, Goddard AD, Godowski P, and Ashkenazi A. (1997) Control of 
TRAIL-lnduced Apoptosis by a Family of Signaling and Decoy Receptors. Science 277 
(5327): 818-821.. 
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Goddard AD, Dowd P, Chernausek S, Geffner M, Gertner J, Hintz R, Hopwood N, Kaplan S, 
Plotnick L, Rogol A, Rosenfield R, Saenger P, Mauras N, Hershkopf R, Angulo M and Attie, K. 
(1997) Partial growth hormone insensitivity: The role of growth hormone receptor mutations in 
idiopathic short stature. J. Pediatr. 131: S51-55. 

Klein RD, Sherman D, Ho WH, Stone D, Bennett GL f Moffat B, Vandlen R t Simmons L, Gu Q, 
Hongo JA, Devaux B, Poulsen K, Armanini M, Nozaki C, Asai N, Goddard A, Phillips H, 
Henderson CE, Takahashi M and Rosenthal A. (1997) A GPI-linked protein that interacts with 
Ret to form a candidate neurturin receptor. Nature, 387(6634): 717-21. 

Stone DM, Hynes M, Armanini M, Swanson TA, Gu Q, Johnson RL, Scott MP, Pennica D> 
Goddard A, Phillips H, Noll M, Hooper JE, de Sauvage F and Rosenthal A. (1996) The 
tumour-suppressor gene patched encodes a candidate receptor for Sonic hedgehog. Nature 
384(6605): 129-34. 

Marsters SA, Sheridan JP, Donahue CJ, Pitti RM, Gray CL, Goddard AD f Bauer KD and . 
Ashkenazi A. (1996) Apo-3, a new member of the tumor necrosis factor receptor family, 
contains a death domain and activates apoptosis and NF-kappa p. Current Biology 6(12)' 
1669-76. 

Rothe M, Xiong J, Shu HB, Williamson K, Goddard A and Goeddel DV. (1996) l-TRAF is a 
novel TRAF-interacting protein that regulates TRAF-mediated signal transduction. Proc. Natl. 
Acad. Sci. USA 93: 8241-8246. 

Yang M, Luoh SM, Goddard A, Reilly D, Henzel W and Bass S. (1 996) The bglX gene 
located at 47.8 min on the Escherichia coli chromosome encodes a periplasmic beta- 
glucosidase. Microbiology 142: 1659-65. 

Goddard AD and Black DM. (1996) Familial Cancer in Molecular Endocrinology of Cancer. 
Waxman, J. Ed. Cambridge University Press, Cambridge UK, pp. 187-21 5. 

Treanor JJS, Goodman L t de Sauvage F, Stone DM, Poulson KT, Beck CD, Gray C, Armanini 
MP, Pollocks RA, Hefti F f Phillips HS, Goddard A, Moore MW, Buj-Bello A, Davis AM, Asai N, 
Takahashi M, Vandlen R, Henderson CE and Rosenthal A. (1996) Characterization of a 
receptor for GDNF. Nature 382: 80-83. 

Klein RD, Gu Q, Goddard A and Rosenthal A. (1996) Selection for genes encoding secreted 
proteins and receptors. Proc. Natl. Acad. Sci. USA 93: 7108-7113. 

Winslow JW, Moran P t Valverde J t Shih A, Yuan JQ, Wong SC, Tsai SP, Goddard A, Henzel 
WJ, Hefti F and Caras I. (1995) Cloning of AL-1, a ligand for an Eph-related tyrosine kinase 
receptor involved in axon bundle formation. Neuron 14: 973-981 . 

Bennett BD, ZeiglerFC, Gu Q, Fendly B, Goddard AD, Gillett N and Matthews W. (1995) 
Molecular cloning of a ligand for the EPH-related receptor protein-tyrosine kinase Htk. Proc. 
Natl. Acad. ScL USA 92: 1 866-1 870. 

Huang X, Yuang J, Goddard A, Foulis A, James RF, Lernmark A, Pujol-Borrell R, 
Rabinovitch A, Somoza N and Stewart TA. (1995) Interferon expression in the pancreases of 
patients with type I diabetes. Diabetes 44: 658-664. 

Goddard AD, Yuan JQ, Fairbairn L, Dexter M, Borrow J, Kozak C and Solomon E. (1995) 
Cloning of the murine homolog of the leukemia-associated PML gene. Mammalian Genome 
6: 732-737. 
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Goddard AD, Covello R, Luoh SM, Clackson T, Attie KM, Gesundheit N, Rundle AC T Wells 
JA, Carlsson LMTI and The Growth Hormone Insensitivity Study Group. (1995) Mutations of 
the growth hormone receptor in children with idiopathic short stature. N. Engl. J. Med. 333: 
1093-1098. 

Kuo SS, Moran P, Gripp J, Armanini M, Phillips HS P Goddard A and Caras IW. (1994) 
Identification and characterization of Batk, a predominantly brain-specific non-receptor protein 
tyrosine kinase related to Csk. J. Neurosci. Res. 38: 705-715. 

Mark MR, Scadden DT, Wang Z, Gu Q, Goddard A and Godowski PJ. (1994) Rse, a novel 
receptor-type tyrosine kinase with homology to Axl/Ufo, is expressed at high levels in the 
brain. Journal of Biological Chemistry 269: 1 0720-1 0728. 

Borrow J, Shipley J, Howe K, Kiely F, Goddard A, Sheer D, Srivastava A, Antony AC, 
Fioretos T f Mitelman F and Solomon E. (1994) Molecular analysis of simple variant 
translocations in acute promyelocyte leukemia. Genes Chromosomes Cancer 9: 234-243. 

Goddard AD and Solomon E. (1993) Genetics of Cancer. Adv. Hum. Genet 21: 321-376. 

Borrow J, Goddard AD, Gibbons B, Katz F, Swirsky D, Fioretos T, Dube I, Winfield DA, 
Kingston J, Hagemeijer A, Rees JKH, Lister AT and Solomon E. (1992) Diagnosis of acute 
promyelocytic leukemia by RT-PCR: Detection of PML-RARA and RARA-PML fusion 
transcripts. Br. J. Haematol. 82: 529-540. 

Goddard AD, Borrow J and Solomon E. (1992) A previously uncharacterized gene, PML, is 
fused to the retinoic acid receptor alpha gene in acute promyelocytic leukemia. Leukemia 6 
Suppl 3: 117S-119S. 

Zhu X, Dunn JM, Goddard AD, Squire JA, Becker A, Phillips RA and Gallie BL (1992) 
Mechanisms of loss of heterozygosity in retinoblastoma. Cytogenet. Cell Genet 59: 248-252. 

Foulkes W, Goddard A. and Patel K. (1991) Retinoblastoma linked with Seascale [letter]. 
British Med. J. 302: 409. 

Goddard AD, Borrow J, Freemont PS and Solomon E. (1991) Characterization of a novel zinc 
finger gene disrupted by the t(15;17) in acute promyelocytic leukemia. Science 254: 1371- 
1374. 

Solomon E, Borrow J and Goddard AD. (1991) Chromosomal aberrations in cancer. Science 
254: 1153-1160. 

Pajunen L, Jones TA, Goddard A, Sheer D, Solomon E, Pihlajaniemi T and Kivirikko Kl. 
(1991) Regional assignment of the human gene coding for a multifunctional peptide (P4HB) 
acting as the p-subunit of prolyI-4-hydroxylase and the enzyme protein disulfide isomerase to 
17q25. Cytogenet Cell. Genet. 56: 165-168. 

Borrow J, Black DM, Goddard AD, Yagle MK, Frischauf A.-M and Solomon E. (1991) 
Construction and regional localization of a Not\ linking library from human chromosome 17q. 
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We have enhanced the polymerase chain 
reaction (PGR) such that specific DNA 
sequences can be detected without open- 
ing the reaction tube. This enhancement 
requires the addition of ethidium bromide 
(EtBr) to a PGR. Since the fluorescence of 
EtBr increases in the presence of double* 
stranded (4s) DNA an increase in fluores- 
cence in such a PGR indicates a positive 
amplification, which can be easily moni- 
tored externally. In fact, amplification can 
be continuously monitored in order to 
follow its progress. The ability to simulta- 
neously amplify specific DNA sequences 
and detect the product of the amplification 
both simplifies and improves PGR and 
may facilitate its automation and more 
widespread use in the clinic or in other 
situations requiring high sample through- 
put 

Although the potential benefits of PGR 1 10 clin- 
ical diagnostics arc wetl known*- 5 , it is still not 
widely used in this setting, even though U is 
fenxr years ciuco thermostable DNA poSym^i-* 
ase$* made PCR practical. Some of the reasons for its $low. 
Acceptance are high cost, feck of automation of pre-t and 
post-PCR processing steps, and false positive results, from 
canyaveT-contaminatioD, The first two points arc related 
in that labor is the largest contributor to cost ait the present 
stage of PCR development. Most current assays require 
some form of "downstream" processing once tbermocy* 
ding ts done in order to determine whether the target 
DNA sequence was- present and has amplified. These 
include DNA hybrtdwoon*'* gel electrophoresis with or 
without use of restriction digestion*;*, H?tX?, or capillary 
electrophoresis 10 . These methods are labor-intense, have 
low throughput, and arc difficult to automate. The third 
point is aLo-cU»cry related to downstream processing. 
The handling of the PCR product in these downstream 
processes increases the chances that aropiified DNA . wilt 
spread through the typing' lab, resulting in * a risk of 



"carryover" false positives in subsequent testing". 

These downstream processing steps would be elimi- 
nated if specific amplification and detection of amplified 
DNA took place simultaneously within an unopened re- 
action vessel Assays in whkh such different processes take 
place without. the need to separate reaction components 
have been termed \T^mogeneous'\ No truly hbmogc-. 
rieous PCR assay has been demonstrated to date, akhough 
progress towards this end has been reported.- Chehab, et 
al. ! % developed a PCR product detection schentc using 
fluorescent primers that resulted in a fluorescent PGR 
product, AHcte-specific primers, cad* with different fluo- 
rescent tags, were used to indicate the genotype of the 
DNA. However, the unincorporated primers rnust still be 
removed in a do wnstream process in order to visualize the 
result Recently, Holland, et al> l? , developed an assay in 
whkh the endogenous 5' exonudease assay of Tof DNA 
polymerase was exploited to cleave a labeled oligonucleo- 
tide probe. The probe would only deave if PCR axnp£fV 
cation had produced its complettientary sequence. In 
order to detect the dcavage products, however, a subse- 
quent process is again needed. 

We have developed a truly hotnogeneouA assay for PCR 
and PCR. product detection based upon tbe grearJy in- 
creased fluorescence that ethidfcun btoinMe and other 
DNA binding dyes exhibit When they ate bound io.ds- 
DNA l *^ lq . As outlined in Figure 1; a proiocypk PCR 
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HtWU 1 $*rindptc of simultaneous ampGficaticn and - detection Of 
PCR product: The cOtfipoucntfc of a PCR COotaiinhitf EcSr rhat arb 
0uoreso2i« »re listed— EtBr itself, EtBr bound to either cDNA or 
daDN A There is 4 lar^e fijwrc^cccwx cnhaaGuncnt when 'EtBr b 
bound to DNA and bmdixig u greatly enhanced when DNA is 
doublc-strawlcd. After sunidc^t <n)..cvdcs of PCR» the .net 
iacrcafte m dupNA re^tdts in additional -£t&r binding and & net 
increase in total fluorescence: . 
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BGSIBS 2 Gd electrophoresis of PCS. amplification products of the 
human, midcar gene, HLA DQti, made in the presence of 
increasing amounts of EtBr (up to 9 H-g/ml). The presence of 
£tBr lias no obviouj effect on tltc yield or specificity of amplifi- 
cation. 




B. 




IHMREffi: & (A) fluorescence measurement* from PCRs that contain 
0.5 pgfai! EtBr 2nd that are specific for Y^rotnosonie repeat 
seq<tetice*. Five replicate PCRs tvert begun containing each of the 
DNA* specified. At 'each radicated cyde, one of the five replicate 
PCRs for each DNA -was removed from thcrmocydmg and H* 
fluorescence measured. Unit* of fluorescence are arbitranr. (ft) 
UV photography of PGR tube* (0.5 ml Eppcndorf-*tylc, polypro- 
pylene m»av~<entrifuffc tubes) conUHling reactions, those xta.tt* 
ing from 2 ng male DNA and control reactions without any DNA, 
from (A), 



begins with primers that are ungle-straoded DNA (as- 
DNA), dNTPs, and DNA polymerase; An amount of 
dsDNA containing the target sequence (target DNA) is 
also typically present* This amount tan vary, depending 
or the application, front single-cell amounts of DNA 17 to 
micrograms per PCR- 8 . If EtBr is present the reagents 
that will fluoresce, in order of incrtasuig' fluorescence, are 
free EtBr "itself* and EtBr bound to the singk^Lraxidcd 
DNA primers ami to the doublc*stnmded target DNA (by^ 
its tnterealauon between the stacked bases of the DNA 
dooblc-hc&t). After the first denaturation cyde, target 
DNA will be largely single-stranded. After a PCS is 
completed, the most significant change; h the increase in 
the amount of dsDNA (the PCR product itself) of up to 
several tnicrograms- Formerly "free EtBr is bound to the 
additional clsDNA* resulting in an increase in fluores- 
cence. There is also some decrease in the amount of 
ssDNA primer, but because the binding of EtBr to ssDNA 
is much Jess than to dsDNA, che effect of -this change on 
the total fluomccncc of the sample is small* The fluores- 
cence increase can be measured by directing excitation 
illumination through the walls of the amplification vessel 



before and after, or even continuously during, thennocy- 
ding;. 



RESULTS 

PGR in the presence of Ettf lu order to assess the 
affect of EtBr io PGR, amplifications of the human Hi j\ 
DQa gene" were performed with the dye present at 
concentrations from 0,06 to 8.0 figfrnl (a tyDtca| concen- 
tration of EtBr used in staining of nucleic aads following 
gel electrophoresis is 0.5 ugfcnf)„ As shown in Figure 2> gel 
electronhorcsis revealed little or no difference in the yield 
or quality of the amplification product whether EtBr was 
absent or present at any of these concentrate™ $, indicat- 
ing that EtBr does not inhibit PGR, 

Dcfectxro of human Y-c&roiifcCsa>fliKs spedfic we- 
«juesi>cesv ScqiietK^-specific, fluorescence enhancement of 
EtBr as a result of PGR was demonstrated in a scries of 
amplifications containing 0,5 iigfrnl EtBr and primer* 
specific to repeat DNA sequences found on the humaa 
V-chromosomc 20 - These PCRs initially contained either 
60 ng male. 60 ng female, 2 ng roaSc human or no DNA. 
Five replicate PCRs were begun for each DNA. After 9, 
17, 21, 24 and 29 cydes of thermocyding, a.PCR for cadi 
DNA was removed from the therroocyder, and its. fluo- 
rescence measured in a spectrofluorometer and plotted 
vs. amplification cyde number (Fng. 3A). The shape of this 
curve reflects the fact that by the time an increase in 
fluorescence can be detected, the increase in DNA is 
becoming linear and not exponential with cyde number. 
As shown, the fluorescence increased about, three-fold 
over the background fluorescence for the PCRs contain- 
ing human male DNA, but did not siguificandy increase 
for negative control PCRs, which contained cither no 
DNA or human female DNA*, The more male DNA 
present to begin wkh— 60 ng versus t ng— the fewer 
cycles were needed to give a detectable increase in fluo- 
rescence. Gel electrophoresis 00 the products of these 
amplifications showed that DNA fragments of the ex- 
pected vzc were made ia the male DNA containing 
reactions and that liule DNA synthesis took place in the 
control samples. 

In addition, the increase in. fluorescence was visualized 
by simply laying the completed, unopened PCRs c>o » UV 
transUhiminatox and photographing them through a red" 
filter. This is shown ia figure 5B tor the reactions thai 
began with 2 ng male DNA and those with no DNA. 

Utettfttttort of specific alleles of tiro human #-globin 
gene, in order to demonstrate that this approach hat 
adequate spedfidty to allow genetic screeiung, a dttccuon 
of the $icMe-cdI anemia mutation was performed. Figure 
4 shows the fluorescence from completed ampMeaticrtu 

coriuinictg EtBr (0.5 ng/ml) a* detected by photography 
of the reaction tubes on a UV oupsiUraninator, These 
reactions were performed using- primer* specific for ci- 
ther the wild -type or sickle-ceil mutation of the human 
p-globin gene* \ The specificity for each allele is imparted 
by placing the sickle-mutation site at the terminal 3' 
nucleotide of one primer. By using an appropriate primer 
annealing temperature* primer extension— -and thus am- 
plification—can take place only if the 3' nucleotide of the 
primer is complementary to the p-globan aUdc present** • 
Each pair of amplications shown in Figure 4 consists of 
a reaction with etcher the wild-type alldc spedfic (left 
tube) or stcklc-alide spedfic (right tube) primers. Three 
different DN As were typed: DNA from a homozygous, 
wild-type p-giobin individual (A A); from a heterozygous 
sickle p-globin individual (AS); and from a homozygous 
sickle p-gtobw individual (SS). Each DNA (50 ng genomic 
DNA to start each PGR) was analyzed m triplicate (3 pairs 
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0 f reactions each). The DUA .type vas reflected in the 
^tfve fluorescence intensities in each pair of completed 
amplifications There was a significant increase in fluores- 
cence Qftly where a 0-globin allele DNA matched the 
primer When measured on a spectrofltioronictcr 
(data not shown), this fluorescence was about three tones 
jrtt present in a PGR where both fr-globm alleles were 
jnisitiatchcd to the primer set* Gel electrophoresis (not 
P hown) established that thw increase in fluorescence vas 
<jue to the synthesis of nearly a microgram of a DNA 
fragment of the expected size for P-globin. There was 
fttdc synthesis of dsDNA in reactions m. which the allele* 
jipednc primer was mismatched to both alleles. 

Cotiruruou^ monitoring of a PCR. Using a fiber optic 
dcVteerk is possible to direct excitation illumination from 
3 sppctrofluorometer to a PGR undergoing thcrmocyding 
and to return its fluorescence to the Kpeetroftuorowctcr. 
The fluorescence readout of such an arrangement, de- 
tected At an Etfir-concaintng amplification of Y<hroxno- 
joflae speci&c sequences from 25 r»g of human male DNA* 
is shown tn Figure S. The readout from a control r*CR 
with no target DNA is also shown. Thirty cycles of PCR 
were monitored for cach- 

The fluorescence trace ax a function of time clearly 
shows the etiect of the themocyding. Fluorescence inten- 
sity riser and. falls uiYCrscJy with temperature. The fluo- 
reteence intensity is minimum at the denaturation tem- 
perature (&4°C) and maximum at the anncalin g/extemion 
temperature (50°C). In the negative-control PCR, these 
fluorescence maxima and minima do not change, signifi- 
cantly over the thirty tncrraocycJes, indicating that there is 
little dsDNA synthesis without the appropriate target 
DNA, and there is little if any Wcachlrig of EtBr during 
(he continuous illumination of the sample. 

In the PCR containing male DNA, the fluorescence 
maxima at the annealing/exxensioD temperature begin to 
increase at about 4000 seconds' of therroocycling, and 
continue to increase with time, indicating that dsDNA is 
bctng produced at a detectable level Note that the fluo- 
rescence minima at the denaturatba temperature do not 
significantly increarc, presumably because ax thh temper- 
ature there is no dsDNA for EtBr to bind- Thus the course 
of the amplification is followed by tracking the fluorc*-. 
cence increase at the auxicaKng temperature. Analysis of 
ihc products of these two amplifications by gel ejeetropho- 
tepls showed * DNA fragment of the expected siie for the 
male DNA containing sample and no detectable DNA 
synthesis for the control sample* 

DISCUSSION 

Downstream processes such as hybridization to a se- 
qucncen&pecifk probe can' enhance tlie specificity of DNA 
dccevtiuii Irr PGR, The ctunioatkm of thcac proccj^co 
means that* the spednehy of this homogeneous assay 
depends solely on that of F<2lL la the case of sickle-celi 
disease, we have shown that PCR alone has sufficient DNA 
sequence apectficity to permit jjeneiic screening. Using 
appropriate amplification conditions, there is little non- 
specific production of dsDNA in the absence of the 
appropriate target allele. 

The specificity required to detect pathogens can be 
more or less than that required" to do genetic screening, 
depending on the number of pathogens in the sample and 
the amount of other DNA that must be taken with the 
sample. A difficult target is HIV, which requires detection 
of a vir*J genome that can be at the level of n few copies 
per thousands of hose cells*. Compared with genetic 
screening, which is performed on ceils containing at Jean 
one copy of the target sequence* HIV "detection requires 
both more specificity and the input of more total 
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WWt 4 UV photography of PCR tubes containing tunp&ficftUoiu 
using EtBr wwt are specific "to wiM-typc (A) or licxte (S> alleles of 
the human p-globin genc. The left of «*ch pair of tub« contain* 
alkle-spcafic primers to the wild-type alleles, the right tube 
primers to the skWe atWe- The photograph was tafwn after 50 
cycles of FOVaod the input DNAs and- the alleles they contain 
are indicated, fifty tag of DNA was used to bean PCFL Typing 
was done in triplicate (3 pairs of PCHs) for each input DNA 
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n&N&S Continuous* realtime monitoring of a PCR. A fiber optic 
was wed to carry- excitation Tight tn a PCR in progress aod aUo 
emhted light back to a Booromctcx (sec Exoentticntal Protocol), 
AmptificaUooimog human m<do-DNA specific primer* 7n a PCR 
Starting with 10 ng of hwmaa male DNA (topj, or in * control 
FCK without DNA {ooitom), were nwnhorrd. Thirty cydrj of 
ICR were followed for each. The' temperature Cydcd between 
94*C (detuturation) and 50*C (annealing and extension). Note in 
the male DNA PCR,. the cycle (time} depeorfcet increase in 
fluorescence at the aoueaBrtg/extenaion temperature. 
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DNA — lip to microgratn amounts— in order to have suf- 
ficient numbers of target sequences. This large amount of 
starting DNA m an amplication sigfti&cauUy increases 
the background fluorescence over which any additional 
fluorescence produced by PCR- must be detected. An 
additional complication that occurs wiih targets "m low 
copy-number h the formation of the ^Y^mer-duner" 
artifact. This is the result of the extension of one primer 
using the other primer as a tcropJate, Although this occurs 
infrequently, once R occur* the extension product is a 
substrate for PGR amplification, and can compete whh 
true PGR targets if those targets are rare. The primer- 
dimcr product is of course dsDNA and thus is- a potential 
source of faUe signal in this homogeneous a*$ay. 

To increase FCR specificity and reduce the eEcet of 
prirner-dimcr antpUfvcatioa, ve are investigating a num- 
ber of approaches, 'including the use of ncstedrprimer 
amplifications that take placr in a single tube 8 , and the 
''hot-start**, in which nonspecific amptffkatfon is reduced 
by raising the temperature of the reaction before DWA 
synthesis begins**. Prelhninary results us'mg these ap- 
proaches suggest tbatprwncrHiihiCT is effectively reduced 
and it is possible to detect the increase in Ecfir fluores- 
cence in a PCR instigated by a -single HIV genome in a 
background of 10 9 ceils. With larger -number* of octls, the 
background fluorescence contributed by genomic DNA 
becomes problematic. To, reduce this background, it may 
be possible to use sequence -specific DNA-binding dyes 
that can be made to preferentially bind PCR product over 
genomic DNA by incorporating the dye-binding DNA 
sequence into the PCR product through a 5' "add-on" to 
the oligonvcleoddc primer*'*. 

We have shown that the detection of fluorescence 
generated by an EtBr-containing PCR is straightforward, 
both once PCR is completed and continuousDy during 
thermocyding. The ease with which automation of spe- 
cific DNA detection can be accomplished is the most 
promising aspect of this assay. The Auorescence analysis 
of completed PCR* is alreadypossiblc with existing instru- 
mentation in 96-well fbrmar*. In tliis format, the fluores- 
cence in each PCR can be <juantitated before, after, and 
even at selected points during therrhocycurte by moving 
the rack of PCR* to a ^rnicTOWcll plate fluorescence 
reader 40 , . „ 

Hie instrumentation necessary to oontinuoiisly monitor 
multiple PCRs simultaneously is also simple in principle. 
A direct extension of the apparatus used here is to have 
multiple fiberoprics transmit the excitation light and flu- 
orescent emissions to and from multiple PCRs. The ability 
to monitor multiple PCRs continuously may allow quan- 
titation of target DNA copy number, figure 5 shows that 
the larger the amount of starting target DNA, the sooner 
during PCR a fluorescence increase k detected. Prelimi- 
nary experiments {Higuchi and DoWinger, manuscript in 
preparation) with continuous monitoring' have shown a 
scosiuYity to two-fold differences in initial target DNA 
concentration. 

Conversely, if the number of target molecules 
known — a$ it can be in genetic screening-^rcontinuows 
monitoring may provide a means pf detecting fa tec posi- 
tive and false negative result* With a known number of 
target molecules, a true positive would exhibit detectable 
fluorescence by 3 predictable number of cydes of PCR* 
Increases in fluorescence detected before or after, that 
cycle would indicate potential artifacts. False negative 
results due to, for example,. inhibition of DNA polymer- 
ase, may be detected by including within each PGR an 
inefficiently amplifying marker. This marker results in a 
fluorescence increase only after a large number of cy- 
cles — many more' than arc necessary to detect a true 



positive- If a sampJc fails mi have a fluorescence increase 
after this many cycks, iohfcniton may be suspected. Since, 
in this assay, conclusions are drawn based on the presence 
or absence of fluorcsccmce signal alone, such controls may 
be important- In any event before any test based on this 
principle is ready for the chnio ah assessment of its false 
positive/false negative rates wOl need to be obcained'using 
a large number of known samples. 

In summary-, the inclusion bi PCR of dyes whose fluo- 
rescence is enhanced upon binding dsDNA makes it 
possible to detect specific DNA amplification from oucside 
the PCR tube. In the future, instruments based upon this 
principle may faciKlate the more widespread use of PCR, 
in applications that demand the high throughput of 
sample** 

EXPERIMENTAL PROTOCOL 

Hainan HLA-X)Q« gene Amplifications ctrataixuag Eujx, 
PCRs were set up in 100 u4 volumes containing 10 mM Tris-HCh 
pH 8.3; 50 mM KCI; 4 raM MgO t : *-5 units of Taa DNA 
polymerase (PerUm-EJmcr Ccnn, Norwalk, CT); 20 pvriote each 
of human HlA-DQa ' gene specific oligonucleotide primers 
GH26 and CH27 19 and apprcoomately 10* copies of DQd PCR 
product diluted from a predou* reaction- EuSidium bromide 
(EtRr; Sigma) was used at the couceutaiuoiu; indicaied u> Figure 
2. Thermocyding proceeded for 20 cycles in a model 480 
thcrmocydcr (Perfe-Elmer Cctua, Norwalk, CT) usinga "rtcp- 
cyckf program of 94°C for 1 m jo.. denatu ration and 60V. for "SO 
sec knnealmg and 72'C for 30 *ce. extension, 

Y-chromowme specific PCR. PCRs (100 ul total reaction 
volume) containing K*» ^cAttl JEtBr were prepared as described 
For HLA-DQ<x, except with different primers and target DNAs. 
These PCRs contained 1 5 pmotc each male DN A-spcctfic priow 
YM and Vl.2* 0 , and cither 60 ng male. 60 Otffemale, 2 ng mak, 
or no human DNA. Thermocyding was 94*Ctor I min- and 6<PC 
for 1 min using a "step-cycle* program- The number of cycles for 
a sample were as indicated in figure 3. Fluorescence measure- 
ment k described below. ^ , 

AUck-flpccific, human frgkAnn pro* PCR. Amplifications of 
100 fd volume' using 0 5 P^/ml of £tBr were prepared ** 
described for HLA-OQm above except with different pruncr* and 
target DNAs. These PCRs. contained tuber pdmcr pair 'HGrt' 
TO HA <wBd-type globin speculc primers) or HGF24ifn4$ (»<*- 
Ic-fflobin specific primers) at 80 pmole e*ch primer per PCR. 
These primers were developed by Wu ct aL 2 \ Three different, 
target DNA a were u*cd in separate amplifkation&--50 ng eaefc of 
human DNA that was homozygous for the sfcWc trait (SS), DNA 
that was lieterorrgous for the sickle wait <AS), or DhiA that vrtu 
hotnOTCrgous For Ae w.t- globin (AA)- ThcTmocycBng.was pw » 
cycles at for 1 mm. and 55*C Cor 1 min. itsure a w step-cycte 
program. AnanneaKng temperature of 55^ b*d heen shown ty 
Wu et al. 2< to provide allclc-«pccinc awplifeauon. Complied 
PCRs were phoWraphcd uirough a red filter (WraUenJttA) 
after placing the rcaSon tubes a«y" a model TM-S6 transffl>iCfti- 
n3tor <UV-pTOductt 5atv Gabriel, CA). 

Fhtorescenee measnrcmem. Ft«ote*ce*>ce nxas«rcraenw were 
made on PCRs containing EtBr m a Fiuorolog-2 uUororncCcr 
(SPEX Edison. NJ). txciurtion was at the 500 nra baiwl *jth 
about 2 nm bandwidth warn a GG 435 nty cut-off ^S**g5 
Crist. Inc^ Irvine. CA) to exclude sccond^fder hght v»™*a 
H K ht was detected at 5 V0 nm with a bandwidth of about 7 nm. An 
<X> 530 am cut-off fiJtcr Was used to remove the ocatAttOn 

ContitHKKifl ftnomcence mraitormg of IrdU C^ntoiuOuS 
monitoring oiF a KJR in progress was accomplisbed using the 
Bpcctrofluorometcr and setungB descrrbod above as weU a$a 
fiberoptic accessory (SPEX caL no. 1950) tobotV send exotaoon 
fight to. aad tecdvc emitted light from, a PCR placed m a wcU «J 
a model 4d<> ibermocydcr (Pcrkm-EJmer Cetus). The probe end 
of the fiberoptic cable was attached yrfili "5 own.utc-cpoxy to Oft 
open top of a PCR cube (a 0.5 ml potoropytom cimmfogc woe 
with its cap removed) effectively scaling «- The cxp05edy?f<n 
Che PGR tunc and the end of the fiberoptic caWc were _jbjcWcd 
from room light and the rooco lights were kept dimmed during 
each vup. The monitored PCR was an ampWeiuOn of V-<*^ 0 ' 
tnosornMpcdfk: repeat sesueDCeS as dVscribcd nbove, except 
usJnff an anncating/extertfkm temperfluire of 50^C. ThereaOKH* 
was covered y*f& iwijwtrM oil (2 drops) to prevent evapdranon. 
Thennocydiiiff and fluorescence roc^urcmcot were started 
multancously- A time-base BCJ» a with a 10 second mtegrafaon tunc 
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vnn u«sd and the emUstoa signal was ratiocd to' tbc excitation 
alga*] to control for change* in iigfn-*ourcc intensity. Pat* were 
collected using the draSOOOf, version 15 (SPEX) data system. 

Wc ttani Bob Jones for help with the spectrafluormciric 
OKStutfrementt and Hcalherhell Fong for editing this manuscript. 
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IMMUNO BIOLOGICAL LABORATORIES 



Trauma, Shock and Sepsis 




The CO-14 molecule is expressed on the surface of 
monocytes and some macrophages. Membrane- 
bound CO-14 is a receptor for Itpopofysaccharicte 
(LPS) complexed to LPS-8inding-Protein (LBP). The 
concenuatlon of te soluble form is altered under 
certain pathological conditions. There, is evidence for 
an Important role of $(XM4.with polytrauma, sepsis, 
burnings and inflammations. 
During septic conditions and acute infections it seems 
to be a prognostic marker and is therefore of vatue in 
monitoring these patients. 



IBL offers an EUSA for quantitative determination of 
soluble CD-14 in human serum, -plasma, cel|-cutture 
supernatants and other biological fluids. 
Assay features: 12x8 determinations 
(microliter strips), 
precoated with a specific 
monocjonai antibody, 
2x1 hour incubation, 
standard range: 3-96 ng/m) 
detection limit 1 ng/ml 
CV: intra- and interassay < q% 

For more information call or fax 




0ESELLSCHAFT FOR IMMU WCHEMIE UND -BIOLOGIE MBH 

05TERSTRASSE 86 • D - 2000 HAMBURG 20 - GERMANY TEL. +40/491 00 61-64 - FAX +40 /40'n 93 



PAGE 6/6 * RCVD AT 7/19/2004 3:10:03 PfVI [Pacific Daylight Time] * SVR:SVCS01/0 ' DfJIS:6638 1 CSD:650 952 9881 * DURATION (mm-ss):0446 



From C613) 991-5695 Order A5811209DP04816741 Mon 24 Oct 2005 0' :35 PM EDT Page 2 of 7 

Fluorescent Dyes at 
Opposite Ends Provide a Quenched Probe 
System Useful for Detecting PCR Product 
and Nucleic Acid Hybridization 

Kenneth j. Livak, Susan J.A. Flood, Jeffrey Marmaro, William Ciusti, and Karin Deetz 

Perkin-EImer, Applied Biosystems Division, Foster City, California 94404 



The 5' nuclease PCR assay detects the 
accumulation of specific PCR product 
by hybridization and cleavage of a 
double-labeled fluorogenlc probe 
during the amplification reaction. 
The probe Is an oligonucleotide with 
both a reporter fluorescent dye and a 
quencher dye attached. An Increase 
In reporter fluorescence Intensity In- 
dicates that the probe has hybridized 
to the target PCR product and has 
been cleaved by the 5' ->3' nude- 
olytlc activity of Taq DNA polymerase. 
In this study, probes with the 
quencher dye attached to an internal 
nucleotide were compared with 
probes with the quencher dye at- 
tached to the 3 -end nucleotide. In all 
cases, the reporter dye was attached 
to the S' end. All Intact probes 
showed quenching of the reporter 
fluorescence. In general, probes with 
the quencher dye attached to the 3'- 
end nucleotide exhibited a larger sig- 
nal In the 5' nuclease PCR assay than 
the Internally labeled probes. It Is 
proposed that the larger signal Is 
caused by Increased likelihood of 
cleavage by Taq DNA polymerase 
when the probe Is hybridized to a 
template strand during PCR. Probes 
with the quencher dye attached to 
the 3 -end nucleotide also exhibited 
an increase In reporter fluorescence 
intensity when hybridized to a com- 
plementary strand. Thus, oligonucle- 
otides with reporter and quencher 
dyes attached at opposite ends can 
be used as homogeneous hybridiza- 
tion probes. 



/^homogeneous assay for detecting 
the accumulation of specific PCR prod- 
uct that uses a double-labeled fluoro- 
genic probe was described by Lee et al. n > 
The assay exploits the 5'-* V nucle- 
oiytic activity of Taq DNA poly- 
merase (2 - 3) and is diagramed in Figure 1. 
The fluorogenic probe consists of an oli- 
gonucleotide with a reporter fluorescent 
dye, such as a fluorescein, attached to 
the 5' end; and a quencher dye, such as a 
rhodamine, attached internally. When 
the fluorescein is excited by irradiation, 
its fluorescent emission will be 
quenched if the rhodamine is close 
enough to be excited through the pro- 
cess of fluorescence energy transfer 
(FET). (4 ' 5J During PCR, if the probe is hy- 
bridized to a template strand, Taq DNA 
polymerase will cleave the probe be- 
cause of its inherent 5' 3' nucleolytlc 
activity. Jf the cleavage occurs between 
the fluorescein and rhodamine dyes, it 
causes an increase in fluorescein fluores- 
cence intensity because the fluorescein 
is no longer quenched. The increase in 
fluorescein fluorescence intensity indi- 
cates that the probe-specific PCR product 
has been generated. Thus, FET between a 
reporter dye and a quencher dye is criti- 
cal to the performance of the probe in 
the 5' nuclease PCR assay. 

Quenching is completely dependent 
on the physical proximity of the two 
dyes. <6) Because of this, it has been as- 
sumed that the quencher dye must be 
attached near the 5' end. Surprisingly, 
we have found that attaching a rho- 
damine dye at the 3' end of a probe 
still provides adequate quenching for 
the probe to perform in the 5' nuclease 



PCR assay. Furthermore, cleavage of this 
type of probe is not required to achieve 
some reduction in quenching. Oligonu- 
cleotides with a reporter dye on the 5' 
end and a quencher dye on the 3* end 
exhibit a much higher reporter fluores- 
cence when double-stranded as com- 
pared with single-stranded. This should 
make it possible to use this type of dou- 
ble-labeled probe for homogeneous de- 
tection of nucleic acid hybridization. 

MATERIALS AND METHODS 
Oligonucleotides 

Table 1 shows the nucleotide sequence 
of the oligonucleotides used in this 
study. Linker arm nucleotide (LAN) 
phosphoramidite was obtained from 
Glen Research. The standard DNA phos- 
phoramidites, 6-carboxyfluorescein (6- 
FAM) phosphoramidite, 6<arboxytet- 
ramethylrhodamine succinimidyl ester 
(TAMRA NHS ester), and Phosphaiink 
for attaching a 3'-blocklng phosphate 
were obtained from Perkin-EImer, Ap^ 
plied Biosystems Division. Oligonucle- 
otide synthesis was performed using an 
ABI model 394 DNA synthesizer (Applied 
Biosystems). Primer and complement 
oligonucleotides were purified using 
Oligo Purification Cartridges (Applied 
Biosystems). Double-labeled probes were 
synthesized with 6-FAM-labeied phos- 
phoramidite at the 5' end, LAN replacing 
one of the T's in the sequence, and Phos- 
phaiink at the 3' end. Following de- 
protection and ethanoi precipitation 
TAMRA NHS ester was coupled to the 
lAN-containing oligonucleotide in 250 
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FIGURE 1 Diagram of S' nuclease assay. Stepwise representation of the 5' — 3' nudeolytic ac- 
tivity of Taq DNA polymerase acting on a fluorogenic probe during one extension phase of PGR. 



mM Na-bicarbonate buffer (pH 9.0) at 
room temperature. Unreacted dye was 
removed by passage over a PD-10 Sepha- 
dex column. Finally, the double-labeled 
probe was purified by preparative high- 
performance liquid chromatography 
(HPLC) using an Aquapore C e 220x4.6- 
mm column with 7-u.m particle size. The 
column was developed with a 24-min 
linear gradient of 8-20% acetonitrile in 
0,1 m TEAA (triethylamine acetate). 
Probes are named by designating the se- 
quence from Table 1 and the position of 
the LAN-TAMRA moiety. For example, 
probe Al-7 has sequence Al with LAN- 
TAMRA at nucleotide position 7 from the 
5 J end. 



PCR Systems 

All PCR amplifications were performed 
In the Perkin-Elmer GeneAmp PCR Sys- 
tem 9600 using 50-u.l reactions that con- 
tained 10 mM Tris-HCl (pH 8.3), '50 mM 
KC1, 200 \lm dATP, 200 u-M dCTP, 200 \lM 
dGTP, 400 (jum dUTP, 0.5 unit of AmpEr- 
ase uracil N-glycosylase (Perkin-Elmer), 
and 1.25 unit of AmpliTaq DNA poly- 
merase (Perkin-Elmer). A 295-bp seg- 
ment from exon 3 of the human p-actin 



gene (nucleotides 2141-2435 in the se- 
quence of Nakajlma-Ujima et al.) <7) was 
amplified using primers AFP and ARP 
(Table 1), which are modified slightly 
from those of du Breuil et al. <8) Actin am- 
plification reactions contained 4 mw 
MgCI 2 , 20 ng of human genomic DNA, 
50 nM Al or A3 probe, and 300 nM each 



TABLE 1 Sequences of Oligonucleotides 



primer. The thermal regimen was 50°C 
(2 min), 95°C (10 min), 40 cycles of 95°C 
(20 sec), 60°C (1 min), and hold at 72°C. 
A 515-bp segment was amplified from a 
plasmid that consists of a segment of X 
DNA (nucleotides 32,220-32,747) in- 
serted in the Smal site of vector pUC119. 
These reactions contained 3.5 mM 
MgCl 2 , 1 ng of plasmid DNA, 50 nM P2 or 
P5 probe, 200 nM primer F119, and 200 
nM primer R119. The thermal regimen 
was 50°C (2 min), 95°C (10 min), 25 cy- 
cles of 95°C (20 sec), 57°C (1 min), and 
hold at 72°C 



Fluorescence Detection 

For each amplification reaction, a 40-pJ 
aliquot of a sample was transferred to an 
individual well of a white, 96-well micro- 
titer plate (Perkin-Elmer). Fluorescence 
was measured on the Perkin-Elmer Taq- 
Man LS-50B System, which consists of a 
luminescence spectrometer with plate 
reader assembly, a 485-nm excitation fil- 
ter, and a 515-nm emission filter Excita- 
tion was at 488 nm using a 5-nm slit 
width. Emission was measured at 518 
nm for 6-FAM (the reporter or R value) 
and 582 nm for TAMRA (the quencher or 
Q. value) using a 10-nm slit width. To 
determine the increase in reporter emis- 
sion that is caused by cleavage of the 
probe during PCR, three normalizations 
are applied to the raw emission data. 
First, emission intensity of a buffer blank 
is subtracted for each wavelength. Sec- 
ond, emission intensity of the reporter is 



Name 



Type 



Sequence 



F119 

R119 

P2 

P2C 

PS 

P5C 

AFP 

ARP 

Al 

A1C 

A3 

A3C 



primer 
primer 
probe 

complement 
probe 

complement 
primer 
primer 
probe 

complement 
probe 

complement 



ACCCACAGGAACTGATCACCACTC 

ATGTCGCGTTCCGGCTGACGTTCTGC 

TCGCATTAC1 GATCGTrGCCAACCAGTp 

GTACTGGTTGGCAACGATCAGTAATGCGATG 

CGGA'ITTGCTGGTATCTATG ACAAG G ATp 

TTCATCCTTGTCATAGATACCAGCAAATCCG 

TCACCCACACTGTGCCCATCTACGA 

CAGCGGAACCGCIX^TTGCCAATGG 

ATCCCCTCCCCCATGCCATCCTGCGTp 

AGACGCAGGATGGCATGGGGGAGGGCATAC 

CGCCCrGGACTTCGAGCAAGAGATp 
CCATCTCTTGCTCGAAGTCCAGGGCGAC 



For each oligonucleotide used in this study, the nucleic acid sequence is given, written m the 
5' -* 3' direction. There are three types of oligonucleotides: PCR primer, fluorogenic probe used 
in the 5 f nuclease assay, and complement used to hybridize to the corresponding probe. For the 
probes, the underlined base indicates a position where LAN with TAMRA attached was subsrt- 
tuted for a T. (p) The presence of a 3' phosphate on each probe. 
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A 1 -2 RAQGCCCTCCCCCATGCCATCCTGCGTp 

A1 -7 RATGCCCQCCCCCATGCCATCCTGCGTp 

A1 -1 4 RA7X5CCCTCCCCCAQGCCATCCTGCGTp 

A1 -1 9 JRATGCCCTCCCCCATGCCAQCCTGCGTp 

A1 -22 RATGCCCTCCCCCATGCCATCCQGCGTp 

A1-26 RatgccctcccccatgccatcctgcgQp 



Probe 


518 nm 


582 nm 


RQ- 


RQ+ 


ARQ 




no temp. 


+ temp. 


no temp. 


+ tomp. 








Al-2 


25.5 ± 2.1 


32.7±1.9 


38.2 + 3.0 


38.2 ±2.0 


0.67 ±0.01 


0.86 ±0.06 


0.19 + 0.06 


A1-7 


53.5 ± 4.3 


395.1 ±21.4 


108.5 ±6 3 


110.3 ±5.3 


0.49 + 0,03 


3.58 ±0.1 7 


3.09 ±0.1 8 


A1-14 


127.0 ±4.9 


403.5 ±19.1 


109.7 ±5.3 


93.1 ±6.3 


1.16 ±0.02 


4.34 ±0.15 


3.18±0.15 


A1-19 


187.5 ±17.9 


422.7 ±7.7 


70.3 ± 7.4 


73.0 ± 2.8 


2.67 ±0.05 


5.80 ±0.15 


3.13*0.16 


A1-22 


224.6 ±9.4 


482.2 ± 43.6 


100.0 + 4.0 


96.2 ±96 


2.25 ± 0.03 


5.02 ±0.11 


2.77 ±0.12 


A1-26 


160.2 ±8.9 


454.1 + 18.4 


93.1 ±5.4 


90.7 ± 3.2 


1.72 ±0.02 


5.01 ±0.08 


3.29 ±0.08 



FIGURE 2 Results of 5' nuclease assay comparing p-actin probes with TAMRA at different nucle- 
otide positions. As described in Materials and Methods, PCR amplifications containing the in- 
dicated probes were performed, and the fluorescence emission was measured at 518 and 582 nm. 
Reported values are the average ±1 s.D. for six reactions run without added template (no temp.) 
and six reactions run with template ( + temp.). The RQ ratio was calculated for each individual 
reaction and averaged to give the reported RQ~ and RQ + values. 



divided by the emission intensity of the 
quencher to give an RQ ratio for each 
reaction tube. This normalizes for well- 
to- well variations in probe concentra- 
tion and fluorescence measurement. Fi- 
nally, ARQ is calculated by subtracting 
the RQ value of the no-template control 
(RQ~) from the RQ value for the com- 
plete reaction including template 
(RQ + ). 

RESULTS 

A series of probes with increasing dis- 
tances between the fluorescein reporter 
and rhodamine quencher were tested to 
investigate the minimum and maximum 
spacing that would give an acceptable 
performance in the 5' nuclease PCR as- 
say. These probes hybridize to a target 



sequence in the human p-actin gene. 
Figure 2 shows the results of an experi- 
ment in which these probes were in- 
cluded in PCR that amplified a segment 
of the p-actin gene containing the target 
sequence. Performance in the 5' nu- 
clease PCR assay is monitored by the 
magnitude of ARQ, which is a measure 
of the increase in reporter fluorescence 
caused by PCR amplification of the 
probe target. Probe Al-2 has a ARQ value 
that is close to zero, indicating that the 
probe was not cleaved appreciably dur- 
ing the amplification reaction. This sug- 
gests that with the quencher dye on the 
second nucleotide from the 5' end, there 
is insufficient room for Taq polymerase 
to cleave efficiently between the reporter 
and quencher. The other five probes ex- 
hibited comparable ARQ values that are 



clearly different from zero. Thus, all five 
probes are being cleaved during PCR am- 
plification resulting in a similar increase 
in reporter fluorescence. It should be 
noted that complete digestion of a probe 
produces a much larger increase in re- 
porter fluorescence than that observed 
in Figure 2 (data not shown). Thus, even 
in reactions where amplification occurs, 
the majority of probe molecules remain 
uncleaved. It is mainly for this reason 
that the fluorescence intensity of the 
quencher dye TAMRA changes little with 
amplification of the target. This is what 
allows us to use the 582-nm fluorescence 
reading as a normalization factor. 

The magnitude of RQ" depends 
mainly on the quenching efficiency in- 
herent in the specific structure of the 
probe and the purity of the oligonucle- 
otide. Thus, the larger RQ~ values indi- 
cate that probes Al-14, Al-19, Al-22, and 
A 1-26 probably have reduced quenching 
as compared with A 1-7. Still, the degree 
of quenching is sufficient to detect a 
highly significant increase in reporter 
fluorescence when each of these probes 
is cleaved during PCR. 

To further investigate the ability of 
TAMRA on the 3' end to quench 6-FAM 
on the 5' end, three additional pairs of 
probes were tested in the 5' nuclease 
PCR assay. For each pair, one probe has 
TAMRA attached to an internal nucle- 
otide and the other has TAMRA attached 
to the 3' end nucleotide. The results are 
shown in Table 2. For all three sets, the 
probe with the 3' quencher exhibits a 
ARQ value that is considerably higher 
than for the probe with the internal 
quencher. The RQ" values suggest that 
differences in quenching are not as great 
as those observed with some of the Al 
probes. These results demonstrate that a 
quencher dye on the 3' end of an oligo- 
nucleotide can quench efficiently the 



TABLE 2 Results of 5' Nuclease Assay Comparing Probes with TAMRA Attached to an Internal or 3'-terminal Nucleotide 



518 nm 



582 nm 



Probe 


no temp. 


+ temp. 


no temp. 


+ temp. 


RQ" 


RQ + 


ARQ 


A3-6 
A3-24 


54.6 ± 3.2 
72.1 ± 2.9 


84.8 ± 3.7 
236.5 ± 11.1 


116.2 ±6.4 
84.2 ± 4.0 


115.6 ± 2.5 
90.2 ± 3.8 


0.47 ± 0.02 
0.86 ± 0.02 


0.73 ± 0.03 
2.62 ± 0,05 


0.26 ± 0.04 
1.76 ± 0.05 


P2-7 
P2-27 


82.8 ± 4.4 
113.4 ±6.6 


384.0 ±34.1 
S55.4 ± 14.1 


105.1 ± 6.4 
140.7 ± 8.5 


120.4 ± 10.2 
118.7 ±4.8 


0.79 ± 0.02 
0.81 ± 0.01 


3.19 ±0.16 
4.68 ±0.10 


2.40 ±0.16 
3.88 ± 0.10 


P5-10 
P5-28 


77.5 ± 6.5 
64.0 ± S.2 


244.4 ± 15.9 
333.6 ±12.1 


86.7 ± 4.3 
100.6 ± 6-1 


95.8 ± 6.7 
94.7 ± 6.3 


0.89 ± 0.05 
0.63 ± 0.02 


2.55 ± 0.06 
3.53 ± 0.12 


1.66 ±0.08 
2.89 ± 0.13 



Reactions containing the indicated probes and calculations were performed as described in Material and Methods and in the legend to Fig. 2. 
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fluorescence of a reporter dye on the 5' 
end. The degree of quenching is suffi- 
cient for this type of oligonucleotide to 
be used as a probe in the 5' nuclease PCR 
assay. 

To test the hypothesis that quenching 
by a 3' TAMRA depends on the flexibility 
of the oligonucleotide, fluorescence was 
measured for probes in the single- 
stranded and double-stranded states. Ta- 
ble 3 reports the fluorescence observed 
at 518 and 582 nm. The relative degree 
of quenching is assessed by calculating 
the RQ ratio. For probes with TAMRA 
6-10 nucleotides from the 5' end, there 
is little difference in the RQ values when 
comparing single-stranded with double- 
stranded oligonucleotides. The results 
for probes with TAMRA at the 3' end are 
much different. For these probes, hy- 
bridization to a complementary strand 
causes a dramatic increase in RQ, We 
propose that this loss of quenching is 
caused by the rigid structure of double- 
stranded DNA, which prevents the 5 1 
and 3' ends from being in proximity. 

When TAMRA is placed toward the 3' 
end, there is a marked Mg 2 * effect on 
quenching. Figure 3 shows a plot of ob- 
served RQ values for the Al series of 
probes as a function of Mg 2 * concentra- 
tion. With TAMRA attached near the 5' 
end (probe Al-2 or Al-7), the RQ value at 
0 mM Mg 24 " is only slightly higher than 
RQ at 10 mM Mg 2+ . For probes Al-19, 
Al-22, and Al-26, the RQ values at 0 mM 
Mg 24, are very high, indicating a much 



reduced quenching efficiency. For each 
of these probes, there is a marked de- 
crease in RQ at 1 mM Mg 2 "*" followed by 
a gradual decline as the Mg z+ concen- 
tration Increases to 10 mM. Probe Al-14 
shows an Intermediate RQ value at 0 mM 
Mg 2 * with a gradual decline at higher 
Mg 2 * concentrations. In a low-salt en- 
vironment with no Mg 2+ present, a sin- 
gle-stranded oligonucleotide would be 
expected to adopt an extended confor- 
mation because of electrostatic repul- 
sion. The binding of Mg 2+ ions acts to 
shield the negative charge of the phos- 
phate backbone so that the oligonucle- 
otide can adopt conformations where 
the 3' end is close to the 5 f end. There- 
fore, the observed Mg 2 * effects support 
the notion that quenching of a 5' re- 
porter dye by TAMRA at or near the 3' 
end depends on the flexibility of the oli- 
gonucleotide. 

DISCUSSION 

The striking finding of this study is that 
it seems the rhodamine dye TAMRA, 
placed at any position in an oligonucle- 
otide, can quench the fluorescent emis- 
sion of a fluorescein (6-FAM) placed at 
the 5' end. This Implies that a single- 
stranded, double-labeled oligonucle- 
otide must be able to adopt conforma- 
tions where the TAMRA is close to the 5' 
end. It should be noted that the decay of 
6-FAM in the excited state requires a cer- 
tain amount of time. Therefore, what 



TABLE 3 Comparison of Fluorescence Emissions of Single-stranded and 
Double-stranded Fluorogenic Probes 



518 nm 



582 nm 



RQ 



Probe 


ss 


ds 


ss 


ds 


ss 


ds 


Al-7 


27.75 


68.53 


61.08 


138.18 


0.45 


0.50 


Al-26 


43.31 ' 


509.38 


53.50 


93.86 


0.81 


5.43 


A3-6 


16.75 


62.88 


39.33 


165.57 


0.43 


0.38 


A3-24 


30.05 


578.64 


67.72 


140.25 


0.45 


3.21 


P2-7 


35.02 


70.13 


S4.63 


121.09 


0.64 


0.58 


T2-27 


39.89 


320.47 


65.10 


61.13 


0.61 


5.25 


P5-10 


27.34 


144-85 


61.95 


165.54 


0.44 


0.87 


P5-28 


33.65 


462.29 


72.39 


104.61 


0.46 


4.43 



(ss) Single-stranded. The fluorescence emissions at 518 or 582 nm for solutions containing a final 
concentraUon of SO nM indicated probe, 10 mM Tris-HCl (pH 83), 50 mM KG, and 10 mM MgCl 2 . 
(ds) Double-stranded. The solutions contained, in addition, 100 nM A1C for probes Al-7 and 
Al-26, 100 nM A3C for probes A3^ and A3-24, 100 om P2C for probes P2-7 and P2-27, or 100 nM 
PSC for probes P5-10 and PS-28. Before the addition of MgCI 2 . 120 pi of each sample was heated 
at 95°C for 5 min. Following the addition of 80 ^1 of 25 mM MgCl 2 , each sample was allowed to 
. cool to room temperature and the fluorescence emissions were measured. Reported values are 
the average of three determinations. 



matters for quenching is not the average 
distance between 6-FAM and TAMRA 
but, rather, how close TAMRA can get to 
6-FAM during the lifetime of the 6-FAM 
excited state. As long as the decay time of 
the excited state is relatively long com- 
pared with the molecular motions of the 
oligonucleotide, quenching can occur. 
Thus, we propose that TAMRA at the 3' 
end, or any other position, can quench 
6-FAM at the 5' end because TAMRA is in 
proximity to 6-FAM often enough to be 
able to accept energy transfer from an 
excited 6-FAM. 

Details of the fluorescence measure- 
ments remain puzzling. For example, Ta- 
ble 3 shows that hybridization of probes 
Al-26, A3-24, and P5-28 to their comple- 
mentary strands not only causes a large 
increase in 6-FAM fluorescence at 518 
nm but also causes a modest increase in 
TAMRA fluorescence at 582 nm. If 
TAMRA is being excited by energy trans- 
fer from quenched 6-FAM, then loss of 
(quenching attributable to hybridization 
should cause a decrease in the fluores- 
cence emission of TAMRA. The fact that 
the fluorescence emission of TAMRA in- 
creases indicates that the situation is 
more complex. For example, we have an- 
ecdotal evidence that the bases of the 
oligonucleotide, especially G, quench 
the fluorescence of both 6-FAM and 
TAMRA to some degree. When double- 
stranded, base-pairing may reduce the 
ability of the bases to quench. The pri- 
mary factor causing the quenching of 
6-FAM in an intact probe is the TAMRA 
dye. Evidence for the importance of 
TAMRA is that 6-FAM fluorescence 
remains relatively unchanged when 
probes labeled only with 6-FAM are used 
in the 5' nuclease PCR assay (data not 
shown). Secondary effectors of fluores- 
cence, both before and after cleavage of 
the probe, need to be explored further. 

Regardless of the physical mecha- 
nism, the relative independence of posi- 
tion and quenching greatly simplifies 
the design of probes for the 5' nuclease 
PCR assay. There are three main factors 
that determine the performance of a 
double-labeled fluorescent probe in the 
5' nuclease PCR assay. The first factor is 
the degree of quenching observed in the 
intact probe. This is characterized by the 
value of RQ", which is the ratio of re- 
porter to quencher fluorescent emis- 
sions for a no template control PCR. In- 
fluences on the value of RQ~ include 
the particular reporter and quencher 
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mM Mg 

FIGURE 3 Effect of Mg 2+ concentration on RQ ratio for the Al series of probes. The fluorescence 
emission intensity at 5 18 and 582 nm was measured for solutions containing 50 nM probe, 10 mM 
Tris-HCl (pH 8.3), 50 mM KC1, and varying amounts (O-10 mw) of MgCI 2 . The calculated RQ 
ratios (518 nm intensity divided by 582 nm intensity) are plotted vs. MgCl 2 concentration (him 
Mg). The key (upper right) shows the probes examined. 



dyes used, spacing between reporter and 
quencher dyes, nucleotide sequence 
context effects, presence of structure or 
other factors that reduce flexibility of 
the oligonucleotide, and purity of the 
probe. The second factor is the efficiency 
of hybridization, which depends on 
probe T m , presence of secondary struc- 
ture in probe or template, annealing 
temperature, and other reaction condi- 
tions. The third factor is rhe efficiency at 
which Taq DNA polymerase cleaves the 
bound probe between the reporter and 
quencher dyes. This cleavage is depen- 
dent on sequence complementarity be- 
tween probe and template as shown by 
the observation that mismatches in the 
segment between reporter arid quencher 
dyes drastically reduce the cleavage of 
probe/ 0 

The rise in RQ" values for the Al se- 
ries of probes seems to indicate that the 
degree of quenching is reduced some- 
what as the quencher is placed toward 
the 3' end. The lowest apparent quench- 
ing is observed for probe Al-19 (see Fig. 
3) rather than for the probe where the 
TAMRA is at the .3' end (Al-26). This is 
understandable, as the conformation of 
the 3' end position would be expected to 
be less restricted than the conformation 
of an internal position. In effect, a 
quencher at the 3' end is freer to adopt 
conformations close to the 5' reporter 
dye than is an internally placed 
quencher. For the other three sets of 



probes, the interpretation of RQ' values 
is less clear-cut. The A3 probes show the 
same trend as Al, with the 3' TAMRA 
probe having a larger RQ" than the in- 
ternal TAMRA probe. For the P2 pair, 
both probes have about the same RQ~ 
value. For the P5 probes, the RQ" for the 
3' probe is less than for the internally 
labeled probe. Another factor that may 
explain some of the observed variation is 
that purity affects the RQ" value. Al- 
though all probes are HPLC purified, a 
small amount of contamination with 
unquenched reporter can have a large ef- 
fect on RQ~. 

Although there may be a modest ef- 
fect on degree of quenching, the posi- 
tion of the quencher apparently can 
have a large effect on the efficiency of 
probe cleavage. The most drastic effect is 
observed with probe A 1-2, where place- 
ment of the TAMRA on the second nu- 
cleotide reduces the efficiency of cleav- 
age to almost zero. For the A3, P2, and PS 
probes, ARQ is much greater for the 3' 
TAMRA probes as compared with the in- 
ternal TAMRA probes. This is explained 
most easily by assuming that probes 
with TAMRA at the 3' end are more likely 
to be cleaved between reporter and 
quencher than are probes with TAMRA 
attached internally. For the Al probes, 
the cleavage efficiency nf probe Al-7 
must already be quite high, as ARQ does 
not increase when the quencher is 
placed closer to the 3' end. This illus- 



trates the importance of being able to 
use probes with a quencher on the 3' 
end in the 5' nuclease PGR assay. In this 
assay, an increase in the intensity of re- 
porter fluorescence is observed only 
when the probe is cleaved between the 
reporter and quencher dyes. By placing 
the reporter and quencher dyes on the 
opposite ends of an oligonucleotide 
probe, any cleavage that occurs will be 
detected. When the quencher is attached 
to an internal nucleotide, sometimes the 
probe works well (Al-7) and other times 
not so well (A3-6). The relatively poor 
performance of probe A3-6 presumably 
means the probe is being cleaved 3' to 
the quencher rather than between the 
reporter and quencher. Therefore, the 
best chance of having a probe that reli- 
ably detects accumulation of PCR prod- 
uct in the S' nuclease PCR assay is to use 
a probe with the reporter and quencher 
dyes on opposite ends. 

Placing the quencher dye on the 3' 
end may also provide a slight benefit in 
terms of hybridization efficiency. The 
presence of a quencher attached to an 
internal nucleotide might be expected to 
disrupt base-pairing and reduce the T m 
of a probe. In fact, a 2°C-3°C reduction 
in T m has been observed for two probes 
with internally attached TAMRAs. (9) This 
disruptive effect would be minimized by 
placing the quencher at the 3' end. Thus, 
probes with 3' quenchers might exhibit 
slightly higher hybridization efficiencies 
than probes with internal quenchers. 

The combination of Increased cleav- 
age and hybridization efficiencies means 
that probes with 3' quenchers probably 
will be more tolerant of mismatches be- 
tween probe and target as compared 
with internally labeled probes. This, tol- 
erance of mismatches can be advanta- 
geous, as when trying to use a single 
probe to detect PCR-amplified products 
from samples of different species. Also, it 
means that cleavage of probe during PCR 
is less sensitive to alterations in an- 
nealing temperature or other reaction 
conditions. The one application where 
tolerance of mismatches may be a disad- 
vantage is for allelic discrimination. Lee 
et al/ 1 * demonstrated that allele-spedflc 
probes were cleaved between reporter 
and quencher only when hybridized to a 
perfectly complementary target. This al- 
lowed them to distinguish the normal 
human cystic fibrosis allele from the 
AFSQ8 mutant. Their probes had TAMRA 
attached to the seventh nucleotide from 
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the 5' end and were designed so that any 
mismatches were between the reporter 
and quencher. Increasing the distance 
between reporter and quencher would 
lessen the disruptive effect of mis- 
matches and allow cleavage of the probe 
on the incorrect target. Thus, probes 
with a quencher attached to an internal 
nucleotide may still be useful for allelic 
discrimination. 

In this study loss of quenching upon 
hybridization was used to show that 
quenching by a 3' TAMRA is dependent 
on the flexibility of a single-stranded oli- 
gonucleotide. The increase in reporter 
fluorescence intensity, though, could 
also be used to determine whether hy- 
bridization has occurred or not. Thus, 
oligonucleotides with reporter and 
quencher dyes attached at opposite ends 
should also be useful as hybridization 
probes. The ability to detect hybridiza- 
tion in real time means that these probes 
could be used to measure hybridization 
kinetics. Also, this type of probe could be 
used to develop homogeneous hybrid- 
ization assays for diagnostics or other ap- 
plications. Bagwell et al. <IO) describe just 
this type of homogeneous assay where 
hybridization of a probe causes an in- 
crease in fluorescence caused by a loss of 
quenching. However, they utilized a 
complex probe design that requires add- 
ing nucleotides to both ends of the 
probe sequence to form two imperfect 
hairpins. The results presented here 
demonstrate that the simple addition of 
a reporter dye to one end of an oligonu- 
cleotide and a quencher dye to the other 
end generates a fluorogenic probe that 
can detect hybridization or PCR arriplifi- 
. cation. 
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Wc have developed a novel "real time" quantitative PCR method. The method measure* PCR product 
accumulation through a duaMabded fluorogenlc probe (U., TaqMan Prob*)> This method provides very 
accurate and reproducible quantitation of Rene copies. Unlike otlwr quantitative PCR methods, real-lime PCR 
does nor require posvPCR sample handling, preventing potential PCR product carry-over contamination and 
resulting In much fester and higher throughput assays. The reaMlmft PCR method has a very large dynamic 
ranee of starting career molecule determination (at least five orders of magnitude). Real-time quantitative 
PCR is extremely accurate and less-labor-intensive than current quantitative PCR methods. 



Quantitative nucleic acid sequence analysis Mas 
had an imporlnnt role in many fields of biologi- 
cal research. Measurement of geue expression 
(UNA) has been used extensively In monitoring 
biological responses to various stimuli (Tan ^ al» 
1994; Huang et at. 199Sa,h; I'rud'homme et. al. 
1995), Quantitative gene analysis (DNA) has 
In-cn toed to determine the genome quantity of 3 
l^rvicular gene, as in the case, ot the human H1LR2 
gene, which Is amplified in -30% of breast tu- 
mors (Starnon e.t -al. 1987). Gene and genome, 
quantitation (DNA and RNA) also have been used 
fur analysis of human immunodeficiency vims 
(IllV) burden demonstrating changes in the lev- 
els of virus throughout the different phases of the 
disease (Connor et al. 1993; Platak et al. ivv;sh; 
J : urtado et al. 

Many methods have heen described for the 
quantitative analysis ot nucleic acid sequences 
(both for RNA and DNA; Southern VJ/b; Sharp ct 
al. 19B0; Thomas 1980)." Recently, PCR lias 
proven to be a powerful tool for quantitative 
nucleic acid analysis. PCR and reverse transcrip- 
tase (KT)-PCR have permitted the analysis of 
minimal starting quantities of nucleic acid (as 
little as one cell equivalent). This lias mode pos- 
sible many experiments that could nol have, been 
performed with traditional methods. Although 
PCR has provided o powerful tool, it is imperative 
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thai It be uaed properly f«r quantitation (tt»uy- 
maekers 1995). Many early reports of quantita- 
tive PCK and R'l-PCR described quantitation of 
the PCR product but did not measure! the initial 
target sequence quantity, It is essentia) to design 
proper controls for the quantitation of the initial 
target sequences (Pcrrc. 1992; Clement I et al. 
100?.) 

KcsMirchcrs have developed several methods 
of quantitative PCR and RT-PCR. One approach 
measures PCR product quanlity in the lug phase 
of the reaction before the plateau (Kellogg et al. 
1990; Pang ct a). 1990). This method requires 
(hat each sample has equal Input amounts of 
nucleic add and that each somple under analysis 
amplifies with Ulenlic.il efficiency up to the. point 
of quantitative analysis. A gene sequence (con- 
tained in nil samples at relatively constant quan- 
tity, such as p-aclln) can be ustsd for sample, 
.amplification «fticlcncy normali2ation. Using 
conventional methods of PCR detection and 
quantitation (gel electrophoresis or plate capture 
hybridization), it is extremely laborious to assure . 
that all samples are analyzed during the log phase 
of the reaction (for bolh the target gene and the 
normalization gene). Another method, quantita- 
tive competitive (QC)-POR, has been developed 
and is used widely for PGR quantitation. QC-l'CR 
relics on the inclusion of an internal control 
competitor in each reaction (Becker-Andre 1991; 
PlataK cl oh 1993»,1>). The efficiency of each re- 
action is normalised to thr. inlcrnol competitor. 
a tnnwn amount of Intfcniai competitor can be 
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added to each cample. To obtain relative rpianU 
ration, the unknown target 1'CR product is com- 
pared witli the known competitor i*CM product. 
.Success of « quantitative competitive PC It assay 
relics on developing an Internal control thai am- 
plifics with the Same efficiency as tlic Uugel moh 
cculc. The design of the conipcUtor *ud the vali- 
dation of amplification efficienc-iea. jcquJrc n 
<lcdicatcd effort. However, because QC^-PCK does 
nut require (hat PCR pioducts be analyyxxl during 
the log phase of (he. amplification, it is tin: easier 
vf the two methods to use* 

Severn I detection Kysteim ate used for quan 
tltative i'CK and RT-Pc:n Analysis; (1) a^irt^o 
gel*, (2) Jluumwii! labeling of PCR products and 
detection with laaiTr-iiiclucxx! fluorescence using 

capillary electrophoresis (h'asco ct al. 199$; Wil- 
liams et al. 1996) or acrylamidc gels, and (3) plate 
capture and sandwich probe hybridist I lost (Mul- 
der el al. 1994). Although these method* pmv«*l 
successful, each method requires post-PCR ma- 
atpuIarJons That add time to the analysis and 
may lead to Jaliuratoty i outdid 1 nation. The 
sample throughput of these met buds is limited 
(wllh the i-xccpilon of the plate capture ap- 
proach), and, thttrv.fm-e., these methods are not 
well >uitcvl fuj u.m:^ demanding high -sample 
throughput (i.e., screening of large numbers of 
hit>iuwlcrv.ulrr>, i>i analyzing S^nn/lva fwj diagnos- 
tics pr clinical trials). 

Here, yve report the development of a novel 
a.vsay for quantitative DNA analysis. The assay is 
has«d cm lh^ «Sf. <>f th«* S' -nuclease assay first 
described by Holland' el ah (199] ), The method 
uses the -5' nucleate activity of l\u( polymerase to 
dcavc a noncxlcndlblc hybridlswiion probe dur- 
ing the extension phaar of I'Clt. T)n\ npproach 
uses dual-In bclcd fluorogenic hybridisation 
probes (Lcc ct id. 1993; ilasslcr ct ah 1993; Uvafc 
el nl, 1$95a,b). One fluorescent dye serves «s a 
reporter |FAM (i.e., 6-c«rboxyfhiore»cein)| and i1s 
emission spectra is quenched by the second fluo- 
rescent dye, TAMRA (I. a., o-carboxy-ietramcthyl- 
rliodaminc). The nuclease degradation of the hy- 
bridization probe releases the quenching of Ihe 
I'AM fluorescent emission, resulting In an in- 
crease in peak fluorescent emission at 5JB nin. 
The use Of <a sequence detector (Am Prism) allows 
measurement of fluorescent spectra of all 9o wells 
of the thermal cycler continuously during the 
i*GK amplification. Therefore, .the n.-actioix» a;c 
monitored in real lime, flic output data is de- 
scribed and quantitative analysis of input target 
l)NA sequences is discussed below. 



RESULTS 



PCR Produce Dcrecrlon in R«al Time 

The goal %va* to develop a high-throughput, sen- 
xitive, and nceuratc gene quantitation assay for 
use in monitoring lipid mediated thftrapeu Tic- 
gene delivftry. A plasmld encoding human factor 
VIII gene sequence, pI'8TM (see Methods). w;ix 
used as a model therapeutic k« iu » The ass*«y usr< 
fluorescent Taqwan methodology and an instru- 
ment capable of measuring fluorescence in real 
time <AB1 Prism 7700 Sequence Dctcclnr). 'Ilu: 
TcKpu;«u reaction requires n hybridist ion prnhr 
laixled with two different fluorescent dyes. One 
dye Is a reportvr dyv (l ; AM), the other ix X cpicnch- 
hig dye (TAMRA). When thv prubt: Is intact, fiuo- 
iea<cji( eitcrgy transfer occurs and the reporter 
dye fluorescent emission is absorbed by the 
quenching dye (TAMRA). During Die extension 
phase of the l'CK cycle, Ihe fluorescent hybrid- 
ization probe is d caved by the 5'-'l' nuclcolytic 
octlvity of the DNA polymerase. On cleavage of 
the probe, the reporter dye emission is no longer 
transferred efficiently to the quenching dye, re 
suiting hi ait increase of the report or tlyo fluorct- 
cei^t cint.uiioji wp4*rCtra, 1*CU primers uud probuN 
were der»igneil foi I he human fiiclor VJ11 se- 
quence and human p-actln gene (as dv..ieribed in 
\(cttu»ds). Optimization reactions were per- 
formed to choose the approprlutc probe uiul 
magnesium concentrations yield irjg the higin-^i 
intensity of reporter fluorescent signal vfithout 
sacrificing specificity. The Instrument ur.es a 
chaigo coupled device (i.e., CCD camera) for 
measuring the fluorescent emission speelm from 
. c t(H} to C,$0 mti, Kach PCR tube wa» jnonitured 
sequentially fnr 2/> rnsue with continuous moni- 
toring throughout the ouiplifieatkm. Uach lube 
wa,% rti-exandncd every fi.5 sec. Computer s<>f(- 
wnre. was designed to exnmijir the fluorescent In- 
tensity of both the reporter dye (I-AM) and 
the quenching dye (TAMRA). The H uo resccnt 
intensity of the quenching dye, TAMUA, changes 
very litlff! over the course of the PCR amplify 
cation (data not shown). Therefore, the Intensity 
<>f TAM31A dye emission serves as ;m iuiornxil 
.tt^iulard with which to tioitnallxe the reporter 
dye (1 : AM) emission variatJons. The software cal- 
culous a value termed AHn (or Al^Q) ualng the. 
following equations ARn - (Un J ) (Rn"), where- 
Kn 4 • ernlsMlon intcjishy of reporter/emission in- 
tensity of quencher at any given lime In n reae 
tlon tube, and Ru r- emission ihttmsitity of re- 
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poncr/emisslon Jme-miiy uf qucneiier measured 
prior lo PCX ampliiicalioii in lhar same reaction 
tube. Vor the purpose of quantitation, the last 
three data points (ARm) collected during the. ex* 
tension step for each PCK cycle were analysed. 
The nucleolytic degradation of the Jiyurklffeiiion 
probe occurs during the 'extension phase of I'tat,- 
and, therefore, reporter fluorescent t:iim,m*rt In- 
creases; during this ilme. Jin: Uuw data points 
were averaged for each k;K cycle and the mean 
value for each was plotted in an "amplification 
plot" shown in J J ijjurc 3 A- The AKn mean value is 
plotted on the j*-axJs, and time, representee) by 
cycic mimhrr, is plot I act on thv;r-axU. During t tic 
earJy cycles of the PCR amplification, the AKn 



value remains at base Jjno When sufficlenl hy- 
brid; /a I Ion probe has been cleaved by die Tmj 
polymera:tc nuflftAAO activity, the intensity of re- 
porter fluorescent emission inereut;et.. Most PCU 
amplify* I ions reach a plateau phone of reporter 
fluorocv.ut cm is si on If the reaction Is carried nut 
to high cycle munbcis. The amplification plot \ f J 
examined cuiJy jn tint fraction, at a point thai 
icprcscnU ilu- log phAW of product arrmnula* 
tion. This is done by ussignlng an arbibary 
threshold thai \a based on the variability of the 
baseline daia. In Figure 1A, the threshold whs set 
at 10 stnndard deviat ion* above the mean Of 
base line emfemuu calculated froru tyc:Ie^ 1 lo 1 fv 
Once the threshold is chosen, the point at which 
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Figure 1 PCR product detection in real time. {A) The Model 7700 Mjflware- will construct amplification plot* 
from the extension phase fluorescent emission data collected during the PCR amplification. The standard de- 
viation is determined from the data points collected from the base line of the amplification ploL C, values are 
calculated by determining the point at which the fluorescence exceeds a threshold limit (usually 10 times the 
Standard deviation of the base line). (B) Overlay of amplification plots of serially (1:2) diluted human genomic 
DNA somplcs amplified with p-actin primers. (Q Input DNA concentration of the samples plotted versus C T . All 
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the amplification plot crossi-d the thrcsholdivde 
fined as C r . C, is reported us l he cycle number ;*t 
this point. Ar will be demons! rut «d, thw C, .value 
Ik pn^licdve of the quantity of input target. 

Cj Values Provide a Quantitative 'Measurement, of 
Input Target Sequences 

Figure IB shows amplification plot* of'l^dirfcw- 
enl PGR amplifications overlaid. The amputa- 
tions wore performed on a 1:2 serial dilution ■•«« 
human genomic J)NA. The amplified targol w:u 
human p actin. The amplification plots «hift to 
the right (to higher threshold cycles) n* the inj>ut 
target quantity reduced, 'ilw jjs ex|X?cted he. 
<:u\\m rtiHctioriK with fewer starting copiux nf the 
target molecule require greater amplification to 
degrade enough probe to attain the Threshold 
fluorescence, An arbitrary threshold of 10 stan- 
dard deviations above the base Jine was used to 
determine the C r values. Figure 1C represents the 
C r values plotted versus the sample dilution 
value. Each dilution was amplified in triplicate 
VCM amplifications and plotted as mean values 
with error bars representing one standard devia- 
tion. The C r values decrease linearly with increas- 
ing target quantity. Thus, C, values can be used 
as a quantitative measurement of the input target 
number. H should be noted that the amplifica- 
tion plot for the 15.6- ng sample shown In Mgure 
in does not reflect the same fluorescent rate of 
Increase exhibited by most of the other samples. 
The 15,6-ng sample also achieves endpoint pla- 
teau at a lower fluorescent value than would he 
expected based on the input -PNA. This phenonv 
enon has been, observed, occasionally with other . 
samples (daui not shown) and may be attribut- 
able to lute cycle inhibition; this hypothesis is 
still under Investigation, it is important to note 
that the flattened slope and early plateau do not 
impact significantly the calculated C, value as 
demonstrated by the fll on the line shown in 
Figure 1C. All triplicate amplifications resulted in 
very similar Cr values— the standard deviation 
did not exceed 0,5 for any dilution. This experi- 
ment contains a > 1 00,000-fold range of input tar- 
get molecules. Using C v values for quantitation 
permits a much larger assay range than directly 
using total fluorescent emission intensity for 
quantitation. The linear range. oi Huorcsccnl in- 
tensity measurement of the ABI Prism 7700 Se- 



mtmts over n very large r;in|»c nf relative cianlng 
target quantities. 

Sample Preparation Validation 

Several parameters influence the efUclenry'nf 
PCM amplification: magnesium and salt concen- 
tration*, reaction conditions (i.e., time and ie.m- 
pe.rature), PCM target si zc and composition, 
primer sequences, and sample purity. Ail of The 
above laclon are. common to a single J'CR assay, 
except sample to sample purity. In an effort to 
validate the method of sample preparation for 
the factor Vlii assay, PCR amplification reproduc. 
ibihty and oiflcicncy ol JO replicate sample 
piejwrations were examined. After genomic DNA 
was prepared from the 10 replicate samples, the 
DNA was quamitalcd by ultra violet spectroscopy. 
Amplifications were performed analyzing p-aciin 
gene, content in 100 and 2$ ng of total genomic 
DNA. Hach I'CK amplification was performed in 
triplicate. Comparison of C r values for each trip- 
licate sample show minimal variation based on 
standard deviation and coefficient of variance 
(Table 1). Therefore, each ol the triplicate PCU 
amplifications was highly reproducible, th 
stratlng that real time PCJR using this Instrumen- 
tation introduces minimal variation Into the 
quantitative J'CR analysis. Comparison of the 
mean C n values of the. JO replicate sample prepa- 
rations also showed minimal variability, indicat- 
ing that each sample preparation yielded similar 
results for fi-acUn gene quantity. The highest CV 
difference between any of rue samples was 0.S5 
and 0.73 for the 100 and 25 ng samples, respec- 
tively. Additionally, the amplification of each 
sample exhibited an equivalent rate of fluorc> 
cent emission intensity change per amount of 
DNA target analyzed ns indicoicd by similar 
slopes derived from Ihe sample, dilutions (Pig! 2). 
Any sample containing an excess of a i'CK inhibi- 
tor would exhibit a greater measured 0-aciln C r 
value for a given quantliy of DNA. In addition, 
the inhibitor would be diluted along with Ihe 
sample in the dilution analysis (Hg. Z), altering 
the expected c, value change. Each sample am- 
plification yielded a similar result in the analysis, 
demonsira ling that this method of .sample prepa- 
ration is highly reproducible with regard lo 
sample purity. 

Quantitative Analysis of a Plasmid After 
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Table 1. RoproduclbBUty of &»roplo Preparation Method 



100 ng 



25 ng 



Samplo 



no. 



8 
9 

10* 
Mean 



standard 
mean deviation 



CV 



18.24 

18.23 

1v\33 

18.33 

18.35 

18,44 

18.3 

18.3 

18.42 

18.15 

18.23 

18.32 

18.4 

18.38 

18.46 

18.54 

18.67 

IV 

18.28 

18.36 

18-52 

18.45 

18.7 

18.73 

18.18 

18.36 
18.42 
18.57 
18.66 

0 10) 



10.27 0.06 
18.17 0.06 

18.34 0.07 

18.23 0.O8 

1B.-12 0.O4 

18.74 0.21 

18.39 0.12 

18.63 0.16 

18.29 0.1 

18,55. 0.12 

18.<12 0.17 



0.32 
03? 
0,36 
0.46 
0,23 
1.26 
0.66 
0,B3 
0..W 

0.65 
0,90 



20.48 
20.55 

20.61 

20.59 

20.41 

20.54 

20.6 

20.49 

20.48 

20,44 

20.38 

20.68 

20.87 

20.63 

21.09 

21,04 

21.01 

20.67 

20,73 

20.65 

20.98 

20.84 

20.75 

20,46 

20.54 

20.48 

20.79 

20.78 

20.62 



standard 
mean deviation CV 



20»S1 0.03 0.17 

?0..*4 0.11 0.54 

20.54 0.06 0,28 

20.43 0.05 0.26 

20.73 0.1 3 0.61 

21.06 0.03 0.15 

20.68 0.04 0.2 

20.86 0,12 0.57 

20.51 0,07 0.32 

20.73 0.1 0.-16 

20.66 0.19 0.94 



tor containing a partial cDNA for human fat-tor 
Vlll, pl-BTM. A scries of transfectiom was *oi 
up using a decreeing amount of the plasmid\40, 
*, w\5, and O.l u,g). t\vi-my-four hours post- 
transfetii on, total r^NA w<is purified froin each 
flask uf irlh. p-Actiu geiiejjuaijllly was chosen as 
a value foi* normo]i^ti«#n of genomic. DNA con- 
centration frum each siuupic. In this cxucunient, 
|i-actin gene content should remain constant 
relative to rota] genomic DNA. Figure 3 shows the 
result of the p-actln DNA measurement (100 ng 
total DNA determined hy ultraviolet spectros- 
.copy) ot each sample. Kaeh Sample was analysed 
in triplicate and the mean |i-actin Gj- values of 
the triplicates were .plotted (error bars represent 



between any two sample moam was 0.95 C... Ten 
nanograms of total UNA of each sample were also 
examined f«r p-acUn. The results again showed 
that very similar amounts of genomic DNA were 
present; the maximum mean p set in C t value 
difference wa.s 1.0. As Figure 3 shows, Hit- rate of 
p-aetiii C r change between, the 100 and 10-ng 
sample* was similar (sJopc values rangw barwoon 
3.56 and -3,45). Thlx vcriAes again \hni the 
method of .sample preparation yields samples of 
identical PCR integrity (i,e- # no sample contained 
an excessive amount of a PCR inhibitor). How- 
ever, these results indicate that each sample con- 
tained slight diffeiencex in the actual amount of 
genomic DNA analyzed. Determination of actual 
ucuumic DNA Concent ration was accomplished 
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Ffcyurv 2 Souiple preparation purity. 1 he replicate 
samples shown In Tabl<? 7 wore also amplified In 
tripicate using 25 ng of each DNA sample. The fig- 
uie showi die input DNA concentration (100 and 
25 ncj) vs. O, In ihf* iipur^. ihp 100 and 75 ng 
points for «ach sample are connected by a line. 



by plotting the mean p-aetio O, value obtained 
for uarti 100 sample un a ft-aclln standard 
ianve (shown In J'Sg. 4C). The actual genomic 
ONA concentration of each 5uth]>1«, «, was ob 
l allied by extrapolation to tliu a'-;**!*. 

Figure, 4 A sliows the measured. (I.u v n«n- 
liorill (tilled) quautitki-s of factor VI}) pin sin id 
ONA (pJ'tSTM) from each' of tlu*. four transient cell 
lr<i"*fccUon&. Each reaction contained 100 n£ of 
total sample DNA (as determined by UV spectrtw 
copy), llach sample w;is analyzed in triplicate 



Ptlk amplifications. As shown, pI-BTM purified 
,hxue_fbe 293 cells decrease.*; (mean C, values: in- 
Ct«iisi^) with decreasing amounts of pi asm Id 
ilrumft'UttL Th« inc»" O a values obtnincd for 
pFfcTM in Tlgurc 4 A were plotted on ;i standard 
curve com prbed uf si* i hilly diluted pKHTM, 
shown .In figure 4R. The quanlily uJ nl'KTM, b, 
found in each of the four tranftfcctJom was de- 
termined- by extrapolation to the x axis of the 
standard curve in I'igurc 41*. Thttsc uncorrected 
m values, b, for pWJ'M were norundl^id to deter- 

mine Uie actual amount of pl'8TM fuuml per 100 
iik of genomic DNA by using the: equation:. 

b x 10 0 tig jscliial pI-TTITwl copies i>er 
(/ * 100 ng of genomic DNA 

where a •- actual genomic DNA in u sample and 
b v. pVlYVM copies from the standard curve, 'Die 
norma.) ixed quantity- of pl'6TM per 100 ng of ge- 
nomic DNA for each o/ the four lran.Vfectlon.s M 
shown in Hgu re 4J i . 'Hi ttse ro u 1 1 s Mi ow i h a i t he 
quantity of factor vin plasiuld associated wiih 
the 273 eel) a, Z1 hr after irujusfuciiun, dei.ie.ises- 
wlth decreasing pJasniui uh Herniation u.icd in 
the iraiis/cclion. The quantity of pl'BTM associ- 
ated wiirx 293 cells, after transection with 40 (xg 
of nlasmid, was 35 pg per 100 ng genomic DNA. 
Tills results In -520 plasinid copies per cell. 
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Figure 5 Artalysb uf ticmsfectcd cdl DNA quantity 
and purity. I he DNA preparations of the lour 293 
eel! transactions (40^ A, 0.5, and 0.1 ug of pF8TM) 
were analysed for the P-actln gerie* 100 and 10 ng 
(determined by ultraviolet spectroscopy) of each 
sample were amplified in triplicate. For each 
amount of pF8TM that was transacted, the {i-actln 
C T values are plotted versus the total input DNA 



DISCUSSION 

Wo have described a new method for quantitnt- 
inft gcjic copy numbers using real*timc analysis 
of PCK amplificatlon.s. ReaMimc PCK is compat- 
ible with cither of the two FCR. (KT-FCR) ap- 
proacho: (1) quanlllative cc>iiir>elitivt: where an 
internal wmpclltOl' for each target sequence is 
used for normalixatlon (data not shown) or (2) 
quantitative comparative PCK usli;y a nonintli^- 
tion gene contained within the sample (i.e., p-nc- 
tin) or a "housekeeping" gene for RT-POK. If 
equal amounts of nucleic add are analyzed for 
each sample and if the amplification cf/itiency 
before quantitative analysis identical for each 
sample, the huernal contiol (nojmoliraliou jjene 
or competitcjr) should Rive equal MKoals for fllJ 
samples. 

The real-time PCH method offers sevenil ad- 
vanlages over the other two methods t:urrcntly 
employed (see the Introduction). Tirsl, the real- 
time PCR method is perfomu-d in a closed-tube 
system and requires no past-PCK manipulation 
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Flgurn 4 . OuantUativo AnofyKi/c of pF8TM in transfccted cell*. </4) Amount of 
plasmid DNA used for (he transaction plotted against the ineun C, value deter- 
mtnod for prfitM remaining JH hr alter tremlectlon. (0,0 Standard curves of 
pf-ftTM *nd P-actln, respectively. pfOTM DNA (0) and genomic DNA (Q were 
diluted serially 1;5 before amplification with the appropriate primers. The p-actin 
standard curve wh* usod to normalise the results of A to 1 00 ruj of genomic DNA. 
(0) Tho amount of pFSTM present per 100 of genomic DNA, 



of sample Therefore, lh<» potent in) for rCK con- 
lamination in the laboratory is reduced because 
Amplified products cam he ;mHlyy-ed and disposed 
of without opening tho reaction tubes. Second, 
this method suppojU the use of is iiorm;diy.,itk>u 
8« nci P-flctin) for quantitative. PGR or house- 
keeping genes for quantitative RT-l'CR controls. 
Analysis h performed in real time during the Jog 
phase of product accumulation. Analysis during 
kiK phase permits in any different genes (over « 
wide input target range) to he analyzed simulta- 
neously, without concern of reaching rend (on 
plateau at different cycle*, This will make jhuUI- 
■jgene. analysis assays much cwici tv develop, be- 
cause individual internal uuitpetUuis will nut. bc 
necded for coch gene under analysis- Third, 
sample throughput will h mcasc Uitmicdindly 
with the new method because there is no post- 
PCR processing lime. Additionally, wusking In a 
96-well format is highly compatible with auto* 
illation technology. 

The real-time PCR method "J* highly repro- 
ducible. Replicate amplifications can be analyzed 



for c-ach sample minimising potential error. The. 
sy Kit tin allows for a very large us say dynamic 
range (upproaehiiig 1,000,000-fold starting Un- 
gel). Using u standnrd curve for the target oJ in- 
terest/ rcJuTivu copy number values can be deter- 
mined for any unknown sum pic. Fluorescent 
threshold Values, C p cojirJatr. linearly with rela- 
tive 1>NA copy numbers. Real time quantitative 
KT-PCK methodology (C.iibson et al v this laujc?) 
has- also been de.vdopcd. finally, real time qu«iii^ 
tifative PCtt methodology can be used lu develop 
high-throughput screening assays for a variety of 
applications [quantitative gene CKpiea&iuii (KT- 
PCft), Rune copy assays (11cr2, 131V, etc.)* £cm> 
*yp' n ft (knockout mouse analysis), and Jmmum>- 
PCUJ. 

Real-time POt it i ay al.to Ik: ]>crformcd using 
intercalating dyes (Htguc.hi et ul. such as 

ctJvlditim bromide. The fluorogenic prohe. 
method offers a major advantage aver inter- 
calating dyes- greater specificity (i.e., primer 
dimers and nonspecific PCR products are not de- 
tuned). 
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METHODS 

Generation of <t Piasmld Containing a Partial 
cDNA for Human Factor YII1 

Total RNA Yf«> harvested (UNAko* tt from To! Test, Inc., 
r nc.n<lSWOOd, TX) from iixnsfccled wtth a factor VIII 
exjirvsshm vector. pC:iS2.tk-?.&0 (Kmtm ci tt\. WHO; Goi* 
mnn ct al. li>90). A factor VIII partial ellNA wpientv W«S 
^i-rutn>lca by irr K:1t IGoneAnip It/ (Tin RNA t»r,U Kit 
(pan N80K-OT79, l'li Ai»pi»«) hiosystews, I'osiei CUy, CA)J 

using ihe I J C:U primurs Wfor mi«I PHrcv (primer sequence* 

arc shown below). The ampHcon was rcamplified uslnfi 
modified i*for and Wrcv primers (appended with -tow/Ill 
and //j/jcltli restriction site sequences «t tliv V ei»l> and 
cloned Into jXiKM- 3Z (1 v k>jt«^« Corp., Mudwon, Wl). The 
resulting clone, pPSTM. was used lor transient transfectloti 
oJ" 293 cell*. 



Amplification of Target DNA ami Dvlecilon of 
Amplicon Factor VIII Plasmid DNA 

(ph'STM) was tunpllftatl with die p.imeii F8f<*r r» '-<:<;< 

crrcKU^vAUAUiixjAtxiicn'C'a' and wrev .v-aaacct- 

t^OOCHTKiATCUJTAOa-fl'.Thti ivmciUiu pioduvrd w *IW- 
up i'C:K produce The forward primer w«» designed to w 
ogul/.i! u unique M'i]uniu- Ami id lit (lie f>' untranslated 
region of tin: paitriil pClS2.tkZ3l> pldMiiiO mid ihercfore 

doV5 not K'^w^iiUv mid amplify ihv llumm> factor VIII 

gem*. I'rimorn wore oh a*: on with the avsiv1imr(»'of tlx* com- 
puter program Oligo 1,0 (Nntionul UUuctonecs, Inc„ I'ly. 
mouth, MN). The human p-actt" g*Mie was Amplified with 
ihc primus (i-tw-eiii forward primer S^TCACCCAOAf TGT 
GGCCATd'AGGA-.V and fj-adin reverse p»irncr 5'-( lAG- 
CGGAACCXicri^AruK :c;AArGG-3\ The reaction pro- 
duced a 2VS-hp vCM product. 

Amplification reactions (SO pJ) contained a PNA 
sample, )0X I'CK Uuffc.r tl (S ^\), 200 ikm dAIT, dClT, 
dGTT, And 400 jim rflHT, 4 inxi NigGI?, Units AmpJI 
7W<; PNA poiymciasc. 0.5 unit Ampftrnse uracil N-fily- 
uoityluftv (UNO), 50 pinole of each faciei VUl julnwi/ unci 15 
pninle of ouch p AC. tin pdiner. The U'ac lUuvt, aUo iH>ntutncd 
OtlC Of the following (lt*tcet l«n prolM»s (100 nM.rticli)* 

i'8jin.»i»<- '^'(KAW)Af:crjYrj , <:c:Ac:c:TC5frri , (rrrr<:TC3T- 

GCCTT(TAMRA)p 3'«ud p-nciiji probv S' (TAM)ATCf^:c:- 
X(TAMKA)C:C<Xr;ATGC0;A'ICp-3' where p indiealcs 
phosphorylAlion nnd X indicates a linker arm nucleotide. 
Reaction tuln.*? wrn- Mt<:n>Ait\p Optical Tubes (port AUiTi- 
ln-rNkOI 0933/rcrkln lUmur) thai wore f rot tod (id i»ert;l«i 
F.lnicr) to prevent Ugltl from /cflcclhl^t Tube caps wen* 
simitar 1» MieroAi^ip C;nj>3 but specially desiflrtcd to pre- 
vent U^h I seu tiering. All ol Ui<« Vi'M t^Mi«wm(d>|e* wero »^>»- 
pln:d by PK Applied Itioftyritm* (l^wter C!Wy, CA) CXCCpt 
tho factor VIU priiuera, wbicb wcic syn»besi/.cd al Ccnen 
lech, Inc. (South 5t«t Fraticlsco, CA). l*rol»cy wiw desigrn.nl 
using the Oligo 4.0 software, follvwln}; guidelines 

ucsieo in me Model 7700 .sequence l>euH*U>r liMttumenl 
manual, briefly, protn: 'J' m iJiimld be a I least 5 U C ))l«ber 
than che auueidlux letupeMtiirc used during Ihcrmul ey- 
rhugi prhncrs should nt>l fuitn sUlde duplexed with the 
probe. 

The theriuol cycling Conditions Included 2 Jiitn at 
SO'Cind 10 Fnhi al 9S°C. Ilicrmal cycling proceeded wJlh 
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reactions were |xsrfonued m» the Model 7 700 Sequence Uc- 
t 4 ^ior (Pt Appltetl Utusyvtviuv), wlilrh coiuahn; -a Ccne- 
Allip SyMvm VJOUO. Ueaellon etnitlitio.^ w. rr- pvn, 

Rruiimtcd urn .i I'uww Maciiih»b V100 (Apple O.iinpiitnr, 
Soma CJaru, c:A) linKcd djrvoly to the Model won .St'- 
cjuence IXdecior. Afi»V^ u of daU w * v alK ° pe^onned on 
the Mni UUish eompxder. ( ;nllnrtit)n ;tnd iuinlycU coftw:*re 
w»> Oevvlo|Wl at l*K Applied nio«yxtuitis. 

Transection of Cells with rector VIII Convtruci 

Pour 1175 flasks of 293 cells {ATCX: CJXl. 1573), n human 
fciol kidney «>spen*ion cell line, were ^niwn to 80% con* 
(tuertcy And tranifcrted pt : H*i*M. CJelis were Krown In the 
follnwiug medlA! 50% HAM'X Hi 2 without GMT, 30% Ioim 
glucose nulbccen's inodlflrd Kojjle inediuin (UMKM) with, 
om glyiiuc: widi flodium bicarbtinatc, 10% ietal 1*mvc 
seruth, 2 mim L^luUinjno And 1% pcnidliiti-strcptomy- 
ilo. The mcdlfl wo» dunged 30 ndn Mo«' »hc t^nsfee 
lion, pl-UTM DNA amouiua of 40, 4, OS, and 0.1 h; **en> 
iidtlcd to 1.5) ml of a solution a^nialnhiR 0.125 m CiiCl,; 
and 1 X Ml'j'ltf. The four mixturert were left al rtH»n tern- 
l.ie.rtiHiri' foi It) mlrt and then i#dded dmpwlftc* to ihe cell*. 
TJic n«^x ^v»%- h«-uljuled at 37°C'arw1 SM* CX\ for 24 hr. 
washed with PUS, a»d rofluspended In PUS. The tvs\\n 
^AMided cells were divided into «li(|t«>la uod UNA w^a c*- 
trueted Inimcdiutely usinK IheQiAanip KUhkI Kit (Qbpen. 
Ciiot^Yvorth, VA), DNA w u s elided Into 2(K) j*t c»l 30 
r lVls-H0) utpll 8.0." 
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ABSTRACT Wnt family members are critical to many 
developmental processes, and components of the Wnt signal- 
ing pathway have been linked to tumorigenesis in familial and 
sporadic colon carcinomas. Here we report the identification 
of two genes, WISP-1 and WISP-2, that are up-regulated in the 
mouse mammary epithelial cell line C57MG transformed by 
Wnt-1, but not by Wnt-4. Together with a third related gene, 
WISPS , these proteins define a subfamily of the connective 
tissue growth factor family. Two distinct systems demon- 
strated WISP induction to be associated with the expression of 
Wnt-1. These included (i) C57MG cells infected with a Wnt-1 
retroviral vector or expressing Wnt-1 under the control of a 
tet racy line repress ible promoter, and (ii) Wnt-1 transgenic 
mice. The WISP-1 gene was localized to human chromosome 
8q24.1-8q24.3. WISP-1 genomic DNA was amplified in colon 
cancer cell lines and in human colon tumors and its RNA 
overexpressed (2- to > 30-fold) in 84% of the tumors examined 
compared with patient-matched normal mucosa. WISP-3 
mapped to chromosome 6q22-6q23 and also was overex- 
pressed (4- to >40-fold) in 63% of the colon tumors analyzed. 
In contrast, WISP-2 mapped to human chromosome 20ql2- 
20ql3 and its DNA was amplified, but RNA expression was 
reduced (2- to > 30-fold) in 79% of the tumors. These results 
suggest that the WISP genes may be downstream of Wnt-1 
signaling and that aberrant levels of WISP expression in colon 
cancer may play a role in colon tumorigenesis. 



Wnt-1 is a member of an expanding family of cysteine-rich, 
glycosylated signaling proteins that mediate diverse develop- 
mental processes such as the control of cell proliferation, 
adhesion, cell polarity, and the establishment of cell fates (1, 
2). Wnt-1 originally was identified as an oncogene activated by 
the insertion of mouse mammary tumor virus in virus-induced 
mammary adenocarcinomas (3, 4). Although Wnt-1 is not 
expressed in the normal mammary gland, expression of Wnt-1 
in transgenic mice causes mammary tumors (5). 

In mammalian cells, Wnt family members initiate signaling 
by binding to the seven-transmembrane spanning Frizzled 
receptors and recruiting the cytoplasmic protein Dishevelled 
(Dsh) to the cell membrane (1, 2, 6). Dsh then inhibits the 
kinase activity of the normally constitutively active glycogen 
synthase kinase-3j3 (GSK-3/3) resulting in an increase in 
0-catenin levels. Stabilized 0-catenin interacts with the tran- 
scription factor TCF/Lefl, forming a complex that appears in 
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the nucleus and- binds TCF/Lefl target DNA elements to 
activate transcription (7, 8). Other experiments suggest that 
the adenomatous polyposis coli (A PC) tumor suppressor gene 
also plays an important role in Wnt signaling by. regulating 
/3-catenin levels (9). APC is phosphorylated by GSK-3/3, binds 
to /3-catenin, and facilitates its degradation. Mutations in 
either APC or 0-catenin have been associated with colon 
carcinomas and melanomas, suggesting these mutations con- 
tribute to the development of these types of cancer, implicating 
the Wnt pathway in tumorigenesis (1). 

Although much has been learned about the Wnt signaling 
pathway over the past several years, only a few of the tran- 
scriptionally activated downstream components activated by 
Wnt have been characterized. Those that have been described 
cannot account for all of the diverse functions attributed to 
Wnt signaling. Among the candidate Wnt target genes are 
those encoding the nodal-related 3 gene, Xnr3 t a member of 
the transforming growth factor (TGF)-/3 superfamiiy, and the 
homeobox genes, engrailed, goosecoid, twin (Xtwn), and siamois 
(2). A recent report also identifies c-myc as a target gene of the 
Wnt signaling pathway (10). 

To identify additional downstream genes in the Wnt signal- 
ing pathway that are relevant to the transformed cell pheno- 
type, we used a PCR-based cDNA subtraction strategy, sup- 
pression subtractive hybridization (SSH) (11), using RNA • 
isolated from C57MG mouse mammary epithelial cells and 
C57MG cells stably transformed by a Wnt-1 retrovirus. Over- 
expression of Wnt-1 in this cell line is sufficient to induce a 
partially transformed phenotype, characterized by elongated 
and refractile cells that lose contact inhibition and form a 
multilayered array (12, 13). We reasoned that genes differen- 
tially expressed between these two cell lines might contribute 
to the transformed phenotype. 

In this paper, we describe the cloning and characterization 
of two genes up-regulated in Wnt-1 transformed cells, WISP-1 
and WISP-2, and a third related gene, WISP-3. The WISP genes 
are members of the CCN family of growth factors, which 
includes connective tissue growth factor (CTGF), Cyr61, and 
nov, a family not previously linked to Wnt signaling. 

MATERIALS AND METHODS 

SSH. SSH was performed by using the PCR-Select cDNA 
. Subtraction Kit (CLONTECH). Tester double-stranded 

Abbreviations: TGF, transforming growth factor; CTGF, connective 
tissue growth factor; SSH, suppression subtractive hybridization; 
VWC, von Willebrand factor type C module. 

Data deposition: The sequences reported in this paper have been 
deposited in the Genbank database (accession nos. AF100777, 
AF100778, AF100779, AF100780, and AF100781). 
^To whom reprint requests should be addressed, e-mail: diane@gene. 
com. 
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cDNA was synthesized from 2 of poly(A)* RNA isolated 
from the C57MG/Wnt-1 cell line and driver cDNA from 2 /xg 
of poly(A) + RNA from the parent C57MG cells. The sub- 
tracted cDNA library was subcloned into a pGEM-T vector for 
further analysis. 

cDNA Library Screening. Clones encoding full-length 
mouse WISP-1 were isolated by screening a AgtlO mouse 
embryo cDNA library (CLONTECH) with a 70-bp probe from 
the original partial clone 568 sequence corresponding to amino 
acids 128-169. Clones encoding full-length human WISP-1 
were isolated by screening AgtlO lung and fetal kidney cDNA 
libraries with the same probe at low stringency. Clones en- 
. coding full-length mouse and human WISP-2 were isolated by 
screening a C57MG/Wnt-1 or human fetal lung cDNA library 
with a probe corresponding to nucleotides 1463-1512. Full- 
length cDNAs encoding WISPS were cloned from human 
bone marrow and fetal kidney libraries. 

Expression of Human WISP RNA. PCR amplification of 
first-strand cDNA was performed with human Multiple Tissue 
cDNA panels (CLONTECH) and 300 mM of each dNTP at 
94°C for 1 sec, 62°C for 30 sec, 72°C for 1 min, for 22-32 cycles. 
WISP and glyceraldehyde-3-phosphate dehydrogenase primer 
sequences are available on request. 

In Situ Hybridization. 33 P-labeled sense and antisense ribo- 
probes were transcribed from an 897-bp PCR product corre- 
sponding to nucleotides 601-1440 of mouse WISP-1 or a 
294-bp PCR product corresponding to nucleotides 82-375 of 
mouse WISP-2. AH tissues were processed as described (40). 

Radiation Hybrid Mapping. Genomic DNA from each 
hybrid in the Stanford G3 and Genebridge4 Radiation Hybrid 
Panels (Research Genetics, Huntsville, AL) and human and 
hamster control DNAs were PCR-amplifled, and the results 
were submitted to the Stanford or Massachusetts Institute of 
Technology web servers. 

Cell Lines, Tumors, and Mucosa Specimens. Tissue speci- 
mens were obtained from the Department of Pathology (Uni- 
versity of Pittsburgh) for patients undergoing colon resection 
and from the University of Leeds, United Kingdom. Genomic 
DNA was isolated (Qiagen) from the pooled blood of 10 
normal human donors, surgical specimens, and the following 
ATCC human cell lines: SW480, COLO 320DM, HT-29, 
WiDr, and SW.403 (colon adenocarcinomas), SW620 (lymph 
node metastasis, colon adenocarcinoma), HCT 116 (colon 
carcinoma), SK-CO-1 (colon adenocarcinoma, ascites), and 
HM7 (a variant of ATCC colon adenocarcinoma cell line LS 
174T). DNA concentration was determined by using Hoechst 
dye 33258 intercalation fluorimetry. Total RNA was prepared 
by homogenization in 7 M GuSCN followed by centrifugation 
over CsCl cushions or prepared by using RNAzoL 

Gene Amplification and RNA Expression Analysis. Relative 
gene amplification and RNA expression of WISPs and c-myc in 
the cell lines, colorectal tumors, and normal mucosa were 
determined by quantitative PCR. Gene-specific primers and 
fluorogenic probes (sequences available on request) were 
designed and used to amplify and quantitate the genes. The 
relative gene copy number was derived by using the formula 
2< Act ) where ACt represents the difference in amplification 
cycles required to detect the WISP genes in peripheral blood 
lymphocyte DNA compared with colon tumor DNA or colon 
tumor RNA compared with normal mucosal RNA. The 
d-method was used for calculation of the SE of the gene copy 
number or RNA expression level. The WISP- specific signal was 
normalized to that of the glyceraIdehyde-3-phOsphate dehy- 
drogenase housekeeping gene. All TaqMan assay reagents 
were obtained from Perkin-Elmer Applied Biosystems. 

RESULTS 

Isolation of WISP-1 and WISP-2 by SSH. To identify Wnt- 
1-inducible genes, we used the technique of SSH using the 
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mouse mammary epithelial cell line C57MG and C57MG cells 
that stably express Wnt-1 (11). Candidate differentially ex- 
pressed cDNAs (1,384 total) were sequenced. Thirty-nine 
percent of the sequences matched known genes or homo- 
logues, 32% matched expressed sequence tags, and 29% had 
no match. To confirm that the transcript was differentially 
expressed, semiquantitative reverse transcription-PCR and 
Northern analysis were performed by using mRNA from the 
C57MG and C57MG/ Wnt-1 cells. 

Two of the cDNAs, WISP-1 and WISP-2, were differentially 
expressed, being induced in the C57MG/Wnt-1 cell line, but 
not in the parent C57MG cells or C57MG cells overexpressing 
Wnt-4 (Fig. I A and B). Wnt-4, unlike Wnt-1, does not induce 
the morphological transformation of C57MG cells and has no 
effect on j3-catenin levels (13, 14). Expression of WISP-1 was 
up-regulated approximately 3-fold in the C57MG/Wnt-1 cell 
line and WISP-2 by approximately 5-fold by both Northern 
analysis and reverse transcription-PCR. 

. An independent, but similar, system was used to examine 
WISP expression-after Wnt-1 induction. C57MG cells express- 
ing the Wnt-1 gene under the control of a tetracycline- 
repressible promoter produce low amounts of Wnt-1 in the 
repressed state but show a strong induction of Wnt-1 mRNA 
and protein within 24 hr after tetracycline removal (8). The 
levels of Wnt-1 and WISP RNA isolated from these cells at 
various times after tetracycline removal were assessed by 
quantitative PCR. Strong induction of Wnt-1 mRNA was seen 
as early as 10 hr after tetracycline removal. Induction of WISP 
mRNA (2- to 6-fold) was seen at 48 and 72 hr (data not shown). 
These data support our previous observations that show that 
WISP induction is correlated with Wnt-1 expression. Because 
the induction is slow, occurring after approximately 48 hr, the 
induction of WISPs may be an indirect response to Wnt-1 
signaling. 

cDNA clones of human WISP-1 were isolated and the 
sequence compared with mouse WISP-1 . The cDNA sequences 
of mouse and human WISP-1 were 1,766 and 2,830 bp in length, 
respectively, and encode proteins of 367 aa, with predicted 
relative molecular masses of «=40,000 (M r 40 K). Both have 
hydrophobic N-terminai signal sequences, 38 conserved cys- 
teine residues, and four potential N-linked glycosylation sites 
and are 84% identical (Fig. 7A\ ' 

Full-length cDNA clones of mouse and human WISP-2 were 
1,734 and 1,293 bp in length, respectively, and encode proteins 
of 251 and 250 aa, respectively, with predicted relative molec- 
ular masses of "-27,000 (M T 27 K) (Fig. 2B). Mouse and human 
WISP-2 are 73% identical. Human WISP-2 has no potential 
N-linked glycosylation sites, and mouse WISP-2 has one at 

CS7MG 
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Fig. 1. WISP-1 and WISP-2 are induced by Wnt-1, but not Wnt-4, 
expression in C57MG cells. Northern analysis of WISP-1 (A) and 
WISP-2 (B) expression in C57MG, C57MG/Wnt-1, and C57MG/ 
Wnt-4 cells. Poiy(A) + RNA (2 /ig) was subjected to Northern blot 
analysis and hybridized with a 70-bp mouse WISP- 1 -specific probe 
(amino acids 278-300) or a 190-bp WISP- 2- specific probe (nucleotides 
1438-1627) in the 3' untranslated region. Blots were rehybridized with 
human /3-actin probe. 
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FiC. 2. Encoded amino acid sequence alignment of mouse and 
human WISP-1 (A) and mouse and human WJSP-2 (B). The potential 
signal sequence, insulin-like growth factor-binding protein (IGF-BP), 
VWC, thrombospondin (TSP), and C-terminal (CT) domains are 
underlined. 

position 197. WISP-2 has 28 cysteine residues that are con- 
served among the 38 cysteines found in WISP-L 

Identification of WlSP-3. To search for related proteins, we 
screened expressed sequence tag (EST) databases with the 
WISP-1 protein sequence and identified several ESTs as 
potentially related sequences. We identified a homologous 
protein that we have called WISP-3. A full-length human 
WlSP-3 cDNA of 1,371 bp was isolated corresponding to those 
ESTs that encode a 354 7 aa protein with a predicted molecular 
mass of 39,293. WlSP-3 has two potential N-linked glycosyl- 
ation sites and 36 cysteine residues. An alignment of the three 
' human WISP proteins shows that WISP-1 and WISP-3 are the 
most similar (42% identity), whereas WISP-2 has 37% identity 
with WISP-1 and 32% identity with WISP-3 (Fig. 3A). 

WISPs Are Homologous to the CTGF Family of Proteins. 
Human WISP-1, WISP-2, and WISP-3 are novel sequences; 
however, mouse WISP-1 is the same as the recently identified 
Elml gene. Elml is expressed in low, but not high, metastatic 
mouse melanoma cells, and suppresses the in vivo growth and 
metastatic potential of K-1735 mouse melanoma cells (15). 
Human and mouse WISP-2 are homologous to the recently 
described rat gene, rCop-l (16). Significant homology (36- 
44%) was seen to the CCN family of growth factors. This family 
includes three members, CTGF, Cyr61, and the protoonco- 
gene nov. CTGF is a chemotactic and mitogenic factor for 
fibroblasts that is implicated in wound healing and flbrotic 
disorders and is induced by TGF-0 (17). Cyr61 is an extracel- 
lular matrix signaling molecule that promotes cell adhesion, 
proliferation, migration, angiogenesis, and tumor growth (18, 
19). nov (nephroblastoma overexpressed) is an immediate 
early gene associated with quiescence and found altered in 
Wilms tumors (20). The proteins of the CCN family share 
functional, but not sequence, . similarity to Wnt-1. All are 
secreted, cysteine-rich heparin binding glycoproteins that as- 
sociate with the cell surface and extracellular matrix. 

WISP proteins exhibit the modular architecture of the CCN 
family, characterized by four conserved cysteine-rich domains 
(Fig. 3B) (21). The N-terminal domain, which includes the first 
12 cysteine residues, contains a consensus sequence (GCGC- 
CXXC) conserved in most insulin-like growth factor (IGF)- 
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Fig. 3. (A) Encoded amino acid sequence alignment of human 
WISPs. The cysteine residues of WISP-1 and WISP-2 that are not 
present in WISP-3 are indicated with a dot. (B) Schematic represen- 
tation of the WISP proteins showing the domain structure and cysteine 
residues (vertical lines). The four cysteine residues in the VWC domain 
that are absent in WISP-3 are indicated with a dot. (C) Expression of 
WISP mRNA in human tissues. PCR was performed on human 
multiple-tissue cDNA panels (CLONTECH) from the indicated adult 
and fetal tissues. 

binding proteins (BP). This sequence is conserved in WISP-2 
and WISP-3, whereas WISP-1 has a glutamine in the third 
position instead of a glycine. CTGF recently has been shown 
to specifically bind IGF (22) and a truncated nov protein 
lacking the IGF-BP domain is oncogenic (23). The von Wil- 
lebrand factor type C module (VWC), also found in certain 
collagens and mucins, covers the next 10 cysteine residues, and 
is thought to participate in protein complex formation and 
oligomerization (24). The VWC domain of WISP-3 differs 
from all CCN family members described previously, in that it 
contains only six of the 10 cysteine residues (Fig. 3 A and B). 
A short variable region follows the VWC domain. The third 
module, the thrombospondin (TSP) domain is involved in 
binding to sulfated glycoconjugates and contains six cysteine 
residues and a conserved WSxCSxxCG motif first identified in 
thrombospondin (25). The C-terminal (CT) module contain- 
ing the remaining 10 cysteines is thought to be involved in 
dimerizatiori and receptor binding (26). The CT domain is 
present in all CCN family members described, to date but is 
absent in WISP-2 (Fig. 3 A and B). The existence of a putative 
signal sequence and the absence of a transmembrane domain 
suggest that WISPs are secreted proteins, an observation 
supported by an analysis of their expression and secretion from 
mammalian cell and baculovirus cultures (data not shown).. 

Expression of WISP mRNA in Human Tissues. Tissue- 
specific expression of human WISPs was characterized by PCR 
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analysis on adult and fetal multiple tissue cDNA panels. 
WISP- 1 expression was seen in the adult heart, kidney, lung, 
pancreas, placenta, ovary, small intestine, and spleen (Fig. 3C). 
Little or no expression was detected in the brain, liver, skeletal 
muscle, colon, peripheral blood leukocytes, prostate, testis, or 
thymus. WISP-2 had a more restricted tissue expression and 
was detected in adult skeletal muscle, colon, ovary, and fetal 
lung. Predominant expression of WISP-3 was seen in adult 
kidney and testis and fetal kidney. Lower levels of WISP-3 
expression were detected in placenta, ovary, prostate, and 
small intestine. 

In Situ Localization of WISP-1 and WISP-2. Expression of 
WISP-1 and WISP-2 was assessed by in situ hybridization in 
mammary tumors from Wnt-1 transgenic mice. Strong expres- 
sion of WISP-1 was observed in stromal fibroblasts lying within 
the fibrovascular tumor stroma (Fig; 4 A-D). However, low- 
level WISP-1 expression also was observed focally within tumor 
cells (data not shown). No expression was observed in normal 
breast. Like WISP-1, WISP-2 expression also was seen in the 
tumor stroma in breast tumors from Wnt-1 transgenic animals 
(Fig. 4 E-H). However, WISP-2 expression in the stroma was 
in spindle-shaped cells adjacent to capillary vessels, whereas 




Fig. 4. (A y C, £, and G) Representative hematoxylin/eosin-stained 
images from breast tumors in Wnt-1 transgenic mice. The correspond- 
ing dark-field images showing WISP-1 expression are shown in B and 
D. The tumor is a moderately well-differentiated adenocarcinoma 
showing evidence of adenoid cystic change. At low power (A and B), 
expression of WISP-1 is seen in the delicate branching fibrovascular 
tumor stroma (arrowhead). At higher magnification, expression is seen 
in the stromal(s) fibroblasts (C and D), and tumor cells are negative. 
Focal expression of WISP-1, however, was observed in tumor cells in 
some areas. Images of WISP-2 expression are shown in E-H. At low 
power (E and /^.expression of WISP-2 is seen in cells lying within the 
fibrovascular tumor stroma. At higher magnification, these cells 
appeared to be adjacent to capillary vessels whereas tumor cells are 
negative (G and H). 



the predominant cell type expressing WISP-l was the stromal 
fibroblasts. 

Chromosome Localization of the WISP Genes. The chro- 
mosomal location of the human WISP genes was determined 
by radiation hybrid mapping panels. WISP-1 is approximately 
3.48 cR from the meiotic marker AFM259xc5 [logarithm of 
odds (lod) score 16.31] on chromosome 8q24.1 to8q24.3, in the 
same region as the human locus of the novH family member 
(27) and roughly 4 Mbs distal to c-myc (28). Preliminary fine 
mapping indicates that WISP-1 is located near D8S1712 STS. 
WISP-2 is linked to the marker SHGC-33922 (lod = 1,000) on 
chromosome 20ql2-20ql3.1. Human WISP-3 mapped to chro- 
mosome 6q22-6q23 and is linked to the marker AFM211ze5 
(iod = 1,000). WISP-3 is approximately 18 Mbs proximal to 
CTGF and 23 Mbs proximal to the human cellular oncogene 
MYB (27, 29). 

Amplification and Aberrant Expression of WISPs in Human 
Colon Tumors. Amplification of protooncogenes is seen in 
many human tumors and has etiological and prognostic sig- 
nificance. For example, in a variety of tumor types, c-myc 
amplification has been associated with malignant progression 
and poor prognosis (30). Because WISP-1 resides in the same 
general chromosomal location (8q24) as c-myc, we asked 
whether it was a target of gene amplification, and, if so, 
whether this amplification was independent of the c-myc locus.. 
Genomic DNA from human colon cancer cell lines was 
. assessed by quantitative PCR and Southern blot analysis. (Fig. 
5 A and B). Both methods detected similar degrees of WISP-1 
amplification. Most cell lines showed significant (2- to 4-fold) 
amplification, with the HT-29 and WiDr cell lines demonstrat- 
ing an 8-fold increase. Significantly, the pattern of amplifica- 
tion observed did not correlate with that observed for c-myc, 
indicating that the c-myc gene is not part of the amplicon that 
involves the WISP- 1 locus. 

We next examined whether the WISP genes were amplified 
in a panel of 25 primary human colon adenocarcinomas. The 
relative WISP gene copy number in each colon tumor DNA 
was compared with pooled normal DNA from 10 donors by 
quantitative PCR (Fig. 6). The copy number of WISP-1 and 
WISP-2 was significantly greater than one, approximately 
2-fold for WISP-1 in about 60% of the tumors and 2- to 4-fold 
for WISP-2 in 92% of the tumors (P < 0.001 for each). The 
copy number for WISP-3 was indistinguishable from one (P = 
0.166). In addition, the copy number of WISP-2 was signifi- 
cantly higher than that of WISP-1 (P < 0.001). 

The levels of WISP transcripts in RNA isolated from 19 
adenocarcinomas and their matched normal mucosa were 
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Fig. 5. Amplification of WISP-l genomic DNA in colon cancer cell 
lines. (A) Amplification in cell line DNA was determined by quanti- 
tative PCR. (5) Southern blots containing genomic DNA (10 /ig) 
digested with EcoRJ (WISP-l) or Xba\ (c-myc) were hybridized with 
a 100-bp human WISP-l probe (amino acids 186-219) or a human 
c-myc probe (located at bp 1901-2000). The WISP and myc genes are 
detected in normal human genomic DNA after a longer film exposure. 
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Fig. 6. Genomic amplification of WISP genes in human colon 
tumors. The relative gene copy number of the WISP genes in 25 
adenocarcinomas was assayed by quantitative PCR, by comparing 
DNA from primary human tumors with pooled DNA from 10 healthy 
donors. The data are means ± SEM from one experiment done in 
triplicate. The experiment was repeated at least three times. 

assessed by quantitative PCR (Fig. 7). The level of WISP-1 
RNA present in tumor tissue varied but was significantly 
increased (2- to >25-fold) in 84% (16/19) of the human colon 
tumors examined compared with normal adjacent mucosa. 
Four of 19 tumors showed greater than 10-fold overexpression. 
In contrast, in 79% (15/19) of the tumors examined, WISP-2 
RNA expression was significantly lower in the tumor than the 
mucosa. Similar to WISP-1, WISP-3 RNA was overexpressed in 
63% (12/19) of the colon tumors compared with the normal 
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Fig. 7. WISP RNA expression in primary human colon tumors 
relative to expression in normal mucosa from the same patienL 
Expression of WISP mRNA in 19 adenocarcinomas was assayed by 
quantitative PCR. The Dukes stage of the tumor is listed under the 
sample number. The data are means ± SEM from one experiment 
done in triplicate. The experiment was repeated at least twice. 



mucosa. The amount of overexpression of WISP-3 ranged from 
4- to >40-fold. 

DISCUSSION 

One approach to understanding the molecular basis of cancer 
is to identify differences in gene expression between cancer 
cells and normal cells. Strategies based on assumptions that 
steady-state mRNA levels will differ between normal and' 
malignant cells have been used to clone differentially ex- 
pressed genes (31). We have used a PCR-based selection 
strategy, SSH, to identify genes selectively expressed in 
C57MG mouse mammary epithelial cells transformed by 
Wnt-1. 

Three of the genes isolated, WISP-1 , WISP-2, and WISP-3, 
are members of the CCN. family of growth factors, which 
includes CTGF, Cyr61, and nov, a family not previously linked 
to Wnt signaling. 

Two independent experimental systems demonstrated that 
WISP induction was associated with the expression of Wnt-1. 
The first was C57MG cells infected with a Wnt-1 retroviral 
vector or C57MG cells expressing Wnt-1 under the control of. 
a tetracyline-repressible promoter, and the second was in 
Wnt-1 transgenic mice, where breast tissue expresses Wnt-1, 
whereas normal breast tissue does not. No WISP RNA expres- 
sion was detected in mammary tumors induced by polyoma 
virus middle T antigen (data not shown). These data suggest 
a link between Wnt-1 and WISPs in that in these two situations, 
WISP induction was correlated with Wnt-1 expression. 

It is not clear whether the WISPs are directly or indirectly 
induced by the downstream components of the Wnt-1 signaling 
pathway (i.e., /3-catenin-TCF-l/Lefl). The increased levels of 
WISP RNA were measured in Wnt-l-transformed cells, hours 
or days after Wnt-1 transformation. Thus, WISP expression 
could result from Wnt-1 signaling directly through 0-catenin 
transcription factor regulation or alternatively through Wnt-1 
signaling turning on a transcription factor, which in turn 
regulates WISPs. 

The WISPs define an additional subfamily of the CCN family 
of growth factors. One striking difference observed in the 
protein sequence of WISP-2 is the absence of a CT domain, 
which is present in CTGF, Cyr61, nov, WISP-1, and WISP-3. 
This domain is thought to be involved in receptor binding and 
dimerization. Growth factors, such asTGF-ft platelet-derived 
growth factor, and nerve growth factor, which contain a cystine 
knot motif exist as dimers (32). It is tempting to speculate that 
WISP-1 and WISP-3 may exist as dimers, whereas WISP-2 
exists as a monomer. If the CT domain is also important for 
receptor binding, WISP-2 may bind its receptor through a 
different region of the molecule than the other CCN family 
members. No specific receptors have been identified for CTGF 
or nov. A recent report has shown that integrin a v fr serves as 
an adhesion receptor for Cyr61 (33). 

The strong expression of WISP-1 and WISP-2 in cells lying 
within the fibrovascular tumor stroma in breast tumors from 
Wnt-1 transgenic animals is consistent with previous obser- 
vations that transcripts for the related CTGF gene are pri- 
marily expressed in the fibrous stroma of mammary tumors 
(34). Epithelial cells are thought to control the proliferation of 
connective tissue stroma in mammary tumors by a cascade of 
growth factor signals similar to that controlling connective 
- tissue formation during wound repair. It has been proposed 
that mammary tumor cells or inflammatory cells at the tumor 
interstitial interface secrete TGF-/31, which is the stimulus for 
stromal proliferation (34). TGF-01 is secreted by a large 
percentage of malignant breast tumors and may be one of the 
growth factors that stimulates the production of CTGF and 
WISPs in the stroma. 

It was of interest that WISP-1 and WISP-2 expression was 
observed in the stromal cells that surrounded the rumor cells 
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(epithelial cells) in the Wnt-1 transgenic mouse sections of 
breast tissue. This finding suggests that paracrine signaling 
could occur in which the stromal cells could supply WISP-1 and 
WISP-2 to regulate tumor cell growth on the WISP extracel- 
lular matrix. Stromal cell-derived factors in the extracellular 
matrix have been postulated to play a role in tumor cell 
migration and proliferation (35). The localization of WISP-1 
and WISP-2 in the stromal cells of breast tumors supports this 
paracrine model. 

An analysis of WISP-1 gene amplification and expression in 
human colon tumors showed a correlation between DNA 
amplification and overexpression, whereas overexpression of 
WISP-3 RNA was seen in the absence of DNA amplification. 
In contrast, WISP-2 DNA was amplified in the colon tumors, 
but its mRNA expression was significantly reduced in the 
majority of tumors compared with the expression in normal 
colonic mucosa from the same patient. The gene for human 
WISP^ was localized to chromosome 20ql2-20ql3, at a region 
frequently amplified and associated with poor prognosis in 
node negative breast cancer and many colon cancers, suggest- 
ing the existence of one or more oncogenes at this locus 
(36-38). Because the center of the 20ql3 amplicon has not yet 
been identified, it is possible that the apparent amplification 
observed for WISP-2 may be caused by another gene in this 
amplicon. 

A recent manuscript on rCop-1, the rat orthologue of 
WISP-2, describes the loss of expression of this gene after cell 
transformation, suggesting it may be a negative regulator of 
growth in cell lines (16). Although the mechanism by which 
WISP-2 RNA expression is down-regulated during malignant 
transformation is unknown, the reduced expression of WISP-2 
in colon tumors and cell lines suggests that it m2y function as 
a tumor suppressor. These results show that the WISP genes 
are aberrantly expressed in colon cancer and suggest that their 
altered expression may confer selective growth advantage to 
the tumor. 

Members of the Wnt signaling pathway have been impli- 
cated in the pathogenesis of colon cancer, breast cancer, and 
melanoma, including the tumor suppressor gene adenomatous 
polyposis coli and /3-catenin (39). Mutations in specific regions 
of either gene can cause the stabilization and accumulation of 
cytoplasmic /3-catenin, which presumably contributes to hu- 
man carcinogenesis through the activation of target genes such 
as the WISPs. Although the mechanism by which Wnt-1 
transforms cells and induces tumorigenesis is unknown, the 
identification of WISPs as genes that may be regulated down- 
stream of Wnt-1 in C57MG cells suggests they could be 
important mediators of Wnt : l transformation. The amplifica- 
tion and altered expression patterns of the WISPs in human 
colon tumors may indicate an important role for these genes 
in tumor development. , 
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methods. Peptides AENKor AEQKwere dissolved in water, made isotonic with 
NaCl and diluted into RPMI growth medium. T- cell -proliferation assays were 
done essentially as described- 021 . Briefly, after antigen pulsing (JOjigmr' 
TTCF) - with tetrapeptides (l-2mgrar'), PBMCs or EBV-B ceils were 
washed in PBS and fixed for 45 s in 0.05% glutaraldehyde. Glycine was added 
to a final concentration of 0.1 M and the cells were washed five times in RPMI 
1640 medium containing 1% FCS before co-culture with T-ceU clones in 
round-bottom 96-well microtitre plates. After 48 h, the cultures were pulsed 
with 1 u,Ci of 3 H-thymidine and harvested for scintillation counting 16 h later. 
Predigestion of native TTCF was done by incubating 200 u.g TTCF with 0.25 p.g 
pig kidney legumain in 500 u.1 50 mM citrate buffer, pH 5.5, for.l h at 37 °C. 
Glycopeptide digestions. The peptides H1DNEEDI, HIDN(N-glucosamine) 
EEDI and HIDNESDI, which are based on the TTCF sequence, and 
QQQHLFGSNVTDCSGNFCLFR(KKK), which is based on human transferrin, 
were obtained by custom synthesis. The three C-terminal lysine residues were 
added to the natural sequence to aid solubility. The transferrin glycopeptide 
QQQHLFGSNVTDCSGNFCLFR was prepared by tryptic (Promega) digestion 
of 5mg reduced, carboxy-methylated human transferrin followed by 
concanavalin A chromatography". Olycopeptides corresponding to residues 
622-642 and 421-452 were isolated by reverse-phase HPLC and identified by 
mass spectrometry and N-terminal sequencing. The lyophilized transferrin - 
derived peptides were redissolved in 50 mM sodium acetate, pH 5.5, 10 mM 
dithiothreitol, 20% methanol. Digestions were performed for 3 h at 30 °C with 
5-50 mU ml" 1 pig kidney legumain or B-cell AEP. Products were analysed by 
HPLC or MALDI-TOF mass spectrometry using a matrix of lOmgml" 1 a- 
cyanocinnamic acid in 50% acetonitrile/0.1% TFA and a PerSeptive Biosystems 
Elite STR mass spectrometer set to linear or reflector mode. Internal standar- 
dization was obtained with a matrix ion of 568.13 mass units. . 
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Fas ligand (FasL) is produced by activated T cells and natural 
killer cells and it induces apoptosis (programmed cell death) in 
target cells through the death receptor Fas/Apol/CD95 (ref. I). 
One important role of FasL and Fas is to mediate immune- 
cytotoxic killing of cells that are potentially harmful to the 
organism, such as virus-infected or tumour cells'. Here we 
report the discovery of a soluble decoy receptor, termed decoy 
receptor 3 (DcR3), that binds to FasL and inhibits FasL-induced 
apoptosis. The DcR3 gene was amplified in about half of 35 
primary lung and colon tumours studied, and DcR3 messenger 
RNA was expressed in malignant tissue. Thus, certain tumours 
may escape FasL-dependent immune-cytotoxic attack by expres- 
sing a decoy receptor that blocks FasL. 

By searching expressed sequence tag (EST) databases, we identi- 
fied a set of related ESTs that showed homology to the tumour 
necrosis factor (TNF) receptor (TNFR) gene superfamily 2 . Using 
the overlapping sequence, we isolated a previously unknown full- 
length complementary DNA from human fetal lung. We named the 
protein encoded by this cDNA decoy receptor 3 (DcR3). The cDNA 
encodes a 300-amino-acid polypeptide that resembles members of 
the TNFR family (Fig. la): the amino terminus contains a leader 
sequence, which is followed by four tandem cysteine-rich domains 
(CRDs). Like one other TNFR homologue, osteoprotegerin (OPG) 3 , 
DcR3 lacks an apparent transmembrane sequence, which indicates 
that it may be a secreted, rather than a membrane-asscociated, 
molecule. We expressed a recombinant, histidine-tagged form of 
DcR3 in mammalian cells; DcR3 was secreted into the cell culture 
medium, and migrated on polyacrylamide gels as a protein of 
relative molecular mass 35,000 (data not shown). DcR3 shares 
sequence identity in particular with OPG (31%) and TNFR2 
(29%), and has relatively less homology with Fas (17%). All of 
the cysteines in the four CRDs of DcR3 and OPG are conserved; 
however, the carboxy- terminal portion of DcR3 is 101 residues 
shorter. 

We analysed expression of DcR3 mRNA in human tissues by 
northern blotting (Fig. lb). We detected a predominant 1.2-kilobase 
transcript in fetal lung, brain, and liver, and in adult spleen, colon 
and. lung. In addition, we observed relatively high DcR3 mRNA 
expression in the human colon carcinoma cell line SW480. 

To investigate potential ligand interactions of DcR3, we generated 
a recombinant, Fc-tagged DcR3 protein. We tested binding of 
DcR3-Fc to human 293 cells transfected with individual TNF- 
family ligands, which are expressed as type 2 transmembrane 
proteins (these transmembrane proteins have their N termini in 
the cytosol). DcR3-Fc showed a significant increase in binding to 
cells transfected with FasL' (Fig. 2a), but not to cells transfected with 
TNF 5 , Apo2L/TRAIL 6,7 , Apo3L/TWEAK 8v , or OPGL/TRANCE/ 
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RANKL 10 "' 2 (data not shown). DcR3-Fc immunoprecipitated shed 
FasL from FasL-transfected 293 cells (Fig. 2b) and purified soluble 
FasL (Fig. 2c), as did the Fc-tagged ectodomain of Fas but not 
TNFR1. Gel-filtration chromatography showed that DcR3-Fc and 
soluble FasL formed a stable complex (Fig. 2d). Equilibrium 
analysis indicated that DcR3-Fc and Fas-Fc bound to soluble 
FasL with a comparable affinity (K d = 0.8 ± 0.2 and 
i.l±0.lnM, respectively; Fig. 2e), and that DcR3-Fc could 
block nearly all of the binding of soluble FasL to Fas-Fc (Fig. 2e, 
inset). Thus, DcR3 competes with Fas for binding to FasL. 

To determine whether binding of DcR3 inhibits FasL activity, we 
tested the effect of DcR3-Fc on apoptosis induction by soluble 
FasL in Jurkat T leukaemia cells, which express Fas (Fig; 3a). DcR3- 
Fc and Fas-Fc blocked soluble-FasL-induced apoptosis in a 
similar dose-dependent manner, with half-maximal inhibition at 
—0.1 pLgml" 1 . Time-course analysis showed that the inhibition did 
not merely delay cell death, but rather persisted for at least 24 hours 
(Fig. 3b). We also tested the effect of DcR3-Fc on activation- 
induced cell death (AICD) of* mature T lymphocytes, a FasL- 
dependent process'. Consistent with previous results 13 , activation 
of interleuicin-2-stimuIated CD4-positive T cells with anti-CD3 
antibody increased the level of apoptosis twofold, and Fas-Fc 
blocked this effect substantially (Fig. 3c); DcR3-Fc blocked the 
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Figure 1 Primary structure and expression of human DcR3. a. Alignment of the 
amino-acid sequences of DcR3 and of osteoprotegerin (OPG); the C-terminal 101 
residues of OPG are not shown. The putative signal cleavage site (arrow), the 
cysteine-rich domains (CRD 1 -4), and the/V-linkedglycosyiation site (asterisk) are 
shown, b. Expression of DcR3 mRNA. Northern hybridization analysis was done 
using the DcR3 cDNA as a probe and blots of poly(A)* RNA (Clontech) from 
human fetal and adult tissues or cancer cell lines. PBL. peripheral blood 
lymphocyte: 



induction of apoptosis to a similar extent. Thus, DcR3 binding 
blocks apoptosis induction by FasL. 

FasL-induced apoptosis is important in elimination of virus- 
infected cells and cancer cells by natural killer cells and cytotoxic T 
lymphocytes; an alternative mechanism involves perforin and 
granzymes 1J4 -"\ Peripheral blood natural killer cells triggered 
marked cell death in Jurkat T leukaemia cells (Fig. 3d); DcR3-Fc 
and Fas-Fc each reduced killing of target cells from —65% to 
—30%, with half-maximal inhibition at —1 u-gmf 1 ; the residual 
killing was probably mediated by the perforin/granzyme pathway. 
Thus, DcR3 binding blocks FasL-dependent natural killer cell 
activity. Higher DcR3-Fc and r Fas-Fc concentrations were required 
to block natural killer cell activity compared with those required to 
block soluble FasL activity, which is consistent with the greater 
potency of membrane-associated FasL compared with soluble 
FasL 17 . 

Given the role of immune-cytotoxic cells in elimination of 
tumour cells and the fact that DcR3 can act as an inhibitor of 
FasL, we proposed that DcR3 expression might contribute to the 
ability of some tumours to escape immune-cytotoxic attack. As 
genomic amplification frequently contributes to tumorigenesis, we 
investigated whether the DcR3 gene is amplified in cancer. We 
analysed DcR3 gene-copy number by quantitative polymerase chain 
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Figure 2 Interaction of DcR3 with FasL a, 293 cells were transfected with pRK5 
vector (top) or with pRK5 encoding full-length FasL (bonom), incubated with 
0cR3-Fc (solid line, shaded area), TNFR1-Fc (dotted line) or buffer control 
(dashed line) (the dashed and dotted lines overlap), and analysed for binding by 
FACS. Statistical analysis showed a significant difference [P < 0.001 ) between the 
binding of DcR3-Fc to cells transfected with FasL or pRKS. PE. phycoerythrirv 
labelled cells, b, 293 cells were transfected as in a and metabolically labelled, and 
cell supernatants were immunoprecipitated with Fc-tagged TNFR1. DcR3 or Fas. 
c, Purified soluble FasL (sFasL) was immunoprecipitated with TNFR 1 -Fc. DcR3- 
Fc or Fas-Fc and visualized by immunoblot with anti-Fas L antibody. sFasL was 
loaded directly for comparison in the right-hand lane, d. Flag-tagged sFasL was 
incubated with DcR3-Fc or with buffer and resolved by gel filtration; column 
fractions were analysed in an assay that detects complexes containing DcR3-Fc 
and sFasL-Flag. e, Equilibrium binding of DcR3-Fc or Fas-Fc to sFasL-Flag. 
Inset, competition of DcR3-Fc with Fas-Fc for binding to sFasL-Flag. 
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reaction (PCR) 18 in genomic DNA from 35 primary lung and colon 
tumours, relative to pooled genomic DNA from peripheral blood 
leukocytes (PBLs) of 10 healthy donors. Eight of 18 lung tumours 
and 9 of 17 colon tumours showed DcR3 gene amplification, 
ranging from 2- to 18-fold (Fig. 4a, b). To confirm this result, we 
analysed the colon tumour DNAs with three more, independent sets 
of DcR3 -based PCR primers and probes; we observed nearly the 
same amplification (data not shown). 

We then analysed DcR3 mRNA expression in primary tumour 
tissue sections by in situ hybridization. We detected DcR3 expres- 
sion in 6 out of 15 lung tumours, 2 out of 2 colon tumours, 2 out of 5 
breast tumours, and 1 out of 1 gastric tumour (data not shown). A 
section through a squamous-cell carcinoma of the lung is shown in 
Fig. 4c. DcR3 mRNA was localized to infiltrating malignant epithe- 
lium, but was essentially absent from adjacent stroma, indicating 
tumour-specific expression. Although the individual tumour speci- 
mens that we analysed for mRNA expression and gene amplification 
were different, the in situ hybridization results are consistent with 
the finding that the DcR3 gene is amplified frequently in tumours. 
SW480 colon carcinoma cells, which showed abundant DcR3 
mRNA expression (Fig. lb), also had marked DcR3 gene amplifica- 
tion, as shown by quantitative PCR (fourfold) and by Southern blot 
hybridization (fivefold) (data not shown). 

If DcR3 amplification in cancer is functionally relevant, then 
DcR3 should be amplified more than neighbouring genomic 
regions that are not important for tumour survival. To test this, 
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Figure 3 Inhibition of FasL activity by DcR3. a, Human Jurkat T leukaemia cells 
were incubated with Flag-tagged soluble FasL (sFasUSngmr 1 ) oligomerized 
with anti-Flag antibody [0.1 iigmr 1 ) in the presence of the proposed inhibitors 
DcR3-Fc. Fas-Fc or human IgGl arid assayed for apoptosis (mean ± s.e.m. of 
triplicates), b, Jurkat cells were incubated with sFasL-Flag plus anti-Flag antibody 
as in a, in presence of 1 u.g ml -1 DcR3-Fc (rilled circles), Fas-Fc (open circles) or 
human IgGl (triangles), and apoptosis was determined at the indicated time 
points, c. Peripheral blood T cells were stimulated with PHA and interleukin-2, 
followed by control (white bars) or anti-CD3 antibody (filled bars), together with 
phosphate-buffered saline (PBS), human IgGl, Fas-Fc, or DcR3-Fc (lOjig m|- t ). 
After 16 h, apoptosis of CD4* cefls was determined (mean ± s.em of results from 
five donors), d, Peripheral blood natural killer cells were incubated with 5, Cr- 
labelled Jurkat cells in the presence of DcR3-Fc (filled circles). Fas-Fc (open 
circles) or human IgGl (triangles), and target-cell death was determined by 
release of 5, Cr (mean ± s.d. for two donors, each in triplicate).- 



we mapped the human DcR3 gene by radiation-hybrid analysis; 
DcR3 showed linkage to marker AFM2 1 8xe7 (T160), which maps to 
chromosome position 20ql3. Next, we isolated from a bacterial 
artificial chromosome (BAC) library a human genomic done that 
carries DcR3, and sequenced the ends of the clone s insert. We then 
determined, from the nine colon tumours that showed twofold or 
greater amplification of DcR3> the copy number of the DcR3- 
flanking sequences (reverse and forward) from the BAC, and of 
seven genomic markers that span chromosome 20 (Fig. 4d). The 
DcR3-linked reverse marker showed an average amplification of 
roughly threefold, slightly less than the approximately fourfold 
amplification of DcR3; the other markers showed little or no 
amplification. These data indicate that DcR3 may be at the 'epi- 
centre* of a distal chromosome 20 region that is amplified in colon 
cancer, consistent with the possibility that DcR3 amplification 
promotes tumour survival. ' . 

Our results show that DcR3 binds specifically to FasL and inhibits 
FasL activity. We did not detect DcR3 binding to several other TNF- 
ligand-family members; however, this does not rule out the possi- 
bility that DcR3 interacts with other ligands, as do some other 
TNFR family members, including OPG U9 . 

FasL is important in regulating the immune response; however, 
little is known about how FasL function is controlled. One mechan- 
ism involves the molecule cFLIP, which modulates apoptosis signal- 
ling downstream of Fas 20 . A second mechanism involves proteolytic 
shedding of FasL from the cell surface 17 . DcR3 competes with Fas for 
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Figure 4 Genomic amplification of DcR3 in tumours, a, Lung cancers, comprising 
eight adenocarcinomas (c. d. f, g, h, j, k, r), seven squamous-cell carcinomas (a, e, 
m, n, o. p, q), one non-small-cell carcinoma (b), one small-cell carcinoma (i). and 
one bronchial adenocarcinoma (I). The data are means ± s.d. of 2 experiments 
done in duplicate, b, Colon tumours, comprising 17 adenocarcinomas. Data are 
means * s.e.m. of five experiments done in duplicate, c. in situ hybridization 
analysis of DcR3 mRNA expression in a squamous-cell carcinoma of the lung. A 
representative bright-field image (left) and the corresponding dark-field image 
(right) show DcR3 mRNA over infiltrating malignant epithelium (arrowheads). 
Adjacent non-malignant stroma (S), blood vessel (V) and necrotic tumour tissue 
(N) are also shown, d, Average amplification of DcR3 compared with amplifica- 
tion of neighbouring genomic regions (reverse and forward. Rev and Fwd). the 
DcR3-linked marker T160, and other chromosome-20 markers, in the nine colon 
tumours showing DcR3 amplification of twofold or more (b). Data are from two 
experiments done in duplicate. Asterisk indicates P < 0.01 for a Student's f-test 
comparing each marker with DcR3. 
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FasL binding; hence, it may represent a third mechanism of 
extracellular regulation of FasL activity. A decoy receptor that 
modulates the function of the cytokine interleukin- 1 has been 
described 21 . In' addition, two decoy receptors that belong to the 
TNFR family, DcRl and DcR2, regulate the FasL- related apoptosis- 
inducing molecule Apo2L". Unlike DcRl and DcR2, which are 
membrane-associated proteins, DcR3 is directly secreted into the 
extracellular space. One other secreted TNFR-family member is 
OPG\ which shares greater sequence homology with DcR3 (31%) 
than do DcRl (17%) or DcR2 (19%); OPG functions as a third 
decoy for Apo2L 19 . Thus, DcR3 and OPG define a new subset of 
TNFR-family members that function as secreted decoys to mod- 
ulate ligands that induce apoptosis. Pox viruses produce soluble 
TNFR homologues that neutralize specific TNF-family ligands, 
thereby modulating the antiviral immune response 2 . Our results 
indicate that a similar mechanism, namely, production of a soluble 
decoy receptor for FasL, may contribute to immune evasion by 
certain tumours. Q 



Methods 

Isolation of DcR3 cONA. Several overlapping ESTs in GenBank (accession 
numbers AA025672, AA025673 and W67560) and in Lifeseq™ (Incyte 
Pharmaceuticals; accession numbers 1339238, 1533571, 1533650, 1542861, 
1789372 and 2207027) showed similarity to members of the TNFR family. We 
screened human cDNA libraries by PCR with primers based on the region of 
EST consensus; fetal lung was positive for a product of the expected size. By 
hybridization to a PCR-generated probe based on the ESTs, one positive clone 
(DNA30942) was identified. When searching for potential alternatively spliced 
forms of DcR3 that might encode a transmembrane protein, we isolated 50 
more clones; the codirigfregibns of these clones were identical in size to that of 
the initial clone (data not shown). 

Fc-fusion proteins (immunoadhesins). The entire DcR3 sequence, or the 
ectodomain of Fas or TNFR1, was fused to the hinge and Fc region of human 
IgGl, expressed in insect SF9 cells or in human 293 cells, and purified as 
described 23 . >. 1 

Fluorescence-activated cell sorting (FACS) analysis. We transfected 293 
cells using calcium phosphate or Effectene (Qiagen) with pRK5 vector or pRK5 
encoding full-length human FasL 4 (2 u,g), together with pRK5 encoding CrmA 
(2 jig) to prevent cell death. After 16 h, the cells were incubated with 
biotinylated DcR3-Fc orTNFRl.-Fc and then with phycoerythrin-conjugated 
streptavidiri (GibcoBRL); and were assayed by FACS. The data were analysed by 
Kolmogorov-Smirnov statistical analysis. There was some detectable staining 
of vector-transfected cells by DcR3-Fc; as these cells express little FasL (data 
not shown), it is possible that DcR3 recognized some other factor that is 
expressed constitu Lively on 293 cells. 

I mmu no precipitation. Human 293 cells were transfected as above, and 
metabolically labelled with [ 3S Slcysteine and [ J5 S]methionine (0.5 mCi; 
Amersham). After 16 h of culture in the presence of z-VAD-fmk (10u.M), 
the medium was immunoprecipitated with DcR3-Fc, Fas-Fc or TNFRl-Fc 
(5 jjtg), followed by protein A-Sepharose (Repligen). The precipitates were 
resolved by SDS-PAGE and visualized on a phosphorimager (Fuji BAS2000). 
Alternatively, purified, Flag-tagged soluble FasL (1 u,g) (Alexis) was incubated 
with each Fc-fusion protein (1 u,g), precipitated with protein A-Sepharose, 
resolved by SDS-PAGE and visualized by immunoblotting with rabbit anti- 
Fas L antibody (Oncogene Research). 

Analysis of complex formation. Flag-tagged soluble FasL (25u.g) was 
incubated with buffer or with DcR3-Fc (40 u.g) for 1.5 h at 24 °C. The reaction 
was loaded onto a Superdex 200 HR 10/30 column (Pharmacia) and developed 
with PBS; 0.6- ml fractions were collected. The presence of DcR3-Fc-FasL 
complex in each fraction was analysed by placing 100 fxl aliquots into microtitre 
wells precoated with anti-human IgG (Boehringer) to capture DcR3-Fc, 
followed by detection with biotinylated anti-Flag antibody Bio M2 (Kodak) and 
streptavidin-horseradish peroxidase (Amersham). Calibration of the column 
indicated an apparent relative molecular mass of the complex of 420K (data not 
shown), which is consistent with a stoichiometry of two DcR3-Fc homodimers 
to two soluble FasL homotrimers. 

Equilibrium binding analysis. Microtitre wells were coated with anti-human 
702 



IgG, blocked with 2% BSA in PBS- DcR3-Fc or Fas-Fc was added, followed by 
serially diluted Flag-tagged soluble FasL. Bound ligand was detected with anti- 
Flag antibody as above. In the competition assay. Fas-Fc was immobilized as 
above, and the wells were blocked with excess IgGl before addition of Flag- 
tagged soluble FasL plus DcR3-Fc. 

T-cell AICD. CD3* lymphocytes were isolated from peripheral blood of 
individual donors using anti-CD3 magnetic beads (Miltenyi Biotech), 
stimulated with phytohaemagglutinin (PHA; 2 u-g ml" 1 ) for 24 h, and cultured' 
in the presence of interleukin-2 ( 100 U mP 1 ) for 5 days. The cells were plated in 
wells coated with anti-CD3 antibody (Pharmingen) and analysed for apoptosis 
16 h later.by FACS analysis of annexin- V-binding of CD4* cells". 
Natural killer cell activity. Natural killer cells were isolated from peripheral 
blood of individual donors using anti-CD56 magnetic beads (Miltenyi 
Biotech), and incubated for 16 h with sl Cr-loaded Jurkat cells at an effector- 
to-target ratio of 1:1 in the presence of DcR3-Fc, Fas-Fc or human IgGl. 
Target-cell death was determined by release of *'Cr in effector-target co- 
cultures relative to release of 51 Cr by detergent lysis of equal numbers of Jurkat 
cells. 

Gene-amplification analysis. Surgical specimens were provided by J. Kern 
(lung tumours) and P. Quirke (colon tumours). Genomic DNA was extracted 
(Qiagen) and the concentration was determined using Hoechst dye 33258 
intercalation fluorometry. Amplification was determined by quantitative PCR 1 " 
using a TaqMan instrument (ABI). The method was validated by comparison of 
PCR and Southern hybridization data for the Myc and HER- 2 oncogenes (data 
not shown). Gene-specific primers and fluorogenic probes were designed on 
the basis of the sequence of DcR3 or of nearby regions identified on a BAC 
carrying the human DcR3 gene; alternatively, primers and probes were based 
on Stanford Human Genome Center marker AFM218xe7 (T160), which is 
linked to DcR3 (likelihood score = 5.4), SHGC-36268 (T159), the nearest 
available marker which maps to —500 kilobases from T160, and five extra 
markers that span chromosome 20. The DcR3-specific primer sequences were 
5'-CTTCTTCGCGCACGCTG-3' and 5'-ATCACGCCGGCACCAG-3' and the 
fluorogenic probe sequence was 5 ' -( FAM-ACACG ATGCGTGCTCCAAGCAG 
AAp-(TAMARA), where FAM is 5 '-fluorescein phosphoramidite. Relative 
gene-copy numbers were derived using the formula 2 <iCT \ where ACT is the 
difference in amplification cycles required to detect DcR3 in peripheral blood 
lymphocyte DNA compared to test DNA. 
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ABC transporters (also known as traffic ATPases) form a large 
family of proteins responsible for the translocation of a variety 
of compounds across membranes of both prokaryotes and 
eukaryotes 1 . The recently completed Escherichia coli genome 
sequence revealed that the largest family of paralogous E coli 
proteins is composed of ABC transporters 2 . Many eukaryo tic 
proteins of medical significance belong to this family, such as 
the cystic fibrosis transmembrane conductance regulator (CFTR), 
the P-glycoprotein (or multidrug-resistance protein) and the 
heterodimeric transporter associated with antigen processing 
(Tapl-Tap2). Here we report the crystal structure at 1.5 A resolu- 
tion of HisP, the ATP-binding subunit of the histidine permease, 
which is an ABC transporter from Salmonella typhimurium. We 
correlate the details of this structure with the biochemical, genetic 
and biophysical properties of the wild-type and several mutant 
HisP proteins. The structure provides a basis for understanding 
properties of ABC transporters and of defective CFTR proteins. 

ABC transporters contain four structural domains: two nucleo- 
tide-binding domains (NBDs), which are highly conserved 
throughout the family, and two transmembrane domains 1 . In 
prokaryotes these domains are often separate subunits which are 
assembled into a membrane-bound complex; in eukaryotes the 
domains are generally fused into a single polypeptide chain. The 
periplasmic histidine permease of 5. typhimurium and E co/i' J-B is a 
well -characterized ABC transporter that is a good model for this 
superfamily. It consists of a membrane-bound complex, HisQMP 2 , 
which comprises integral membrane subunits, HisQ and HisM, and 
two copies of HisP, the ATP-binding subunit. HisP, which has 
■properties intermediate between those of integral and peripheral 
membrane proteins 9 , is accessible from both sides of the membrane, 
. presumably by its interaction with HisQ and HisM 6 . The two HisP 
subunits form a dimer, as shown by their cooperativity in ATP 
hydrolysis 5 , the requirement for both subunits to be present for 
activity 8 , and the formation of a HisP dimer upon chemical cross- 
linking. Soluble HisP also forms a dimer 3 . HisP has been purified 
and characterized in an active soluble form 3 which can be recon- 
stituted into a fully active membrane-bound complex 8 . 

The overall shape of the crystal structure of the HisP monomer is 
that of an T with two thick arms (arm I and arm II); the ATP- 
binding pocket is near the end of arm I (Fig. 1). A six-stranded p- 
sheet (p3 and p8-p 12) spans both arms of the L, with a domain of a 
a- plus p-rype structure (pi, p2, p4-p7, al and ct2) on one side 
(within arm I) and a domain of mostly a-helices (a3-a9) on the 




Figure 1. Crystal structure of HisP a, View of the dimer along an axis 
perpendicular to its two-fofd axis. The top and bottom of the dimer are suggested 
to face towards the periplasmic and cytoplasmic sides, respectively (see text). 
The thickness of arm II is about 25 A, comparable to that of membrane. a-Helices 
are shown in orange and p-sheets in green, b, View atong the two-fold axis of the 
HisP dimer, showing the relative displacement of the monomers not apparent in 
a. The 0-strands at the dimer interface are labelled, c. View of one monomer from 
the bottom of arm I, as shown in a. towards arm II. showing the ATP-binding 
pocket, a-c, The protein and the bound ATP are in 'ribbon' and 'ball-and-stick' 
representations, respectively. Key residues discussed in the text are indicated in 
c. These figures were prepared with MOLSCRIPT 29 . N, amino terminus; C, C 
terminus. 
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Gene amplification is a common event in the progression off 
human cancers, and amplified oncogenes have been shown to 
have diagnostic, prognostic and therapeutic relevance. A 
kinetic quantitative polymerase-chain-reaction (PCR) method, 
based on fluorescent TaqMan methodology and a new instru- 
ment (ABI Prism 7700 Sequence Detection System) capable 
of measuring fluorescence in real-time, was used to quantify 
gene amplification in tumor DNA. Reactions are character- 
ized by the point during cycling when PCR amplification is still 
in the exponential phase,' rather than the amount of PCR 
product accumulated after a fixed number of cycles. None of 
the reaction components is limited during the exponential 
phase, meaning that values are highly reproducible in reac- 
tions starting with the same copy number. This greatly 
improves the precision of DNA quantification. Moreover, 
real-time PCR does not require post-PCR sample handling, 
thereby preventing potential PCR-product carry-over con- 
tamination; it possesses a wide dynamic range of quantifica- 
tion and results in much faster and higher sample throughput. 
The real-time PCR method, was used to develop and validate 
a simple and rapid assay for the detection and quantification 
of the 3 most frequently amplified genes (myc, ccndl and 
erbB2) in breast tumors. Extra copies of myc, ccndl and erbBZ 
were observed in 10, 23 and 15%, respectively, of 108 breast- 
tumor DNA; the largest observed numbers of gene copies 
were 4.6, 18.6 and 15.1, respectively. These results correlated 
well with those of Southern blotting. The use of this new 
semi-automated technique wilt make molecular analysis of 
human cancers simpler and more reliable, and should find 
broad applications in clinical and research settings. Int. J. 
Cancer 78:661 -666, 1 998. 
© 1998 Wilcy-Uss. Inc. 

Gene amplification plays an important role in the pathogenesis 
of various solid tumors, including breast cancer, probably because 
over-expression of the amplified target genes confers a selective 
advantage. The first technique used to detect genomic amplification 
was cytogenetic analysis. Amplification of several chromosome 
regions, visualized either as extrachromosomal double minutes 
(dmins) or as integrated homogeneously staining regions (HSRs), 
are among the main visible cytogenetic abnormalities in breast 
tumors. Other techniques such as comparative genomic hybridiza- 
tion (CGH) (Kallioniemi et a!., 1994) have also been used in broad 
searches for regions of increased DNA copy numbers in tumor 
cells, and have revealed some 20 amplified chromosome regions in 
breast tumors. Positional cloning efforts are underway to identify 
the critical gene(s) in each amplified region. To date, genes known 
. to be amplified frequently in breast cancers include mvc (8q24), 
ccndl ( 1 1 q 1 3), and erbB2 ( 1 7q 1 2-q2 1 ) (for review, see Bieche and 
Lidereau, 1995). 

Amplification of the. myc, ccndl, and erbBl proto-oncogenes 
should have clinical relevance in breast cancer, since independent 
studies have shown that these alterations can be used to identify 
sub-populations with a worse prognosis (Bems et ai, J 992; 
Schuuring et ai, 1992; Stamon et ai, 1987). Muss et ai (1994) 
suggested that these gene alterations may also be useful for the 
prediction and assessment of the efficacy of adjuvant chemotherapy 
and hormone therapy. 

However, published results diverge both in terms of the fre- 
quency of these alterations and their clinical value. For instance, 
over 500 studies in 10 years have failed to resolve the controversy 



surrounding the link suggested by Slamon et ai (1987) between 
erbBl amplification and disease progression. These discrepancies 
are partly due to the clinical, histological and ethnic heterogeneity 
of breast cancer, but technical considerations are also probably 
involved. 

Specific genes (DNA) were initially quantified in tumor cells by 
means of blotting procedures such as Southern and slot blotting. 
These batch techniques require large amounts of DNA (5-10 
ug/reaction) to yield reliable quantitative results. Furthermore, 
meticulous care is required at all stages of the procedures to 
generate blots of sufficient quality for reliable dosage analysis. 
Recently, PCR has proven to be a powerful tool for quantitative 
DNA analysis, especially with minimal starting quantities of tumor 
samples (small, early-stage tumors and formalin-fixed, paraffin- 
embedded tissues). 

Quantitative PCR can be performed by evaluating the amount of 
product either after a given number of cycles (end-point quantita- 
tive PCR) or after a varying number of cycles during the 
exponential phase (kinetic quantitative PCR). In the first case, an 
internal standard distinct from the target molecule is required to 
ascertain PCR efficiency. The method is relatively easy but implies 
generating, quantifying and storing an internal standard for each 
gene studied. Nevertheless, it is the most frequently applied 
method to date. 

One of the major advantages of the kinetic method is its rapidity 
in quantifying a new gene, since no internal standard is required (an 
external standard cu^e- is sufficient). Moreover, the kinetic method 
has a wide dynamic range (at least 5 orders of magnitude), giving 
an accurate value for samples differing in their copy number. 
Unfortunately, the method is cumbersome and has therefore been 
rarely used. It involves aliquot sampling of each assay mix at 
regular intervals and quantifying, for each aliquot, the amplifica- 
tion product. Interest in the kinetic method has been stimulated by a 
novel approach using fluorescent TaqMan methodology and a new 
instrument (ABI Prism 7700 Sequence Detection System) capable 
of measuring fluorescence in real time (Gibson et ai, 1 996; Heid et 
aL, 1996). The TaqMan reaction is based on the 5' nuclease assay 
first described by Holland et ai (1991). The latter uses the 5' 
nuclease activity of Taq polymerase to cleave a specific fluorogenic 
oligonucleotide probe during the extension phase of PCR. The 
approach uses dual-labeled fluorogenic hybridization probes (Lee 
et ai, 1993). One fluorescent dye, co-valently linked to the 5' end 
of the oligonucleotide, serves as a reporter [FAM (i.e., 6-carboxy- 
fluorescein)] and its emission spectrum is quenched by a second 
fluorescent dye, TAMRA '(i.e., 6-carboxy-tetramethyl-rhodamine) 
attached to the 3' end. During the extension phase of the PCR 
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cycle, the fluorescent hybridization probe is hydrolyzed by the 
5 '-3' nucleolytic activity of DNA polymerase. Nuclease degrada- 
tion of the probe releases the quenching of FAM fluorescence 
emission, resulting in an increase in peak fluorescence emission. 
The fluorescence signal is normalized by dividing the emission 
intensity of the reporter dye (FAM) by the emission intensity of a 
reference dye {i.e., ROX, 6-carboxy-X-rhodamine) included in 
TaqMan buffer, to obtain a ratio defined as the Rn (normalized 
reporter) for a given reaction tube. The use of a sequence detector 
enables the fluorescence spectra of all 96 wells of the thermal 
cycler to be measured continuously during PCR amplification. 

The real-time PCR method offers several advantages over other 
current quantitative PCR methods (Celi et al, 1994): (i) the 
probe-based homogeneous assay provides a real-time method for 
detecting only specific amplification products, since specific hybri- 
dation of both the primers and the probe is necessary to generate a 
signal; (ii) the Q (threshold cycle) value used for quantification is 
measured when PCR amplification is still in the log phase of PCR 
product accumulation. This is the main reason why Q is a more 
reliable measure of the starting copy number than are end-point 
measurements, in which a slight difference in a limiting component 
can have a drastic effect on the amount of product; (in) use of C, 
values gives a wider dynamic range (at least 5 orders of magni- 
tude), reducing the need for serial dilution; (iv) The real-time PCR 
method is run in a closed-tube system and requires no post-PCR 
sample handling, thus avoiding potential contamination; (v) the 
system is highly automated, since the instrument continuously 
measures fluorescence in all 96 wells of the thermal cycler during 
PCR amplification and the corresponding software processes, and 
analyzes the fluorescence data; (vi) the assay is rapid, as results are 
available just one minute after thermal cycling is complete; (vii) the 
sample throughput of the method is high, since 96 reactions can be 
analyzed in 2 hr. 

Here, we applied this semi-automated procedure to determine 
the copy numbers of the 3 most frequently amplified genes in breast 
rumors (myc, ccndl and erbBZ), as well as 2 genes (alb and app) 
located in a chromosome region in which no genetic changes have 
been observed in breast tumors. The results for 108 breast tumors 
were compared with previous Southern-blot data for the same 
samples. 



MATERIAL AND METHODS 
Tumor and blood samples 

Samples were obtained from 1 08 primary breast tumors removed 
surgically from patients at the Centre Rene Huguenin; none of the 
patients had undergone radiotherapy or chemotherapy. Immedi- 
ately after surgery, the tumor samples were placed in liquid 
nitrogen until extraction of high-molecular-weight DNA. Patients 
were included in this study if the tumor sample used for DNA 
preparation contained more than 60% of tumor cells (histological 
analysis). A blood sample was also taken from 18 of the same 
patients. 

DNA was extracted from tumor tissue and blood leukocytes 
according to standard methods. 

Real-time PCR 

Theoretical basis. Reactions are characterized by the point 
during cycling when amplification of the PCR product is first 
detected, rather than by the amount of PCR product accumulated 
after a fixed number of cycles. The higher the starting copy number: 
of the genomic DNA target, the earlier a significant increase in 
fluorescence is observed. The parameter C t (threshold cycle) is 
defined as the fractional cycle number at which the fluorescence 
generated by cleavage of the probe passes a fixed threshold above 
baseline. The target gene copy number in unknown samples is 
quantified by measuring C, and by using a standard curve to 
determine the starting copy number. The precise amount of 
genomic DNA (based on optical density) and its quality (i.e., lack 



of extensive degradation) are both difficult to assess. We therefore 
also quantified a control gene (alb) mapping to chromosome region 
4q 1 1 -q 1 3. in which no genetic alterations have been found in 
breast-rumor DNA by means of CGH (Kallioniemi et al, 1 994). 

Thus, the ratio of the copy number of the target gene to the copy 
number of the alb gene normalizes the amount and quality of 
genomic DNA. The ratio defining the level of amplification is 
termed "N", and is determined as follows: 

copy number of target gene (app, myc, ccndl. erbBl) 

N = = . 

copy number of reference gene (alb) 

Primers, probes, reference human genomic DNA and PCR 
consumables. Primers and probes were chosen with the assistance 
of the computer programs Oligo 4.0 (National Biosciences, Ply- 
mouth, MN), EuGene (Daniben Systems, Cincinnati, OH) and Primer 
Express (Perkin-Elmer Applied Biosystems, Foster City, CA). 

Primers were purchased from DNAgency (Malvern, PA) and 
probes from Perkin-Elmer Applied Biosystems. 

Nucleotide sequences for the oligonucleotide hybridization 
probes and primers are available on request. 

The TaqMan PCR Core reagent kit, MicrqAmp optical tubes, 
and MicroAmp caps were from Perkin-Elmer Applied Biosystems. 

Standard-curve construction. The kinetic method requires a 
standard curve. The latter was constructed with serial dilutions of 
specific PCR products, according to Piatak et al. (1993). In 
practice, each specific PCR product was obtained by amplifying 20 
ng of a standard human genomic DNA (Boehringer, Mannheim, 
Germany) with the same primer pairs as those used later for 
real-time quantitative PCR. The 5 PCR products were purified 
using MicroSpin S-400 HR columns (Pharmacia, Uppsala, Swe- 
den) electrophorezed through an acrylamide gel and stained with 
ethidium bromide to check their quality. The PCR products were 
then quantified spectrophotometrically and pooled, and serially 
diluted 1 0-fold in mouse genomic DNA (Clontech, Palo Alto, CA) 
at a constant concentration of 2 ng/ul. The standard curve used for 
real-time quantitative PCR was based on serial dilutions of the pool 
of PCR products ranging from 10" 7 (10 s copies of each gene) to 
10" !0 (10 2 copies). This series of diluted PCR products was 
aliquoted and stored at -80°C until use. 

The standard curve was validated by analyzing 2 known 
quantities of calibrator human genomic DNA (20 ng and 50 ng). 

PCR amplification. Amplification mixes (50 ul) contained the 
sample DNA (around 20 ng, around 6600 copies of disomic genes), 
10X TaqMan buffer (5 ul), 200 uM dATP, cJCTP, dGTP, and 400 
HM dUTP, 5 mM MgCl 2 , 1 .25 units of AmpliTaq Gold, 0.5 units of 
AmpErase uracil N-glycosylase (UNG), 200 nM each primer and 
100 nM probe. The thermal cycling conditions comprised 2 min at 
50°C and 10 min at 95°C. Thermal cycling consisted of 40 cycles at 
95°C for 15 s and 65°C for 1 min. Each assay included: a standard 
curve (from 10 5 to 10 2 copies) in duplicate, a no-template control, 
20 ng and 50 ng of calibrator human genomic DNA (Boehringer) in 
triplicate, and about 20 ng of unknown genomic DNA in triplicate 
(26 samples can thus be analyzed on a 96-well microplate). All 
samples with a coefficient of variation (CV) higher than 10% were 
retested. 

All reactions were performed in the ABI Prism 7700 Sequence 
Detection System (Perkin-Elmer Applied Biosystems), which 
detects the signal from the fluorogenic probe during PCR. 

Equipment for real-time detection. The 7700 system has a 
built-in thermal cycler and a laser directed via fiber optical cables 
to each of the 96 sample wells. A charge-coupled-device (CDD) 
camera collects the emission from each sample and the data are 
analyzed automatically. The software accompanying the 7700 
system calculates Q and determines the starting copy number in the 
samples. 



GENE AMPLIFICATION BY REAL-TIME PCR 



663 



Determination of gene amplification. Gene amplification was 
calculated as described above. Only samples with an N value 
higher than 2 were considered to be amplified. 

RESULTS 

To validate the method, real-time PCR was performed on 
genomic DNA extracted from 108 primary breast tumors, and 18 
normal leukocyte DNA samples from some of the same patients. 
The target genes were the myc, ccndl and erbB2 proto-oncogenes, 
and the p-amyloid precursor protein gene (app), which maps to a 
chromosome region (21q21.2) in which no genetic alterations have 
been found in breast tumors (Kallioniemi et ai, 1994). The 
reference disomic gene was the albumin gene (alb, chromosome 
4qll-ql3). 



Validation of the standard curve and dynamic range 
of real-time PCR 

The standard curve was constructed from PCR products serially 
diluted in genomic mouse DNA at a constant concentration of 
2 ng/ul It should be noted that the 5 primer pairs chosen to analyze 
the 5 target genes do not amplify genomic mouse DNA (data not 
shown). Figure 1 shows the real-time PCR standard curve for the 
alb gene. The dynamic range was wide (at least 4 orders of 
magnitude), with samples containing as few as I0 3 copies or as 
many as 10 5 copies. 

Copy-number ratio of the 2 reference genes fapp and alb) 

The app to alb copy-number ratio was determined in 1 8 norma! 
leukocyte DNA samples and all 108 primary breast-tumor DNA 
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samples. We selected these 2 genes because they are located in 2 
chromosome regions (app, 2Iq21.2; alb, 4qll-ql3) in which no 
obvious genetic changes (including gains or losses) have been 
observed in breast cancers (Kallioniemi et ai, 1994). The ratio for 
the 1 8 normal leukocyte DNA samples fell between 0.7 and 1.3 
(mean 1.02 ± 0.21), and was similar for the 108 primary : breast- 
tumor DNA samples (0.6 to 1.6, mean 1.06 ± 0.25), confirming 
that alb and app are appropriate reference disomic genes for 
breast-tumor DNA. The low range of the ratios also confirmed that 
the nucleotide sequences chosen for the primers and probes were 
not polymorphic, as mismatches of their primers or probes with the 
subject's DNA would have resulted in differential amplification. 

myc, ccndl and crbB2 gene dose in normal leukocyte DNA 

To determine the cut-off point for gene amplification in breast- 
cancer tissue, 18 normal leukocyte DNA samples were tested for 
the gene dose (N), calculated as described in "Material and 
Methods". The N value of these samples ranged from 0.5 to 1.3 
(mean 0.84 ± 0.22) for mvc, 0.7 to 1.6 (mean 1.06 ± 0.23) for 
ccndl and 0.6 to 1.3 (mean 0.91 ± 0. 19) for erbB2. Since N values 
for myc, ccndl and erbB2 in normal leukocyte DNA consistently 
fell between 0.5 and 1.6, values of 2 or more were considered to 
represent gene amplification in tumor DNA. 

myc, ccnd 1 and crbB2 gene dose in breast-tumor DNA 

myc, ccndl and erbBl gene copy numbers in the 108 primary 
breast tumors are reported in Table 1. Extra copies of ccndl were 
more frequent (23%, 25/108) than extra copies of erb&2 (15%, 
16/108) and myc (10%, 11/108), and ranged from 2 to 18.6 for 
ccndl 2 to 15.1 for er£B2, and only 2 to 4.6 for the myc gene. 
Figure 2 and Table II represent tumors in which the ccnd] gene was 
amplified 16-fold (T145), 6-fold (T133) and non-amplified (T118). 
The 3 genes were never found to be co-amplified in the same tumor. 
erbB2 and ccnd] were co-amplified in only 3 cases, myc and ccndl 
in 2 cases and myc and erbB2 in 1 case. This favors the hypothesis 
that gene amplifications are independent events in breast cancer. 
Interestingly, 5 tumors showed a decrease of at least 5.0% in the 
erbBl copy number (N < 0.5), suggesting that they bore deletions 
of the 17q21 region (the site of erb&2). No such decrease in copy 
number was observed with the other 2 proto-oncogenes. 

Comparison of gene dose determined by real-time quantitative 
PCR and Southern-blot analysis 

Southern-blot analysts of myc, ccndl and erbB2 amplifications 
had previously been done on the same 1 08 primary breast tumors. A 
perfect correlation between the results of real-time PCR and 
Southern blot was obtained for tumors with high copy numbers 
(N ^ 5). However, there were cases (I myc, 6 ccndl and 4 erbB2) 
in which real-time PCR showed gene amplification whereas 
Southem-blot did not, but these were mainly cases with low extra 
copy numbers (N from 2 to 2.9). 

DISCUSSION 

The clinical applications of gene amplification assays are 
currently limited, but would certainly increase if a simple, standard- 
ized and rapid method were perfected. Gene amplification status 
has been studied mainly by means of Southern blotting, but this 
method is not sensitive enough to detect low-level gene amplifica- 
tion nor accurate enough to quantify the full range of amplification 
values. Southern blotting is also time-consuming, uses radioactive 



TABLE I - DISTRIBUTION OF AMPLIFICATION LEVEL (N) FOR myc. 
ccndl AND erbBl GENES IN 108 HUMAN BREAST TUMORS 



Gene 




Amplification level (N) 




<0.5 


0.5-1.9 2^t.9 


2:5 


myc 

ccndl 

erbB2 


0 
0 

5 (4.6%) 


97(89.8%) 11 (10.2%) 
83(76.9%) 17(15.7%) 
87 (80.6%) 8 (7.4%) 


0 

8(7.4%) 
8 (7.4%) 



reagents and requires relatively large amounts of high-quality 
genomic DNA, which means it cannot be used routinely in many 
laboratories. An amplification step is therefore required to deter- 
mine the copy number of a given target gene from minima! 
quantities of tumor DNA (small early-stage rumors, cytopuncture 
specimens or formalin-fixed, paraffin-embedded tissues). 

In this study, we validated a PCR method developed for the 
quantification of gene over-representation in rumors. The method, 
based on real-time analysis of PCR amplification, has several 
advantages over other PCR-based quantitative assays such as 
competitive quantitative PCR (Celi et ai, 1994). First, the real-time 
PCR method is performed in a closed-tube system, avoiding the 
risk of contamination by amplified products. Re-amplification of 
carryover PCR products in subsequent experiments can also be 
prevented by using the enzyme uracil N-glycosylase (UNG) 
(Longo et al. t 1990). The second advantage is the simplicity and 
rapidity of sample analysis, since no post-PCR manipulations are 
required. Our results show that the automated method is reliable. 
We found it possible to determine, in triplicate, the number of 
copies of a target gene in more than 100 tumors per day. Third, the 
system has a linear dynamicrange of at least 4 orders of magnitude, 
meaning that samples do not have to contain equal starting amounts 
of DNA. This technique should therefore be suitable for analyzing 
formalin-fixed, paraffin-embedded tissues. Fourth, and above all, 
real-time PCR makes DNA quantification much more precise and 
reproducible, since it is based on C, values rather than end-point 
measurement of the amount of accumulated PCR product. Indeed, 
the ABI Prism 7700 Sequence Detection System enables C, to be 
calculated when PCR amplification is still in the exponential phase 
and when none of the reaction components is rate-limiting. The 
within-run CV of the C, value for calibrator human DNA (5 
replicates) was always below 5%, and the between-assay precision 
in 5 different runs was always below 10% (data not shown). In 
addition, the use of a standard curve is not absolutely necessary, 
since the copy number can be determined simply by comparing the 
C, ratio of the target gene with that of reference genes. The results 
obtained by the 2 methods (with and without a standard curve) are 
similar in our experiments (data not shown). Moreover, unlike 
competitive quantitative PCR, real-time PCR does not require an 
internal control (the design and storage of internal controls and the 
validation of their amplification efficiency is laborious). 

The only potential disavantage of real-time PCR, like all other 
PCR-based methods and solid-matrix blotting techniques (South- 
ern blots and dot blots) is that is cannot avoid dilution artifacts 
inherent in the extraction of DNA from tumor cells contained in 
heterogeneous tissue specimens. Only FISH and immunohistochem- 
istry can measure alterations on a cell-by-cell basis (Pauletti et ai. 
1996; Slamon et ai. 1989). However, FISH requires expensive 
equipment and trained personnel and is also time-consuming. 
Moreover, FISH does not assess gene expression and therefore 
cannot detect cases in which the gene product is over-expressed in 
the absence of gene amplification, which will be possible in the 
future by real-time quantitative RT-PCR. Immunohistochemistry is 
subject to considerable variations in the hands of different teams, 
owing to alterations of target proteins during the procedure, the 
different primary antibodies and fixation methods used and the 
criteria used to define positive staining. 

The results of this study are in agreement with those reported in 
the literature, (i) Chromosome regions 4q 1 1 -q 1 3 and 21q21.2 
(which bear alb and app t respectively) showed no genetic alter- 
ations in the breast-cancer samples studied here, in keeping with 
the results of CGH (Kallioniemi et ai, 1994). (ii) We found that 
amplifications of these 3 oncogenes were independent events, as 
reported by other teams (Berns et ai, 1992; Borg et ai, 1992). (Hi) 
The frequency and degree of myc amplification in our breast tumor 
DNA series were lower than those of ccndl and erbB2 amplifica- 
tion, confirming the findings of Borg et ai (1 992) and Courjal.e/ ai 
(1997). (iv) The maxima of ccndl and erbB2 over-representation 
were 1 8-fold and 15-fold, also in keeping with earlier results (about 
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Figure 2 - ccndl and alb gene dosage by real-time PCR in 3 breast tumor samples: T118 (EI 2, C6, black squares), Tl 33 (Gl I, B4, red squares) 
and T145 (A8, C8, blue squares). Given the Q of each sample, the initial copy number is inferred from the standard curve obtained during the same 
experiment Triplicate plots were performed for each tumor sample, but the data for only one are shown here. The results are shown in Table II. 



30-fold maximum) (Bems et al. 1 992; Borg et al. 1 992; Courjal et 
al, 1997). (v) The erbBl copy numbers obtained with real-time 
PCR were in good agreement with data obtained with other 
quantitative PCR-based assays in terms of the frequency and 
degree of amplification (An et al, 1995; Deng et al, 1996; Valeron 



et al, 1996). Our results also correlate well with those recently 
published by Gelmini et al. (1997), who used the TaqMan system to 
measure erbBl amplification in a small series of breast tumors 
(n = 25), but with an instrument (LS-50B luminescence spectrom- 
eter, Perkin-Ehner Applied Biosystems) which only allows end- 
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TABLE If - EXAMPLES OF ccndl GENE DOSAGE RESULTS 
FROM 3 BREAST TUMORS' 



Tumor 




ccndl 






alb 




Uccndl/alb 


Copy 
number 


Mean 


SD 


Copy 
number 


Mean 


SD 


T118 


4525. 






4223 










4605 


4603 


77 


4365 


4325 


89 


1.06 




4678 






4387 








T133 


59821 






9787 










61659 


61100 


mi 


10092 


10137 


375 


6.03 




61821 






10533 








T145 


128563 






7321 










125892 


125392 


3448 


7762 


7672 


316 


16.34 




121722 






7933 









'For each sample, 3 replicate experiments were performed and the mean 
and the standard deviation (SD) was determined. The level of ccndl gene 
amplification (Nccndllalb) is determined by dividing the average ccndl 
copy namber val ue by the average alb copy number value. 



point measurement of fluorescence intensity. Here we report myc 
and ccndl gene dosage in breast cancer by means of quantitative 
PCR. (vi) We found a high degree of concordance between 
real-time quantitative PCR and Southern blot analysis in terms of 
gene amplification, especially for samples with high copy numbers 
(>5-fold). The slightly higher frequency of gene amplification 
(especially ccndl and*er£>B2) observed by means of real-time 
quantitative PCR as compared with Southern-blot analysis may be 
explained by the higher sensitivity of the former method. However, 
we cannot rule out the possibility that some tumors with a few extra 



gene copies observed in real-time PCR had additional copies of an 
arm or a whole chromosome (trisomy, tetrasomy or polysomy) 
rather than true gene amplification. These 2 types of genetic 
alteration (polysomy and gene amplification) could be easily 
distinguished in the future by using an additional probe located on 
the same chromosome arm, but some distance from the target gene. 
It is noteworthy that high gene copy numbers have the greatest 
prognostic significance in breast carcinoma (Borg et aL, 1992; 
Slamone/fl/., 1987). 

Finally, this technique can be applied to the detection of gene 
deletion as well as gene amplification. Indeed, we found a 
decreased copy number of erbBl (but not of the other 2 proto- 
oncogenes) in several tumors; erbh2 is located in a chromosome 
region (I7q21) reported to contain both deletions and amplifica- 
tions in breast cancer (Bieche and Lidereau, 1995). 

In conclusion, gene amplification in various cancers can be used 
as a marker of pre-neoplasia, also for early diagnosis of cancer, 
staging, prognostication and choice of treatment. Southern blotting 
is not sufficiently sensitive, and FISH is lengthy and complex. 
Real-time quantitative PCR overcomes both these limitations, and 
is a sensitive and accurate method of analyzing large numbers of 
samples in a short time. It should find a* place in routine clinical 
gene dosage. 
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Editorial: 

Editorial Board Member: Current Biology 
. Associate Editor, Clinical Cancer Research. 
Associate Editor, Cancer Biology and Therapy; 

Refereed papers: 

1 . Gertler, A., Ashkenazi, A., and Madar, Z. Binding sites for human growth 
hormone and ovine and bovine prolactins in the mammary gland and liver of the 
lactating cow. Mol Cell Endocrinol 34, 51-57 (1984). 

2. Gertler, A., Shamay, A., Cohen, N., Ashkenazi, A., Friesen, H., Levanon, A., 
Gorecki, M., Aviv, H., Hadari, D., and Vogel, T. Inhibition of lactogenic 
activities of ovine prolactin and human growth hormone (hGH) by a novel form of 
a modified recombinant hGH. Endocrinology 118, 720-726 (1986). 

3. Ashkenazi, A., Madar, Z., and Gertler, A. Partial purification and characterization 
of bovine mammary gland prolactin receptor. Mol Cell Endocrinql 50, 79-87 
(1987). 

4. Ashkenazi A., Pines, M., and Gertler, A. Down-regulation of lactogenic 
hormone receptors in Nb2 lymphoma cells by cholera toxin. Biochemistry 
Intematl 14, 1065-1072 (1987). 

5. Ashkenazi, A., Cohen, R., arid Gertler, A. Characterization of lactogen receptors 
in lactogenic hormone-dependent and independent Nb2 lymphoma cell lines. 
FEES Lett 210, 51-55 (1987). 

6. : Ashkenazi, A., Vogel, T., Barash, L, Hadari, D., Levanon, A., Gorecki, M., and 

Gertler, A. Comparative study on in vitro and in vivo modulation of lactogenic 
and somatotropic receptors by native human growth hormone and its modified 
recombinant analog. Endocrinology 121, 414-419 (1987). 

7. Peralta, E., Winslow, J., Peterson, G., Smith, D., Ashkenazi A., Ramachiandran, 
J., Schimerlik, M., and Capon, D. Primary structure and biochemical properties 
of an M2 muscarinic receptor. Science 236, 600-605 (1987). 

8. Peralta, E. Ashkenazj A., Winslow, J., Smith, D., Ramachandran, J., and Capon, 
D.J. Distincrit primary structures, ligand-binding properties and tissue-specific 
expression of four human muscarinic acetylcholine receptors. EMBO J. 6, 3923- 
3929(1987). 

9. Ashkenazi, A., Winslow, J., Peralta, E., Peterson, G., Schimerlik, M., Capon, D., 
and Ramachandran, J. An M2 muscarinic receptor subtype coupled to both 
adenylyl cyclase and phosphoinositide turnover. Science 238, 672-675 (.1987), 



1 0. Pines, M., Ashkenazi, A., Cohen-Chapnik, N., Binder, L., and Gertler, A. 
Inhibition of the proliferation of Nb2 lymphoma cells by femtomolar 
concentrations of cholera toxin and partial reversal of the effect by 1 2-o- 
,tetradecanoyl-phorbol-13-acetate. 1 Cell Biochem. 37, 119-129 (1988). 

11. Peralta, E. Ashkenazi, A., Winslow, J, Ramachandran, L, and Capon, D. 
Differential regulation of PI hydrolysis and adenylyl cyclase by muscarinic 
receptor subtypes. Nature 334, 434-437 (1988). 

12. Ashkenazi., A. Peralta, E., Winslow, J., Ramachandran, J., and Capon, D. 
Functionally distinct G proteins couple different receptors to PI hydrolysis in the 
same cell Cell 56, 487-493 (1989). 

13. Ashkeriazi, A., Ramachandran, J., and Capon, D. Acetylcholine analogue 
stimulates DNA synthesis in brain-derived cells via specific muscarinic 
acetylcholine receptor subtypes. Nature 340,146-150(1989). , 

14. Lammare, D., Ashkenazi, A., Fleury, S., Smith, D., Sekaly, R., and Capon, D. 
The MHC-binding and gpl20-binding domains of CD4 are distinct and separable. 
Science 245, 743-745 (1989). 

15. Ashkenazi, A., Presta, L., Marsters, S., Camerato, T., Rosenthal, K., Fendly, B;, 
and Capon, D. Mapping the CD4 binding site for human immunodefficiency 
virus type 1 by alanine-scanning mutagenesis. Proc. Natl Acad. Sci. USA. 87, 
7150-7154(1990). 

16. Chamow, S., Peers, D., Bym, R., Mulkemn, M., Harris, R., Wang, W., Bjorkman, 
P., Capon, D., and Ashkenazi, A. Enzymatic cleavage of a CD4 immunpadhesin 
generates crystallizable, biologically active Fd-like fragments. Biochemistry 29, 
9885-9891 (1990). 

17. Ashkenazi, A., Smith, P., Marsters, S., Riddle, L., Gregory, T., Ho, P., and 
Capon, P. Resistance of primary isolates of human immunodefficiency virus type 
1 to soluble CP4 is independent of CP4-rgp 120 binding affinity. Proc. Natl 
Acad Set USA. 88, 7056-7060 (1991). 

18. Ashkenazi, A., Marsters, S., Capon, P., Chamow, :S,, Figari., L, Pennica, P., 
GoeddeL, P., Palladino, M., and Smith, P. Protection against endotoxic shock by 
a tumor necrosis factor receptor immunoadhesin. Proc. Natl Acad, Set USA. 88, 
10535-10539(1991). 

19. Moore, J., McKeating, J., Huang, Y., Ashkenazi, A ., and Hb, P. Virions of 
primary HIV- 1 isolates resistant to sCP4 neutralization differ in sCP4 affinity and 
glycoprotein gpl20 retention from sCP4-sensitive isolates. J. Virol 66, 235-243 
(1992). 
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20. Jin, H., Oksenberg, D., Ashkenazi, A.. Peroutka, S., Duncan, A., Rozmahel., R., 
Yang, Y., Mengod, G., Palacios, J., and ODowd, B. Characterization of the 
human 5-hydroxytryptamineiB receptor. J. Biol Chem. 267, 5735-5738.(1992). 

21. Marsters, A., Frutkin, A., Simpson, N., Fendly, B. and Ashkenazi, A. 
Identification of cysteine-rich domains of the type 1 tumor necrosis receptor 

. involved in ligand binding. J. Biol Chem. 267, 5747-5750 (1992). 

22. Ghamow, S., Kogan, T., Peers, D., Hastings, R., Byrn, R., and Ashkenazi, A. 
Conjugation of sCD4 without loss of biological activity via a novel carbohydrate- 
directed cross-linking reagent. 1 Biol Cherru 267, 15916-15922 (1992). 

23. Oksenberg, D., Marsters, A., ODowd, B., Jin, H., Havlik, S., Peroutka, S., and 

Ashkenazu A. A single amino-acid difference confers major pharmacologic 
variation between human and rodent 5-HTib receptors. Nature 360, 161-163 

(1992) . 

24. Haak-Frendscho, M., Marsters, S., Ghamow, S., Peers, D., Simpson, N., and 
Ashkenazi. A. Inhibition of interferon y by an interferon y receptor 
immunoadhesin. Immunology 79, 594-599 (1993). 

25. Penica, D., Lam, V., Weber, R., Kohr, W., Basa, L., Spellman, M., Ashkenazi. 
Shire, S., and Goeddel, D. Biochemical characterization of the extracellular 
domain of the 75-kd tumor necrosis factor receptor. Biochemistry 32, 3131-3138. 

(1993) . ; 

26. Barfod, L., Zheng, Y., Kuang, W., Hart, M., Evans, T., Cerione, R., and 
Ashkenazi, A. Cloning and expression of a human CDC42 GTPase Activating 
Protein reveals a functional SH3-binding domain. J. Biol Chem. 268* 26059- 
26062 (1993). 

27. Ghamow, S., Zhang, D., Tan, X., Mhtre, S., Marsters, S., Peers, D., Byrn, R., 
Ashkenazi, A., and Yunghans, R; A humanized bispecific immunoadhesin- 
antibody that retargets CD3+ effectors to kill HIV-1 -infected cells. J. Immunol 
153,4268-4280(1994). 

28. Means, R., Krantz, S., Luna, J., Marsters, S., and Ashkenazi, A, Inhibition of 
murine erythrbid colony formation in vitro by iterferon y and correction by 
interferon y receptor immunoadhesin. Blood 83, 91 1-915 (1994). 

29. Haak-Frendscho, M., Marsters, S., Mordenti, J., Gillet, N., Chen, S., 
an dAshkenazi, A. Inhibition of TNF by a TNF receptor immunoadhesin: 
comparison with an anti-TNF mAb. Immunol 152, 1347-1353 (1994). 
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30. Chamow, S., Kogan, T., Venuti, M., Gadek, T., Peers, D M Mordenti, J., Shak, S., 
and Ashkenazi, A. Modification of CD4 immunoadhesin with monomethoxy- 
PEG aldehyde via reductive alkilation. Bioconj. Cherru 5, 133-140 (1994). 

31. Jin, H., Yang, R., Marsters, S., Bunting, S., Wurm, F., Chamow, S., and 
Ashkenazu A. Protection against rat endotoxic shock by p55 tumor necrosis factor 
(TNF) receptor immunoadhesin: comparison to anti-TNF monoclonal antibody. J. 
Infect Diseases 170, 1323-1326 (1994). 

32. Beck, J., Marsters, S., Harris, R., Ashkeriazu A., and Chamow, S. Generation of 
soluble interleukin- 1 receptor from an immunoadhesin by specific cleavage. Mol 
Immunol 31/1335-1344 (1994). 

33. Pitti, B., Marsters, M., Haak-Frendscho, M., Osaka* G., Mordenti, J., Chamow, S., 
and Ashkenazi, A. Molecular and biological properties of an interleukin- 1 
receptor immunoadhesin. Mol Immunol 31, 1345-1351 (1994). 

34. Oksenberg, D., Havlik, S., Peroutka, S., and Ashkenazi, A. The third intracellular 
loop of the 5-HT2 receptor specifies effector coupling. J. Neurochem. 64, 1440- 
1447(1995). 

35. Bach, E., Szabo, S., Dighe, A., Ashkenazi, A., Aguet, M., Murphy, K., and 
Schreiber, R. Ligand-induced autoregulation of IFN-y receptor p chain expression 
in T helper cell subsets. Science 270, 1215-1218 (1995). 

36. Jin, H., Yang, R., Marsters, S-, Ashkenazi, A., Bunting, S., Marra, M., Scott, R., 
and Baker, J. Protection against endotoxic shock by bactericidal/permeability- 
increasing protein in rats. J. Clin. Invest 95, 1947-1952(1995). 

37. Marsters, S., Penica, D., Bach, E., Schreiber, R., and Ashkenazi, A. Interferon y 
signals via a high-affinity multisubunit receptor complex that contains two types 
of polypeptide chain. Proc. Natl Acad. Set USA. 92, 5401-5405 (1995). 

38. Van Zee, K., Moldawer, L., Oldenburg, H., Thompson, W., Stackpole, S., 
Montegut, W., Rogy, M., Meschter, C, Gallati, H., Schiller, C., Richter, W., 
Loetcher, H., Ashkenazi, A , Chamow, S., Wurm, F., Calvano, S., Lowry, S., and 
Lesslauer, W. Protection against lethal E. coli bacteremia in baboons by 
pretreatmeht with a 55-kDa TNF receptor-Ig fusion protein, Ro45-208 1 . J: 
Immunol 156,2221-2230 (1996). 

39. Pitti, R., Marsters, S., Ruppert, S., Donahue, C, Moore, A., and Ashkenazi, A . 
* Induction of apoptosis by Apo-2 Ligand, *a new member of the tumor necrosis " 

factor cytokine family. J. Biol Chem. 271, 12687-12690 (1996). 
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40. Marsters, S., Pitti, R., Donahue, C, Rupert, S., Bauer, K., and Ashkenazi, A. 
Activation of apoptosis by Apo-2 ligand is independent of FADD but blocked by 
CrmA; Curr. Biol. 6, 1669-1676 (1996). 

41 . Marsters, S., Skubatch, M., Gray, C, and Ashkenazi, A . Herpesvirus entry 
mediator, a novel member of the tumor necrosis factor receptor family, activates 
the NF-kB and AP-1 transcription factors. /. Biol. Chem. 272, 14029-14032 
(1997). 

42. Sheridan, J., Marsters, S.,. Pitti, R., Gumey, A., Skubatch, M., Baldwin, D., 
Ramakrishnan, L., Gray, C., Baker, K., Wood, W.L, Goddard, A., Godowski, P., and 
Ashkenazi. A. Control of TRAIL-induced apoptosis by a family of signaling and 
decoy receptors. Science 277, 818-821 (1997). 

43. Marsters, S., Sheridan, J., Pitti, R., Gumey, A., Skubatch, M., Balswin, D., Huang, A., 
Yuan, J., Goddard, A., Godowski, P., and Ashkenazi. A. A novel receptor for 
Apo2L/TRAIL contains a truncated death domain. Curr. Biol. 7, 1003-1006 (1997).. 

44. Marsters, A., Sheridan, J., Pitti, R., Brush, J., Goddard, A., and Ashkenazi. A. 
Identification of a ligand for the deamrdomain-containing receptor Apo3. Curr. Biol. 
8, 525-528 (1998). 

"45. Rieger, J., Naumann, U., Glaser, T., Ashkenazi, A ., and Weller, M. Apo2 ligand: 
a novel weapon against malignant glioma? FEBS Lett 427, 124-128 (1 998). 

46. Pender, S., Fell, J., Chamow, S., Ashkenazi. A ., and MacDonald, T. A p55 TNF 
receptor immunoadhesin prevents T cell mediated intestinal injury by inhibiting 
matrix metalloproteinase production. J. Immunol. 160, 4098-4103 (1998). 

47. Pitti, R., Marsters, S., Lawrence, D., Roy, Kischkel, F., M., Dowd, P., Huang, A., 
Donahue, C, Sherwood, S., Baldwin, D., Godowski, P., Wood, W., Gurney, A., 
Hillan, K., Cohen, R., Goddard, A., Botstein, D., and Ashkenazi. A. Genomic 
amplification of a decoy receptor for Fas ligand in . lung and colon cancer. Nature 
396, 699-703 (1998). 

48. Mnri } Marakami-Mori. K.. Nakamura. S.. Ashkenazi. A ., and Bonavida, B. 
Sensitization of AIDS Kaposi's sarcoma cells to Apo-2 ligahd-induced apoptosis 
by actinomycin D. J. Immunol. 162, 5616-5623 (1999). 

49. Gurney, A. Marsters, S., Huang, A., Pitti, R., Mark, M., Baldwin, D., Gray, A., 
Dowd, P., Brush, J., Heldens, S., Schow, P., Goddard, A., Wood, W., Baker, K., 
Godowski. P.. and Ashkenazi. A.- Identification of a new member of the tumor 
necrosis factor family and its receptor, a human ortholog of mouse GITR. Curr: 
Biol. 9, 215-218 (1999). 
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50. Ashkenazi. A ., Pai, R., Fong, s., Leung, S., Lawrence, D., Marsters, S., Blackie, 
C, Chang, L., McMurtrey, A., Hebert, A., DeForge, L., Khoumenis, I., Lewis, D., 
Harris, L., Bussiere, J., Kpeppen, H., Shahrokh, Z., and Schwall, R. Safety and 
anti-tumor activity of recombinant soluble Apo2 ligand. J. Clin. Invest. 104, 155- 
162 (1999). 

5L Chuntharapai, A., Gibbs,V., Lu, J,, Ow, A., Marsters, S., Ashkenazi, A., De Vos, 
A., Kim, K J. Determination of residues involved in ligand binding and signal 
transmission in the human BFN^x receptor 2. J. Immunol. 163, 766-773 (1999). 

52. Johnsen, A.-C., Haux, J., Steinkjer, B., Nonstad, U, Egeberg, K., Sundan, A., 
Ashkenazi. A., and Espevik, T. Regulation of Apo2L/TRAIL expression in NK 

. cells - involvement in NK cell-mediated cytotoxicity. Cytokine 11, 664-672 
(1999). 

53. . Roth, W., Isenmann, S>, Naumann, U., Kugler, S., Bahr, M., Dichgans, J,, 

. Ashkenazi. A., and Weller, M. Eradication of intracranial human malignant 
glioma xenografts by Apo2L/TRAIL. Biochem. Biophys. Res. Commun. 265, 479- 
483 (1999). 

54. Hymowitz, S.G., Christinger, H.W., Fuh, G., Ultsch, M., O'Connell, M., Kelley, 
R.F., Ashkenazi. A. and de Vos, A.M. Triggering Cell Death: The Crystal 
Structure of Apo2L/TRAIL in a Complex with Death Receptor 5. Molec. Cell 4, 
563-571 (1999). 

55. Hymowitz, S.G., O'Connel, M.P.fUtsch, M.H!, Hurst, A., Totpal, K., Ashkenazi, 
A„ de Vos, A.M., Kelley, R.F. A unique zinc-binding site revealed by a high- 
resolution X-ray structure of homotrimeric Apo2J7TRAlL. Biochemistry 39, 633- 
640 (2000). 

56. Zhou, Q., Fukushima, P., DeGraff, W., MitcheU, J.B., Stetler-Stevenson, M., 
Ashkenazi.- A., and Steeg, P.S. Radiation and the Apo2L/TRAIL apoptotic 
pathway preferentially inhibit the colonization of premalignant human breast 
cancer ^ cells overexpressing cyclin D l. Cancer Res. 60, 2611-2615 (2000). 

57. Kischkel, F.C., Lawrence, D. A., Chuntharapai, A., Schow, P., Kim, J., and 
Ashkenazi. A. Apo2I/TPvAIL-dependent recruitment of endogenous F ADD and 
Caspase-8 to death receptors 4 and 5. Immunity 12, 61 1-620 (2000). 

58. Yan, M., Marsters, S.A., Grewal, I.S., Wang, H., * Ashkenazi, A-, and *Dixit, 
V.M. Identification of a receptor for BlyS demonstrates a crucial role in humoral 
immunity. Nature Immunol. 1,37-41 (2000). 
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59. Marsters, S.A., Yan, M., Pitti, R.M., Haas, P.E., Dixit, V.M., and AshkenazL A. 
Interaction of the TNF homologues BLyS and APRIL with the TNF receptor 
homologies BCMA and TACL Curr.Biol 10, 785-788 (2000). 

60. Kischkel, F.C., and Ashkenazi, A . Combining enhanced metabolic labeling with 
immunoblotting to detect interactions of endogenous cellular proteins. 
Biotechniques 29, 506-512 (2000). 

61. Lawrence, D., Shahrokh, Z., Marsters, S., Achilles, KL, Shih, D. Mounho, B., 
Hillan, K., Totpal, K. DeForge, L., Schow, P., Hooley, J., Sherwood, S., Pai, R., 
Leung, S., Khan, L., Gliniak, B., Bussiere, J., Smith, C, Strong S., KeHey^ S., 
Fox, J., Thomas, D., and Ashkenazi, A. Differential hepatocyte toxicity of 
recombinant Apo2I/TRAIL versions. Nature Med. 7, 383-385 (2001). 

62. , Chuntharapai, A., Dodge, K., Grimmer, K., Schroeder, K., Martsters, S.A., 

' Koeppen, H., Ashkenazi A .» and Kim, K.J. Isotype-dependent inhibition of 
tumor growth in vivo by monoclonal antibodies to death receptor 4. Immunol* 
166,4891-4898(2001). 

63. Pollack, IF.. Erff,M.« and Ashkenazi, A . Direct stimulation of apoptotic 
signaling by soluble Apo2IVtumor necrosis factor-related apoptosis-inducing 
ligand leads to selective killing of glioma cells. C/m. Cancer Res. 7, 1362-1369 
(2001). 

64. Wang, H., Marsters, S.A., Baker, T., Chan, B., Lee, W.P., Fu, L., Tumas, D., Yan,. 
M., Dixit, V.M., * Ashkenazi, A ., and *Grewal, I.S. TACI-ligand interactions are 
required for T cell activation and collagen-induced arthritis in mice. Nature 
Immunol!, 632-637 (2001): 

65. Kischkel, F.C., Lawrence, D. A., Tinel, A., Virmani, A., Schow, P., Gazdar, A.^ 
Blenis, J., Amott, D., and Ashkenazi A . Death receptor recruitment of 
endogenous caspase-10 and apoptosis initiation in the absence of caspase-8. 7. 
Biol Chem. 276, 46639-46646 (2001). 

66. LeBlanc, H., Lawrence, D.A., Varfolomeev, E., Totpal, K., Morlan, J., Schow, P., 
Fong, Si, Schwall, R., Sinicropi, D., and Ashkenazi, A T umor cell resistance to 
death receptor induced apoptosis through mutational inactivation of the 
proapoptotitc Bcl-2 homolog Bax. Nature Med. 8, 274-28 1 (2002), 

67. Miller, K., Meng, G., Liu, L, Hurst, A., Hsei, V., Wong, W-L., Ekert, R., 
Lawrence, D., Sherwood, S., DeForge, L., Gaudreault., Keller, G., Sliwkowski, 
M., Ashkenazi, A ., and Presta, L. Design, Construction, and analyses of 
multivalent antibodies. J. Immunol. 170, 4854-4861 (2003). 
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68. Varfolomeev, E., Kischkel, R, Martin, R, Waah, H., Lawrence, D., Olsson, C, 
Tom, L., Erickson, S., French, D., Schow, P., Grewal, I. and Ashkenazi, A. 
Immune system development in APRIL knockout mice. Submitted. 

Review articles: 

1 . Ashkeiiazi, A., Peralta, E;, Wirislow, J., Ramachandran, J., and Cappri, D., J. 
Functional role of muscarinic acetylcholine receptor subtype diversity. Cold 
Spring Harbor Symposium on Quantitative Biology. LIII, .263-272 (1988). 

2. Ashkenazi, A ., Peralta, E., Winslow, J., Ramachandran, J., and Capon, D. . 
Functional diversity of muscarinic receptor subtypes in cellular signal 
transduction and growth. Trends Pharmacol Sou Dec Supplement, 12-21 (1989). 

3. Chamow, S., Duliege, A., Aramann, A., Kahn, J., Allen, D., Eichberg, J., Byrn, 
R., Gapon, D., Ward, R., and Ashkenazi, A . CD4 immunoadhesins in anti-HIV 

. therapy: new developments. Int J. Cancer Supplement 7, 69-72 (1-992). 

4. ' Ashkenazi, A ., Capon, and D. Ward, R. Immunoadhesins. Int Rev. Immunol. 10, 

217-225 (1993). 

5. Ashkenazi/A ., and Peralta, E. Muscarinic Receptors. In Handbook of Receptors 
and Channels. (S. Peroutka, ed.), CRC Press, Boca Raton, Vol. I, p. 1-27, (1994). 

6. Krantz, S. B., Means, R. T., Jr., Lina, J., Marsters, S. A., and Ashkenazi, A . 
Inhibition of erythroid colony formation in vitro by gamma interferon. In 
Molecular Biology of Hematopoiesis (N. Abraham, R. Shadduck, A. Levine R 
Takaku, eds.) Intercept Ltd. Paris, Vol. 3, p. 135-147 (1994). . 

7. Ashkenazi, A . Cytokine neutralization as a potential therapeutic approach for 
SIRS and shock. Biotechnology in Healthcare 1, 197-206 (1994). . 

8. Ashkenazi, A ., and Chamow, S. M. Immunoadhesins; an alternative to human 
monoclonal antibodies. Immunomethods: A companion to Methods in 
EnzimologyS, 104-115 (1995). 

9. Chamow, S., and Ashkenazj A . Immunoadhesins: Principles and Applications. 
Trends Biotech. 14, 52-60 (1996). 

10: Ashkenazi, A ., and Chamow, S. M. Immunoadhesins as research tools and 
therapeutic agents. Curr. Opin. Immunol. 9, 195-200 (1997). 

11. Ashkenazi, A ., and Dixit, V. Death receptors: signaling and modulation. Science 
281, 1305-1308 (1998). 

12. Ashkenazi A ., and Dixit, V. Apoptosis control by death and decoy receptors. 
Curr. Opin. Cell Biol 11,255-260(1999). 
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13. Ashkenazi, A . Chapters on Apo2L/TRAIL; DR4, DR5, DcRl, DcR2; and DcR3. 
Cn\]mr '"•ytHr™- H*nHW.k (www apnet.com/cvto kinereference/). 

14. Ashkenazi. A . Targeting death and decoy receptors of the tumor necrosis factor 
. superfamily. Nature Rev. Cancer 2, 420-430 (2002). 

15. LeBlanc, H. and Ashkenazi, A . Apoptosis signaling by App2L/TRAIL. Cell Death 
and Differentiation 10, ; 66-7 5 (2003). 

16. Almasan, A. and Ashkenazi, A . Apo2L/TRAIL: apoptosis signaling, biology, and 
potential for cancer therapy. Cytokine arid Growth Factor Reviews 14, 337-348 
(2003). 

Book: 

Antibody Fusion Proteins (Chamow, S., and Ashkenazi, A ., eds., John Wiley and 
• Sons Inc.) (1999). . . ' 

Talks: 

1 . Resistance of primary HIV isolates to CD4 is independent of CD4-gpl20 binding 
affinity. UCSD Symposium, HTV Disease: Pathogenesis and Therapy. 
Greenelefe, FL, March 1991. 

2. Use of immuno-hybrids to extend the half-life of receptors. IBC conference on 
Biopharmaceutical Halfiife Extension. New Orleans, LA, June 1992. 

3^ Results with TNF receptor hnmunoadhesins for the Treatment of Sepsis. IBC 
conference on Endotoxemia and Sepsis. Philadelphia, PA, June 1992. 

4. hnmunoadhesins: an alternative to human antibodies. IBC conference on 
Antibody Engineering. San Diego, CA, December 1993. 

5. Tumor necrosis factor receptor: a potential therapeutic for human septic sliock. . 
American Society for Microbiology Meeting, Atlanta, GA, May 1993. 

6. Protective efficiacy of TNF receptor imriiunoadhesin vs anti-TNF monoclonal 
antibody in a rat model for endotoxic shock. 5th Intemational Congress on TNF. 
Asilomar,CA, May 1994. 

7. Interferon-y signals via a muMsubunit receptor complex that contains two types of 
polypeptide chain. American Association of hnmunologists Conference. San 
Franciso, CA, July 1995. 

8. Immunoadhesins: Principles and Applications. Gordon Research Conference on 
Drug Delivery in Biology and Medicine. Ventura, CA, February 1 996. 
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9. Apo-2 Ligand, a new member of the TNF family that induces apoptosis in tumor 
cells. Cambridge Symposium on TNF and Related Cytokines in Treatment of 
Cancer. Hilton-Headi NC, March 1996. 

10. Induction of apoptosis by Apo2 Ligand. American Society for Biochemistry and 
Molecular Biology, Symposium on Growth Factors and Cytokine Receptors. New 
Orleans, LA, June, 1996. 

1L Apo2 ligand, an extracellular trigger of apoptosis. 2nd Clontech Symposium, 

Palo Alto, CA, October 1996. 
12. Regulation of apoptosis by members of the TNF ligand and receptor families, 

Stanford University School of Medicine, Palo Alto, CA, December 1996. 
1 3 Apo-3 : ano vel receptor that regulates cell death and inflammation. 4th 

International Congress on Immune Consequences of Trauma, Shock, and Sepsis. 

Munich, Germany, March 1997. 

14. New members of the TNF ligand and receptor families that regulate apoptosis, 
inflammation, and immunity. UCLA School of Medicine, LA, CA, March 1997. 

15. Immunoadhesins: an alternative to monoclonal antibodies. 5th World Conference 
on Bispecific Antibodies. Volendam, Holland, June 1997. 

16. Control of Apo2L signaling. Cold Spring Harbor Laboratory Symposium on 
Programmed CellDeath. Cold Spring Harbor, New York. September,: 1997. 

17. Chairman and speaker, Apoptosis Signaling session. EC's 4th Annual 
Conference on Apoptosis. San Diego, CA., October 1997. 

18. Control of Apo2L signaling by death and decoy receptors. American Association 
for the Advancement of Science. Philadelphia, PA, February 1998. 
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DECLARATION OF PAUL POLAKIS, Ph.D. 



I, Paul Polakis, Ph.D., declare and say as follows: 

1 . I was awarded a Ph.D. by the Department of Biochemistry of the Michigan 
State University in 1984. My scientific Curriculum Vitae is attached to and forms 
part of this Declaration (Exhibit A). 

2 I am currently employed by Genentech, Inc. where my job title is Staff 
Scientist. Since joining Genentech in 1999, one of my primary responsibilities has 
been leading Genentech's Tumor Antigen Project, which is a large research project 
with a primary focus on identifying tumor cell markers that find use as targets for 
both the diagnosis and treatment of cancer in humans. 

3. As part of the Tumor Antigen Project, my laboratory has been analyzing 
differential expression of various genes in tumor cells relative to normal cells. 
The purpose of this research is to identify proteins that are abundantly expressed 
on certain tumor cells and that are either (i) not expressed, or (ii) expressed at 
lower levels, on corresponding normal cells. We call such differentially expressed 
proteins "tumor antigen proteins", When such a tumor antigen protein is 
identified, one can produce an antibody that recognizes and binds to that protein. 
Such an antibody finds use in the diagnosis of human cancer and may ultimately 
serve as an effective therapeutic in the treatment of human cancer. 

4. In the course of the research conducted by Genentech's Tumor Antigen 
Project, we have employed a variety of scientific techniques for detecting and 
studying differential gene expression in human tumor cells relative to normal cells, 
at genomic DNA, mRNA and protein levels. An important example of one such 
technique is the well known and widely used technique of microarray analysis 
which has proven to be extremely useful for the identification of mRNA molecules 
that are differentially expressed in one tissue or cell type relative to another. In the 
course of our research using microarray analysis, we have identified 
approximately 200 gene transcripts that are present in human tumor cells at 

— significantly higher levels-^ we- 
have generated antibodies that bind to about 30 of the tumor antigen proteins 
expressed from these differentially expressed gene transcripts and have used these 
antibodies to quantitatively determine the level of production of these tumor 
antigen proteins in both human cancer cells and corresponding normal cells. We 
have then compared the levels of mRNA and protein in both the tumor and normal 
cells analyzed. 

5. From the mRNA and protein expression analyses described in paragraph 4 
above, we have observed that there is a strong correlation between changes in the 
level of mRNA present in any particular cell type and the level of protein 



expressed from that mRNA in that cell type. In approximately 80% of our 
observations we have found that increases in the level of a particular mRNA 
correlates with changes in the level of protein expressed from that mRNA when 
human tumor cells are compared with their corresponding normal cells. 

6. Based upon my own experience accumulated in more than 20 years of 
research, including the data discussed in paragraphs 4 and 5 above and my 
knowledge of the relevant scientific literature, it is my considered scientific 
opinion that for human genes, an increased level of mRNA in a tumor cell relative 
to a normal cell typically correlates to a similar increase in abundance of the 
encoded protein in the tumor cell relative to the normal cell. In fact, it remains a 
central dogma in molecular biology that increased mRNA levels are predictive of 
corresponding increased levels of the encoded protein. While there have been 
published reports of genes for which such a correlation does not exist, it is my 
opinion that such reports are exceptions to the commonly understood general rule 
that increased mRNA levels are predictive of corresponding increased levels of the 
encoded protein. 

7. I hereby declare that all statements made herein of my own knowledge are 
true and that all statements made on information or belief are believed to be true, 
and farther that these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine of imprisonment, or both, 
under Section 1001 of Title 18 of the United States Code and that such willful 
statements may jeopardize the validity of the application or any patent issued 
thereon. 
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I, Paul Polakis, Ph.D., declare and say as follows; 

I am currently employed by Genentech, Inc, where my job title is Staff 
Scientist. 

Since joining Genemech in 1999, one of my primary responsibilities has 
been leading Genentech's Tumor Antigen Project, which is a large research 
project with a primary focus on identifying tumor cell markers that find use 
as targets for both the diagnosis and treatment of cancer in human*. 

As I stated in my previous Declaration dated May 7, 2004 (attached as 
Exhibit A), my laboratory has been employing a variety of techniques, 
including microarray analysis, to identify genes which are differentially 
expressed in human tumor tissue relative to normal human tissue. The 
primary purpose of this research is to identify proteins that are abundantly 
expressed on certain human tumor tissue(ii) and that are either (i) not 
expressed, or (ii) expressed at detectably lower leveb, on normal tissue(s); 

In the course of our research using microarray analysis, we have identified 
approximately 200 gene transcripts that are present in human tumor tissue 
at significantly higher levels than in normal human tissue. To date, we 
have successfully generated antibodies that bind to 31 of the tumor antigen 
proteins expressed from these differentially expressed gene transcripts and 
have used these antibodies to quantitatively determine the level of 
production of these tumor antigen proteins in both human tumor tissue and 
normal tissue. We have then quantitatively compared the levels of mRNA 
and protein in both the tumor and normal tissues analyzed. The results of 
these analyses are attached herewith as Exhibit B. In Exhibit B, means 
that the mRNA or protein was detectably overexpressed in the tumor tissue 
relative to normal tissue and means that no detectable overexpression 
was observed in the tumor tissue relative to normal tissue. 

As shown in Exhibit B, of the 31 genes identified as being detectably 
overexpressed in human tumor tissue as compared to normal human tissue 
at the mRNA level , 28 of them (i.e., greater than 90%) are also detectably 
overexpressed in human tumor tissue as compared to normal human tissue 
at the protein level . As such, in the cases where we have been able to 
quantitatively measure both (i) mRNA and (ii) protein levels in both (i) 
tumor tissue and (ii) normal tissue, we have observed that in rhe vast 
majority of cases, there is a very strong correlation between increases in 
mRNA expression and increases in the level of protein encoded by that 
mRNA. 



6. Based upon my own experience accumulated in more than 20 years of 
research, including the data discussed in paragraphs 4-5 above and my 
knowledge of rhe relevant scientific literature, ir is my considered scientific 
opinion that for human genes, an increased level of mRNA in a tumor 
tissue relative to a normal tissue more often than not correlates to a similar 
increase in abundance of the encoded protein in the tumor tissue relative ro 
rhe normal tissue. In face, it remains a generally accepted working 
assumption in molecular biology that increased mRNA levels are more 
often than not predictive of elevated levels of the encoded protein. In fact, 
an entire industry focusing on the research and development of therapeutic 
antibodies to treat a variety of human diseases, such as cancer, operates on 
this working assumption. 

7. I hereby declare that all statements made herein of my own knowledge are 
true and that all statements made on information or belief are believed to be 
true, and further that these statements were made with the knowledge that 
willful false statements and the like so made are punishable by fine or 
imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code and that such willful statements may jeopardize the validity of the 
application or any patent issued thereon. 
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EXHIBIT A 



DECLARATION OF PAUL POLAKIS, Ph.D. 
I, Paul Polakis, Ph.D., declare and say as follows: 

1 . I was awarded a PhD. by the Department of Biochemistry of the Michigan 
State University in 1 984. My scientific Curriculum Vitae is attached to and forms 
part of this Declaration (Exhibit A). 

2. I am currently employed by Genentech, Inc. where my job title is Staff 
Scientist. Since joining Genentech in 1999, one of ray primary responsibilities has 
been leading Genentech's Tumor Antigen Project, which is a large research project 
with a primary focus on identifying tumor cell markers that find use as targets for 
both the diagnosis and treatment of cancer in humans. 

3. As part of the Tumor Antigen Project, my laboratory has been analyzing 
differential expression of various genes in tumor cells relative to normal cells. 
The purpose of this research is to identify proteins that are abundantly expressed 
on certain tumor cells and that are either (i) not expressed, or (ii) expressed at 
lower levels, on corresponding normal cells. We call such differentially expressed 
proteins "tumor antigen proteins". When such a tumor antigen protein is 
identified, one can produce an antibody that recognizes and binds to that protein. 
Such an antibody finds use in the diagnosis of human cancer and may ultimately 
serve as an effective therapeutic in the treatment of human cancer. 

4. In the course of the research conducted by Genentech's Tumor Antigen 
Project, we have employed a variety of scientific techniques for detecting and 
studying differential gene expression in humain tumor cells relative to normal cells, 
at genomic DNA, mRNA and protein levels. An important example of one such 
technique is the well known and widely used teclinique of microarray analysis 
which has proven to be extremely useful for the identification of mRNA molecules 
that are differentially expressed in one tissue or cell type relative to another. In the 
course of our research using microarray analysis, we have identified 
approximately 200 gene transcripts that are present in human tumor cells at 
significantly higher levels than in corresponding normal human cells. To date, we 
have generated antibodies that bind to about 30 of the tumor antigen proteins 

expressed trom these differentially express^^ne^arr^pts^ndiiavenisedHhese 

antibod ies to quantitatively determine the level of production of these tumor 
antigen proteins in both human cancer cells and corresponding normal cells. We 
have then compared the levels of mRNA and protein in both the tumor and normal 
cells analyzed. 

5. From the mRNA and protein expression analyses described in paragraph 4 
above, we have observed that there is a strong correlation between changes in the 
level of mRNA present in any particular cell type and the level of protein 



expressed from that mRNA in that cell type. In approximately 80% of our 
observations we have found that increases in the level of a particular mRNA 
correlates with changes in the level of protein expressed from that mRNA when 
human tumor ceils are compared with their corresponding normal cells. 

6. Based upon my own experience accumulated in more than 20 years of 
research, including thedata discussed in paragraphs 4 and 5 above and my 
knowledge of the relevant scientific literature, it is my considered scientific 
opinion that for human genes, an increased level of mRNA in a tumor ceil relative 
to a normal cell typically correlates to a similar increase in abundance of the 
encoded protein in the tumor cell relative to the normal cell. In fact, it remains a 
central dogma in molecular biology that increased mRNA levels are predictive of 
corresponding increased levels of the encoded protein. Whiie there have been 
published reports of genes for which such a correlation does not exist, it is my. 
opinion that such reports are exceptions to the commonly understood general rule 
that increased mRNA levels are predictive of corresponding increased levels of the 
encoded protein. 

7. I hereby declarethat all statements made herein of my own knowledge are 
true and that all statements made on information or belief are believed to be true, 
and further that these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or imprisonment, or both, 
under Section 1001 of Title 18 of the United States Code and that such willful 
statements may jeopardize the val idity of the application or any patent issued 
thereon. - 
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Review 

Paul a. Haynes Proteome analysis: Biological assay or data archive? 

Steven P. Gygi 

Daniel Figeys j n tn j s rev j ew we examine the current state of proteome analysis. There are 

Ruedi Aebersold three main issues discussed: why it is necessary to study proteomes; how pro- 

teomes can be analyzed with current technology; and how proteome analysis 
Department or Molecular C£m be used tQ enhance biological research. We conclude that proteome anal- 

Biotechnology, u n ,ve «jty of ysjs is an essential tool in the understanding of regulated biological systems. 

Washington, Seattle, WA, UbA Current technology, while still mostly limited to the more abundant proteins, 

enables the use of proteome analysis both to establish databases of proteins 
present, and to perform biological assays involving measurement of multiple 
variables. We believe that the utility of proteome analysis in future biological 
research will continue to be enhanced by further improvements in analytical 
technology- 
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1 Introduction 

A proteome has been defined as the protein complement 
expressed by the genome of an organism, or, in multicel- 
lular organisms, as the protein complement expressed by a 
tissue or differentiated cell [1]. In the most common im- 
plementation of proteome analysis the proteins extracted 
from the cell or tissue analyzed are separated by high 
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2 Rationale for proteome analysis 

The dramatic growth in both the number of genome 
projects and the speed with which genome sequences 
are being determined has generated huge amounts of 
sequence information, for some species even complete 
genomic sequences ([15-17]). The description of the 
state of a biological system by the quantitative measure- 
ment of system components has long been a primary 
objective in molecular biology. With recent technical 
advances including the development of differential dis- 
play-PCR [18], cDNA microarray and DNA chip techno- 
logy [19, 20] and serial analysis of gene expression 
(SAGE) [21, 22], it is now feasible to establish global and 
quantitative rriRNA expression maps of cells and tissues, 
in which the sequence of all the genes is known, at a 
speed and sensitivity which is not matched by current 



© WILEY-VCH Verlag GmbH, 69451 Weinheim. 1998 



0173-0835/98/1 1 1 1-1862 $17.50+. 50/0 



171 



/ Urt"'pf"" i ' s ' x . 



1998. /9. 1862-1871 



Proteomt analysts: Biological assay or data archive 



1863 



re 
ro- 
ns 
al- 
ts, 
is, 
ns 
»Ie 
;al 
;al 



E), 
:id 
el- 
tly 

It!- 

ne 
rn- 
air 

)0- 

rn- 
ire 
za- 
lif- 
ro- 
\ro- 



no- 
ne, 
an- 
ell 

.3], 
ise 
:ri- 
he 
ne 
nd 



ne 
;es 
of 
He 
he 
re- 
iry 
;al 
lis- 
Vo- 
on 
ad 



protein analysis technology. Given the long-standing 
paradigm in biology that DNA synthesizes RNA which 
synthesizes protein, and the ability to rapidly establish 
global, quantitative mRNA expression maps, the ques- 
tions which arise are why technically complex proteome 
projects should be undertaken and what specific types of 
information, could be expected from proteome projects 
which cannot be obtained from genomic and transcript 
profiling projects. We see three main reasons for pro- 
teome analysis to become an essential component in the 
comprehensive analysis of biological systems, (i) Protein 
expression levels are not predictable from the mRNA 
expression levels, (ii) proteins are dynamically modified 
and processed in ways which are not necessarily 
apparent from the gene sequence, and (iii) proteomes 
are dynamic and reflect the state of a biological system. 



2.1 Correlation between mRNA and protein expression 
levels 

Interpretations of quantitative mRNA expression profiles 
frequently implicitly or explicitly assume that for specific 
genes the transcript levels are indicative of the levels of 
protein expression. As part of an ongoing study in our 
laboratory, we have determined the correlation of expres- 
sion at the mRNA and protein levels for a population of 
selected genes in the yeast Saccharomyces cerevisiae 
growing at mid-log phase (S. R Gygi et aL, submitted for 
publication). mRNA expression levels were calculated, 
from published SAGE frequency tables [22]. Protein 
expression levels were quantified by metabolic radiola- 
beling of the yeast proteins, liquid scintillation counting 
of the protein spots separated by high resolution 2-DE 
and mass spectrometry identification of the protein(s) 
migrating to each spot. The selected 80 samples consti- 
tute a relatively homogeneous group with respect to pre- 
dicted half-life and expression level of the protein pro- 
ducts. Thus far, we have found a general trend but no 
strong correlation between protein and transcript levels 
(Fig. 1). For some genes studied equivalent mRNA trans- 
cript levels translated into protein abundances which 
varied by more than 50-fold. Similarly, equivalent steady- 
state protein expression levels were maintained by trans- 
cript levels varying by as much as 40-fold (S. P. Gygi 
et al. 7 submitted). These results suggests that even for a 
population of genes predicted to be relatively homoge- 
neous with respect to protein half-life and gene expres- 
sion, the protein levels cannot be accurately predicted 
from the level of the corresponding mRNA transcript. 

2.2 Proteins are dynamically modified and processed 

In the mature, biologically active form many proteins are 
post-translationally modified by glycosylation, phosphor- 
ylation, prenylation, acylation, ubiquitination or one or 
more of many other modifications [23] and many pro- 
teins are only functional if specifically associated or conv 
plexed with other molecules, including DNA, RNA, pro- 
teins and organic and inorganic cofactors. Frequently, 
modifications are dynamic and reversible and may alter 
the precise three-dimensional structure and the state of 
activity of a protein. Collectively, the state of modifica- 
tion of the proteins which constitute a biological system 
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Figure I. Correlation between mRNA and protein levels in yeast cells. 
For a selected population of 80 genes, protein levels were measured 
by, 35 -S-radiolabeling and mRNA levels were calculated from publi- 
shed SAGE tables. Inset: expanded view of the low abundance region. 
For more experimental details, also see Figs. 5 and 6, (S. P. Gygi et at., 
submitted). 



are important indicators for the state of the system. The 
type of protein modification and the sites modified at a 
specific cellular state can usually not be determined 
from the gene sequence alone. 



2.3 Proteomes are dynamic and reflect the state of a 
biological system 

A single genome can give rise to many qualitatively and 
quantitatively different proteomes. Specific stages of the 
cell cycle and states of differentiation, responses to 
growth and nutrient conditions; temperature and stress, 
and pathological conditions represent cellular states 
which are characterized by significantly different pro- 
teomes. The proteome, in principle, also reflects events 
that are under translational and post-translational con- 
trol. It is therefore expected that proteomics will be able 
to provide the most precise and detailed molecular des- 
cription of the state of a cell or tissue, provided that the 
external conditions defining the state, are carefully deter- 
mined. In answer to the question of whether the study 
of proteomes is necessary for the analysis of bio molec- 
ular systems, it is evident that the analysis of mature pro- 
tein products in cells is essential as there are numerous 
levels of control of protein synthesis* degradation, 
processing and modification, which are only apparent by 
direct protein analysis. 



3 Description and assessment of current proteome 
analysis technology 

3.1 Technical requirements of proteome technology 

In biological systems the level of expression as well as 
the states of modification, processing and macro-molec- 
ular association of proteins are controlled and modu- 
lated depending on the state of the system. Comprehen- 
sive analysis of the identity, quantity and state of modifi- 
cation of proteins therefore requires the detection and 
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quantitation of the proteins which constitute the system 
and analysis of differentially processed forms. There are 
a number of inherent difficulties in protein analysis 
which complicate these tasks. First, proteins cannot' be 
amplified. It is possible to produce large amounts of a 
particular protein by over-expression in specific cell sys- 
tems. However, since many, proteins are dynamically 
post-translationally modified, they cannot be easily am- 
plified in the form in which they finally function in the 
biological system. It is frequently difficult to purify from 
the native source sufficient amounts of a protein for 
analysis. From a technological point of view this trans- 
lates into the need for high sensitivity analytical tech- 
niques. Second, many proteins are modified and pro- 
cessed post-translationally. Therefore, in addition to the 
protein identity, the structural basis for differentially 
modified isoforms also needs to be determined. The dis- 
tribution of a constant amount of protein over several 
differentially modified isoforms further reduces the 
amount of each species available for analysis. The com- 
plexity and dynamics , of post-transtationai protein edit- 
ing thus significantly complicates proteome studies 
Third, proteins vary dramatically with respect to their 
solubility in commonly used solvents. There are few if 
any, solvent conditions in which all proteins are soluble 
and which are also compatible with protein analysis This 
makes the development of protein purification methods 
particularly difficult since both protein purification and 
solubility have to be achieved under the same condi- 
tions. Detergents, in particular sodium dodecyl sulfate 
(SDS), are frequently added to aqueous solvents to 
maintain protein solubility. The compatibility with SDS 
is a big advantage of SDS polyacrylamide gel electro- 
phoresis (SDS-PAGE) over other protein separation 
techniques. Thus, SDS-PAGE and two-dimensional gel 
electrophoresis, which also uses SDS and other. deter- 
gents, are the most general and preferred methods for 
the purification of small amounts of proteins, provided 
that activity does not necessarily need to be maintained 
Lastly, the number of proteins in a given celt system is 
typically in the thousands. Any attempt to identify and 
categorize all of these must use methods which are as 
rapid as possible to allow completion of the project 
within a reasonable time frame. Therefore, a successful 
general proteomics technology requires high sensitivity 
high throughput, the ability to differentiate differentially 
modified proteins, and the ability to quantitatively dis- 
play and analyze all the proteins present in a sample 
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samples. The separation of thousands of proteins has 
been achieved in a single gel [24, 25] and differentially 
modified proteins are frequently separated. Due to the 
compatibility of 2-DE with high concentrations of deter- 
gents, protein denaturants and other additives promoting 
protein solubility, the technique is widely used. 

The second step of this type of proteome analysis is the 
identification and analysis of separated proteins. Individ- 
ual proteins from polyacrylamide gels have traditionally 
been identified using ^/-terminal sequencing (26 27] 
internal peptide sequencing [28, 29], immunoblotting or 
emigration with known proteins [30]. The recent dra- 
matic growth of large-scale genomic and expressed 
sequence tag (EST) sequence databases has resulted in a 
fundamental change in the way proteins are identified by 
their amino acid sequence. Rather than by the traditional 
methods described above, protein sequences are now fre- 
quently determined by correlating mass spectral or 
tandem mass spectral data of peptides derived from pro- 
teins, with the information contained in sequence data- 
bases [31-33]. 

There are a number of alternative approaches to pro- 
teome analysis currently under development. There is 
considerable interest in developing a proteome analysis 
stragegy which bypasses 2-DE altogether, because it is 
considered a relatively slow and tedious process and 
because of perceived difficulties in extracting proteins 
from the gel matrix for analysis. However, 2-DE as a 
starting point for proteome analysis has many advan- 
tages compared to other techniques available today The 
most significant strengths of the 2-DE-MS approach 
include the relatively uniform behavior of proteins in 
gels, the ability to quantify spots and the high resolution 
and simultaneous display of hundreds to thousands of 
proteins within a reasonable time frame 



3.2 2-D electrophoresis - mass spectrometry: a common 
implementation of proteome analysis 

The most common currently used implementation of 
proteome analysis technology is based on the separation 
ol proteins by two-dimensional (IEF/SDS-PAGE) eel 
electrophoresis and their subsequent identification and 
analysis by mass spectrometry (MS) or tandem mass 
spectrometry (MS/MS). In 2-DE, proteins are first separ- 

p A e oo y ,S °u e!eCtriC f0CUsing (IEp ) and ^en by SDS- 
PAOb, in the second, perpendicular dimension. Separ- 
ated proteins are visualized at high sensitivity by staining 
or autoradiography, producing two-dimensional arrays of 
proteins. 2-DE gels are, at present, the most commonly 
used means of global display of proteins in complex 



A schematic diagram of a typical procedure of the identi- 
fication of gel-separated proteins is shown in Fig 2 Pro- 
tein spots detected in the gel are enzymatically or chemi- 
cally fragmented and the peptide fragments are isolated 
tor analysis, as already indicated, most frequently by MS 
or MS/MS. There are numerous protocols for the gener- 
ation of peptide fragments from gel-separated proteins 
Ihey can be grouped into two categories, digestion in 
the gel slice (28, 34] or digestion after electrotransfer out 
ot the gel onto a suitable membrane ([29 35-37] and 
reviewed in [38]). In most instances either technique is 

,w c le and yieIds g00d results - ^ analysis of MS or 
MS/MS data is an important step in the whole process 
because MS instruments can generate an enormous 
amount of information which cannot easily be managed 
manually. Recently, a number of groups have developed 
software systems dedicated to the use of peptide MS 
and MS/MS spectra for the identification of proteins 
Proteins are identified, by correlating the information 
contained in the MS spectra of protein digests or 
MS/MS spectra of individual peptides with data con- 
tamed in DNA or protein sequence databases. 

The systems we are currently using in our laboratory are 
based on the separation of the peptides contained in pro- 
tein digests by narrow bore or capillary liquid chromatog- 
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Figure 2. Schematic diagram of a procedure for identification of gel- 
scparated proteins. Peptides can either be separated by a technique 
such as LC or CE, or infused as a mixture and sorted in the MS. Data- 
base searching can either be performed on peptide masses from an 
MS spectrum, peptide fragment masses from CID spectra of peptides, 
or a combination of both. 



raphy [39, 40] or capillary electrophoresis [41], the anal- 
ysis of the separated peptides by electrospray ioniza- 
tion (ESI) MS/MS, and the correlation of the generated 
peptide spectra with sequence databases using the 
SEQUEST program developed at the University of Wash- 
ington [32, 33]. The system automatically performs the 
following operations: a particular peptide ion character- 
ized by its mass-to-charge ratio is selected in the MS out 
of all the peptide ions present in the system at a parti- 
cular time; the selected peptide ion is collided in a colli- 
sion cell with argon (collision-induced dissociation, 
CID) and the masses of the resulting fragment ions are 
determined in the second sector of the tandem MS; this 
experimentally determined CID spectrum is then corre- 
lated with the CID spectra predicted from all the pep- 
tides in a sequence database which have essentially the 
same mass as the peptide selected for CID; this correla- 
tion matches the isolated peptide with a sequence seg- 
ment in a database and thus identifies the protein from 
which the peptide was derived. There are a number of 
alternative programs which use peptide CID spectra for 
protein identification, but we use the SEQUEST system 
because it is currently the most highly automated pro- 
gram and has proven to be successful, versatile and 
. robust. 



3.3 Protein identification by LC-MS/MS, capillary 
LC-MS/MS and CE-MS/MS 

jt has been demonstrated repeatedly that MS has a very 
nigh intrinsic sensitivity. For the routine analysis of gel- 
separated proteins at high sensitivity, the most signif- 
icant challenge is the handling of small amounts of 
sample. The crux of the problem is the extraction and 
transferal of peptide mixtures generated by the digestion 
oT Iow nanogram amounts of protein, from gels into the 
MS/MS system without significant loss of sample or 
introduction of unwanted contaminants. We employ 
three different systems for introducing gel-purified sam- 
ples into an MS, depending on the level of sensitivity 



required. As an approximate guideline, for samples con- 
taining tens of picomoles of peptides, LC-MS/MS is 
most appropriate; for samples containing low picomole 
amounts to high femtomole amounts we use capillary 
LC-MS/MS; and for samples containing femtomoles or 
less, CE-MS/MS is the method of choice. 

3.3.1 LC-MS/MS 

The coupling of an MS to an HPLC system using a 
0.5 mm diameter or bigger reverse phase (RP) column 
has been described in detail [42]. This system has several 
advantages if a large number of samples are to be ana- 
lyzed and all are available in sufficient quantity. The 
LC-MS and database searching program can be run in a 
fully automated mode using an autosampler, thus maxi- 
mizing sample throughput and minimizing the need for 
operator interference. The relatively large , column is 
tolerant of high levels of impurities from either gel prep- 
aration or sample matrix. Lastly, if configured with a 
flow-splitter and micro-sprayer [40], analyses can be per- 
formed on a small fraction of the sample (less than 5%) 
while the remainder of the sample is recovered in very 
pure solvents. This latter feature is particularly useful 
when an orthogonal technique is also used to analyze 
peptide fractions, such as scintillation of an introduced 
radiolabel, and this data can be correlated with peptides 
identified by CID spectra. 

3.3.2 Capillary LC-MS 

An increase of sensitivity of approximately tenfold can be 
achieved by using a capillary LC system with a 100 urn ID 
column rather than a 0.5 mm ID column as referred to 
above. Since very low flow rates are required for such 
columns, most reports have used a precolumn flow split- 
ting system for producing solvent gradients. We have 
recently desribed the design and construction of a novel 
gradient mixing system which enables the formation 
of reproducible gradients at very low flow rates (low 
nL/min) without the need for flow splitting (A. Ducret 
et.at., submitted for publication). Using this capillary 
LC-MS/MS system we were able to identify gel-separat- 
ed proteins if low picomole to high femtomole amounts 
were loaded onto the gel [40]. This system is as yet not 
automated and, like all capillary LC systems, is prone to 
blockage of the columns by microparticulates when ana- 
lyzing gel-separated proteins. 

3.3.3 CE-MS/MS 

The highest level of sensitivity for analyzing gel-sep- 
arated proteins can be achieved by using capillary elec- 
trophoresis - mass spectrometry (CE-MS). We have de- 
scribed in the past a solid-phase extraction capillary elec- 
trophoresis (SPE-CE) system which was used with triple 
quadrupole and ton trap ESI-MS/MS systems for the 
identification of proteins at the low femtomole to sub- 
femtomole sensitivity level [43, 44]. While this system is 
highly sensitive, its operation is labor-intensive and its 
operation has hot been automated. In order to devise an 
analytical system with both the sensitivity of a CE and 
the level of automation of LC, we have constructed 
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Figure 3. Schematic illustration of a 
microfabricated analytical system for CE, 
consisting of a micromachined device, 
coaled capillary electroosmolic pump, 
and microelectrospray interface. The 
dimensions of the channels and reservoir 
are as indicated in the text. The channels 
on the device were graphically enhanced 
to make them more visible. Reproduced 
from [45|, with permission. 



microfabricated devices for the introduction of samples 
into ESI-MS for high-sensitivity peptide analysis. 

The basic device is a piece of glass into which channels 
of 10-30 urn in depth and 50-70 urn in diameter are 
etched by using photolithography/etching techniques 
similar to the ones used in the semiconductor industry. 
(A simple device is shown in Fig. 3). The channels are 
connected to an external high voltage power supply [45]. 
Samples are manipulated on the device and off the 
device to the MS by applying different potentials to the 
reservoirs. This creates a solvent flow by electroosmotic 
pumping which can be redirected by changing the posi- 
tion of the electrode. Therefore, without the need for 
valves or gates and without any external pumping, the 
flow can be redirected by simply switching the position 
of the electrodes on the device. The direction and rate of 
the flow can be modulated by the size and the polarity 
of the electric field applied and also by the charge state 
of the surface. . 

The type of data generated by the system is illustrated in 
Fig. 4, which shows the mass spectrum of a peptide sample 
representing the tryptic digest of carbonic anhydrase at 
290 fmoi/uL. Each numbered peak indicates a peptide suc- 
cessfully identified as being derived from carbonic an- 



hydrase. Some of the unassigned signals may be chemical 
or peptide contaminants. The MS is programmed to auto- 
matically select each peak and subject the peptide to CID. 
The resulting CID spectra are then used to identify the 
protein by correlation with sequence databases. Therefore, 
this system allows us to concurrently apply a number of 
protein digests onto the device, to sequentially mobilize 
the samples, to automatically generate CID spectra of 
selected peptide ions and to search sequence databases, 
for protein identification. These steps are performed auto- 
matically without the need for user input and proteins can 
be identified at very low femtomole level sensitivity at a 
rate of approximately one protein per 15 min. 

3.4 Assessment of 2-DE-MS proteome technology 

Using a combination of the analytical techniques de- 
scribed above we have identified the 80 protein spots 
indicated in Fig. 5. The protein pattern was generated by 
separating a total of 40 microgram of protein contained 
in a total cell lysate of the yeast strain YPH499 by high 
resolution 2-DE and silver staining of the separated pro- 
teins. To estimate how far this type of proteome analysis 
can penetrate towards the identification of low abun- 
dance proteins, we have calculated the codon bias of the 
genes encoding the respective proteins. Codon bias is a 
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Figure 4. MS spectrum of a tryptic digest 
of carbonic anhydrase using the microfa- 
bricated system shown in Fig. 3. 290 
fmol/uL of carbonic anhydrase tryptic 
digest was infused into a Finnigan LCQ 
ion trap MS. Each peak was selected for 
CID, and those which were identified as 
containing peptides derived from car- 
bonic anhydrase are numbered. Repro- 
duced from [451, with permission. 
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hxttre 5. 2-DE separation of a lysate of yeast cells, with identified proteins highlighted. The first dimension of separation was an IPG from 
pll 3-10. and the second dimension was a 10%T SDS-PAGE gel. Proteins were visualized by silver staining. Further details of experimental 
procedures are included in S. P. Gygi et al. (submitted). 



more abundant proteins. Since many important regula- 
tory proteins are present only at low abundance, these 
would not be amenable to analysis using such tech- 
niques. This situation would be exacerbated in the anal- 
ysis of proteomes containing many more proteins than 
the approximately 6000 gene products present in yeast 
cells [16]. In the analysis of, for example, the proteome 
of any human cells, there are potentially 50000-100000 
gene products [47]. Inherent limitations on the amount 
of protein that can be loaded on 2-DE, and the number 
of components that can be resolved, indicate that only 
the most highly abundant fraction of the many gene 
products could be successfully analyzed. One approach 
that has been employed to circumvent these limitations 
is the use of very narrow range immobilized pH gradient 
strips for the first-dimension separation of 2-DE [48]. 
Since only those proteins which focus within the narrow 
range will enter the second dimension of separation, a 
much higher Sample loading within the desired range is 
possible. This, in turn, can lead to the visualization and 
identification of less abundant proteins. 



calculated measure of the degree of redundancy of trip- 
let DNA codons used to produce each amino acid in a 
particular gene sequence. It has been shown to be a 
useful indicator of the level of the protein product of a 
particular gene sequence present in a cell [46]. The gen- 
eral rule which applies is that the higher the value of the 
codon bias calculated for a gene, the more abundant the 
protein product of that gene becomes. The calculated 
codon bias values corresponding to the proteins identi- 
fied in Fig. 5 are shown in Fig. 6b. Nearly all of the pro- 
teins identified (> 95%) have codon bias values of > 0.2, 
indicating they are highly abundant in ceils. In contrast; 
eodon bias values calculated for the entire yeast genome 
(F| g- 6a) show that the majority of proteins present in 
[lie proteome have a codon bias of < 0.2 and are thus of 
low abundance. 

This finding is of considerable importance in our assess- 
ment of the current status of proteome analysis technol- 
ogy. It is clear that even using highly sensitive analytical 
techniques, we are only able to visualize and identify the 
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Figure 6. Calculated codon bias values for yeast proteins. (A) Distribu- 
tion of calculated values for the entire yeast proteome. (B) Distribu- 
tion of calculated values for the subset of 80 identified proteins also 
shown in Figs. 1 and 5. Further details of experimental procedures are 
included in S. P. Gygi et at. (submitted). 

4 Utility of proteome analysis for biological 
research 

For the success of proteomics as a mainstream approach 
to the analysis of biological systems it is essential to 
define how proteome analysis and biological research 
projects intersect. Without a clear plan for the implemen- 
tation of proteome-type approaches into biological re- 
search projects the full impact of the technology can not 
be realized. The literature indicates that proteome anal- 
ysis is used both as a database/data archive, and as a bio- 
logical assay or biological research tool. 

4.1 The proteome as a database 

The use of proteomics as a database or data archive 
essentially entails an attempt to identify all the proteins 
in a cell or species and to annotate each protein with the 
known biological information that is relevant for each 
protein. The level of annotation can, of course, be exten- 
sive. The most common implementation of this idea is 
the separation of proteins by high resolution 2-DE, the 
identification of each detected protein spot and the 
annotation of the protein spots in a 2-DE gel database 
format. This approach is complicated by the fact that it is 
difficult to precisely define a proteome and to decide 
which proteome should be represented in the database. 
In contrast to the genome of a species, which is essen- 
tially static, the proteome is highly dynamic. Processes 
such as differentiation, cell activation and disease can all 
significantly change the proteome of a species. This is 
illustrated in Fig. 7. The figure shows two high-resolu- 



tion 2-DE maps of proteins isolated from rat serum. V 
Fig. 7A is from the serum of normal rats, while Fig. 7B 
is from the serum of rats in acute-phase serum after 
prior treatment with an inflammation-causing agent [49]. 
It is obvious that the. protein patterns are significantly 
different in several areas, raising the question of exactly 
which proteome is being described. 

Therefore, a comprehensive proteome database of a spe- 
cies or cell type needs to contain all of the parameters 
which describe the state and the type of the cells from 
which the proteins were extracted as well as the software 
tools to search the database with queries which reflect 
the dynamics of biological systems. A comprehensive 
proteome database should be capable of quantitatively 
describing the fate of each protein if specific systems 
and pathways are activated in the cell. Specifically, the 
quantity, the degree of modification, the subcellular loca- 
tion and the nature of molecules specifically interacting 
with a protein as well as the rate of change of these 
variables should be described. Using these admittedly 
stringent criteria, there is currently no comlete proteome 
database. A number of such databases are, however, in 
the process of being constructed. The most advanced 
among them, in our opinion, are the yeast protein data- 
base YPD [50] (accessible at http://www.ypd.com) and 
the human 2D-PAGE databases of the Danish Centre 
for Human Genome Research [12] (accessible at http:// 
biobase.dk/cgi-bin/celis). While neither can be con- 
sidered complete as not all of the potential gene pro- 
ducts are identified, both contain extensive annotation 
of supplemental information for many of the spots 
which are positively identified in reference samples. 

4.2 The proteome as a biological assay 

The use of proteome analysis as a biological assay or 
research tool represents an alternative approach to inte- 
grating biology with proteomics. To investigate the state 
of a system, samples are subjected to a specific proceess 
that allows the quantitative or qualitative measurement 
of some of the variables which describe the system. In 
typical biochemical assays one variable (e.g., enzyme 
activity) of a single component (e.g., a particular en- 
zyme) is measured. Using proteomics as an assay, mul- 
tiple variables (e.g., expression level, rate of synthesis, 
phosphorylation state, etc.) are measured concurrently 
on many (ideally all) of the proteins in a sample. The 
use of proteomics as an assay is a less far-reaching prop- 
osition than the construction of a comprehensive pro- 
teome database. It does, however, represent a pragmatic 
approach which can be adapted to investigate specific 
systems and pathways, as long as the interpretation of 
the results takes into account that with current technol- 
ogy not all of the variables which describe the system 
can be observed (see Section 3.4). 

A common implementation of proteome analysis as a 
biological assay is when a 2-DE protein pattern gener- 
ated from the analysis of an experimental sample is 
compared to an array of reference patterns representing 
different states of the system under investigation. The 
state of the experimental system at the time the sample 
was generated is therefore determined by the quantita- 
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live comparative analysis of hundreds to a few thousand 
proteins. Comparative analysis oF the 2-DE patterns fur- 
thermore highlights quantitative and qualitative differ- 
ences in the protein profiles which correlate with the 
state of the system. For this type of analysis it is not 
essential that all the proteins are identified or even visu- 



alized, although the results become more informative as 
more proteins are compared. It is obvious, however, that 
the possibility to identify any protein deemed character- 
istic for a particular state dramatically enhances this 
approach by opening up new avenues for experimenta- 
tion. 





Figure 7. High resolution 2-DE map of proteins isolated from rat serum with or without prior exposure to an inflam- 
mation-causing agent. (A) normal rat serum, (B) acute-phase serum from rats which had previously been exposed to 
an inflammation-causing agent. The first dimension of separation is an IPG from pH 4-10, and the second dimen- 
sion is a 7.5-1 7.5%T gradient SDS-PAGE gel. Proteins were visualized by staining with' amido black. Further details 
of experimental procedures are included in [14, 49). 
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Proteome analysts as a biological assay has been success- 
fully used in the field of toxicology, to characterize 
disease states or to study differential activation of cells. 
The approach is limited, of course, by the fact that only 
the visible protein spots are included in the assay, and it 
is well known that a substantial but far from complete 
fraction of cellular proteins are detected if a total cell 
lysate is separated by 2-DE. Proteins may not be 
detected in 2-DE gels because they are not abundant 
enough to be visualized by the detection method used, 
because they do not migrate within the boundaries (size, 
p/) resolved by the gel, because they are not soluble, 
under the conditions used, or for other reasons. . 

A different way to use proteome analysis as a biological 
assay to define the state of a biological system is to take 
advantage of the wealth of information contained in 
2-DE protein patterns. 2-DE is referred to as two-dimen- 
sional because of the electrophoretic mobility and the 
isoelectric points which define the position of each pro- 
tein in a 2-DE pattern. In addition to the two dimen- 
sions used to generate the protein patterns, a number of 
additional data dimensions are contained in the protein 
patterns. Some of these dimensions such as protein 
expression level, phosphorylation state, subcellular loca- 
tion, association with other proteins, rate of synthesis or 
degradation indicate the activity state of a protein or a 
biological system. Comparative analysis of 2-DE protein 
patterns representing different states is therefore ideally 
suited for the detection, identification and analysis of 
suitable markers. Once again it must be emphasized that 
in this type of experiment only a fraction of the cellular 
proteins is analyzed. Since many regulatory proteins are 
of low abundance, this limitation is a concern, particu- 
larly in cases in which regulatory pathways are being 
investigated. 

5 Concluding remarks 

In this report we have addressed three main issues 
related to proteome analysis. First, we have discussed 
the rationale for studying proteomes. Second, we have 
assessed the technical feasibility of analyzing proteomes 
and described current proteome technology, and third, 
we have analyzed the utility of proteome analysis for bio- 
logical research. It is apparent that proteome analysis is 
an essential tool in the analysis of biological systems. 
The multi-level control of protein synthesis and degrada- 
tion in cells means that only the direct analysis of 
mature protein products can reveal their correct identi- 
ties, their relevant state of modification and/or associa- 
tion and their amounts. Recently developed methods 
have enabled the identification of proteins at ever- 
increasing sensitivity levels and at a high level of auto- 
mation of the analytical processes. A number of tech- 
nical challenges, however, remain. While it is currently 
possible to identify essentially any protein spots that can 
be visualized by common staining methods, it is ap- 
parent that without prior enrichment only a relatively 
small and highly selected population of long-lived, 
highly expressed proteins is observed. There are many 
more proteins in a given cell which are not visualized by 
such methods. Frequently it is the low abundance pro- 
teins that 1 execute key regulatory functions. 



We have outlined the two principal ways proteome anal- 
ysis is currently being used to intersect with biological 
research projects: the proteome as a database or data 
archive and proteome analysis as a biological assay. Both 
approaches have in common that at present they are con- 
ceptually and technically limited. Current proteome data- 
bases typically are limited to one cell type and one state 
of a cell and therefore do not account for the dynamics 
of biological systems. The use of proteome analysis as a 
biological assay can provide a wealth of information, but 
it is limited to the proteins detected and is therefore not 
truly proteome-wide. These limitations in proteomics are 
to a large extent a reflection of the fact that proteins in 
their fully processed form cannot easily be amplified and 
are therefore difficult to isolate in amounts sufficient for 
analysis or experimentation. The fact that to date no 
complete proteome has been described further attests to 
these difficulties. With continued rapid progress in pro- 
tein analysis technology, however, we anticipate that the 
goal of complete proteome analysis will eventually 
become attainable. 
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ABSTRACT 

Generic changes underlie tumor progression and may lead to cancer- 
specific expression of critical genes. Over 1100 publications have de- 
scribed the use of comparative genomic hybridization (CGH) to analyze 
the pattern of copy number alterations in cancer* but very few of the genes 
affected are known. Here, we performed higb-resolation CGH analysis on 
cDNA microarrays in breast cancer and directly compared copy number 
and mRNA expression levels of 13,824 genes to quantitate the impact of 
genomic changes on gene expression. We identified and mapped the 
boundaries of 24 independent ampjicons, ranging in size from 0.2 to 12 
Mb. Throughout the genome, both high- and low-level copy number 
changes had a substantial impact on gene expression, with 44% of the 
highly amplified genes showing overcxpression and 10*5% of the highly 
overexpressed genes being amplified. Statistical analysis with random 
permutation tests identified 270 genes whose expression levels across 14 
samples were systematically attributable to gene amplification. These 
included most previously described amplified genes in breast cancer and 
many novel targets for genomic alterations, including the HOXB7 gene, 
the presence of which in a novel amplicon at 17q21*3 was validated In 
10.2% of primary breast cancers and associated with poor patient prog- 
nosis. In conclusion, CGH on cDNA microarrays revealed hundreds of 
novel genes whose overcxpression is attributable to gene amplification. 
These genes may provide insights to the clonal evolution and progression 
of breast cancer and highlight promising therapeutic targets. 

INTRODUCTION 

Gene expression patterns revealed by cDNA microarrays have 
facilitated classification of cancers into biologically distinct catego- 
ries, some of which may explain the clinical behavior of the tumors 
(1-6). Despite this progress in diagnostic classification, the molecular 
mechanisms underlying gene expression patterns in cancer have re- 
mained elusive, and the utility of gene expression profiling in the 
identification of specific therapeutic targets remains limited. 

Accumulation of genetic defects is thought to underlie the clonal 
evolution of cancer. Identification of the genes that mediate the effects 
of genetic changes may be important by highlighting transcripts that 
are actively involved in tumor progression. Such transcripts and their 
encoded proteins would be idea! targets for anticancer therapies, as 
demonstrated by the clinical success of new therapies against ampli- 
fied oncogenes, such as ERBB2 and EGFR (7, 8), in breast cancer and 
other solid rumors. Besides amplifications of known oncogenes, over 
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Fig. 1. Impact of gene copy number on global gene expression levels. A. percentage of 
over- and underexprcsscd genes (Y axis) according to copy number ratios (X axis). 
Threshold values used for over- and undcrexpression were >2.184 (global upper 7% of 
the cDNA ratios) and <0.4826 (global lower 7% of the expression ratios). B, percentage 
of amplified and deleted genes according to expression ratios. Threshold values for 
amplification and deletion were >1.5 and <0.7. 



20 recurrent regions of DNA amplification have been mapped in 
breast cancer by CGH 5 (9, 10). However, these amplicons are often 
large and poorly defined, and their impact on gene expression remains 
unknown. 

We hypothesized that genome-wide identification of those gene 
expression changes that are attributable to underlying gene copy 
number alterations would highlight transcripts that are actively in- 
volved in the causation or maintenance of the malignant phenotype. 
To identify such transcripts, we applied a combination of cDNA and 
CGH microarrays to; (a) 'determine the global impact that gene copy 
number variation plays in breast cancer development and progression; 
and {b) identify and characterize those genes whose mRNA expres- 



5 The abbreviations used are: CGH, comparative genomic hybridization; FISH, fluo- 
rescence in situ hybridization; RT-PCR. reverse transcription-PC R. 
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Fig. 2. Genome -wide copy number and expression analysis in the MCF-7 breast cancer cell line. A. chromosomal CGH analysis of MCF-7. The copy number ratio profile (blue 
line) across the entire genome from lp telomere to Xq telomere is shown along with it I SD (orange lines). The black horizontal line indicates a ratio of 1.0; red line, a ratio of 0.8; 
and green line, a ratio of 1.2. B~C genome-wide copy number analysis in MCF-7 by CGH on cDNA microarray. The copy number ratios were plotted as a function of the position 
of the cDNA clones along the human genome. In B, individual data points are connected with a line, and a moving median of 10 adjacent clones is shown. Red horizontal line, the 
copy number ratio of 1 .0. In C, individual data points arc labeled by color coding. according to cDN A expression ratios. The bright red dots indicate the upper 2%, and dark red dots, 
the next 5% of the expression ratios in MCF-7 cells (overexpresscd genes); bright green dots indicate the lowtet 2%, and dark green dots, the riexl 5% of the expression ratio* 
(undercxpicssed genes); the rest of the observations are shown with black crosses. The chromosome numbers arc shown at the bottom of the figure, and chromosome boundaries arc 
indicated with a dashed tine. 



sion is most significantly associated with amplification of the corre- 
sponding genomic template. 

MATERIALS AND NJETHODS 

Breast Cancer Cell Lines. Fourteen breast cancer cell lines (BT-20, BT- 
474, HCC1428, HS5781, MCF7, MDA-361, MDA-436, MDA-453. MDA-468, 
SKBR-3, T-47D, UACC812, ZR-75-1, and ZR-75-30) were obtained from the 
American Type Culture Collection (Manassas, VA). Cells were grown under 
recommended culture conditions. Genomic DNA and mRNA were isolated 
using standard protocols, 

Copy Number and Expression Analyses by cDNA Micro arrays, The 
preparation and printing of the 13,824 cDNA clones on glass slides were 
performed as described (1 1-13). Of these clones, 244 represented uncharac- 
tcrized expressed sequence tags, and the remainder corresponded to known 
genes. CGH experiments on cDNA microarrays were done as described (14, 
1 5). Briefly, 20 p.g of genomic DNA from breast cancer cell lines and normal 
human WBCs were digested for 14-18 h with Alu\ and Rsa\ (Life Technol- 
ogies, Inc., Rockville, MD) and purified by phenol/chloroform extraction. Six 
\*% of digested cell line DNAs were labeled with Cy3-dUTP (Amershani 
Pharmacia) and normal DNA with Cy5-dUTP (Amcrsham Pharmacia) using 
the Bioprime Labeling kit (Life Technologies, Inc.). Hybridization (14, 15) and 
posthybridization washes (13) were done as described. For the expression 
analyses, a standard reference (Universal Human Reference RNA; Stratagene, 
La Jolla, CA) was used in all experiments. Forty jig of reference RNA were 
labeled with Cy3-dUTP and 3.5 ixg of test mRNA with Cy5-dUTP, and the 
labeled cDNAs were hybridized on micToarrays as described (1 3, 1 5). For both 
microarray analyses, a laser con focal scanner (Agilent Technologies, Palo 
Alto, CA) was used to measure the fluorescence intensities at the target 
locations using the DE ARRAY software (16). After background subtraction, 
average intensities at each clone in the test hybridization were divided by the 
average intensity of the corresponding clone in the control hybridization. For 
the copy number analysis, the ratios were normalized on the basis of the 
distribution of ratios of all targets on the array and for the expression analysis 
on the basis of 88 housekeeping genes, which were spotted four times onto the 
array. Low quab'ty measurements {i.e., copy number data with mean reference 
intensity <I0O fluorescent units, and expression data with both test and 
reference intensity <100 fluorescent units and'or with spot size <50 units) 



were excluded from the analysis and were treated as missing values, The 
distributions of fluorescence ratios were used to define curpoints for increased/ 
decreased copy number. Genes with CGH ratio > 1 .43 (representing the upper 
5% of the CGH ratios across all experiments) were considered to be amplified, 
and genes with ratio <0.73 (representing the lower 5%) were considered to be 
deleted. 

Statistical Analysis of CGH and cDNA Microarray Data. To evaluate 
the influence of copy number alterations on gene expression, we applied the 
following statistical approach. CGH and.cDNA calibrated intensity ratios were 
log-transformed and normalized using median centering of the values in each 
cell line. Furthermore, cDNA ratios for each gene across all 14 cell lines were 
median centered. For each gene, the CGH data were represented by a vector 
that was labeled 1 for amplification (ratio, > 1 .43) and 0 for no amplification. 
Amplification was correlated with gene expression using the signal-to- noise 
statistics (I). We calculated a weight, w g% for each gene as follows: 

M ' 8 " " 8 i + o-go 

where m gU cr ffl and m^, 0*^ denote the means and SDs for the expression 
levels for amplified and nonamplified cell lines, respectively. To assess the 
statistical significance of each weight, we r^rfornted 10,000 random permu- 
tations of the label vector. The probability that a gene had a larger or equal 
weight by random permutation than the original weight was denoted by a. A 
low a (<0.05) indicates a strong association between gene expression and 
amplification. 

Genomic Localization of cDNA Clones and Amplicon Mapping. Each 
cDNA clone on the microarray was assigned to a Unigcnc cluster using the 
Unigcne Build 141. 6 A database of genomic sequence alignment information 
for mRNA sequences was created from the August 2001 freeze of the Uni- 
versity of California Santa Cruz's GoldcnPath database. 7 The chromosome and 
bp positions for each cDNA clone were then retrieved by relating these data 
sets. Amplicons were defined as a CGH copy number ratio > 2.0 in at least two 
adjacent clones in two or more cell lines or a CGH ratio >2.0 in at least three 
adjacent clones in a single cell line. The amplicon start and end positions were 



* Internet address: hupy/rcscarch.nhgri.n ih.gov/microarray/doM-nloadable_cdna.htmJ. 
7 Internet address: www.gcnomc.ucsc.edu. 
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Tabic I Summary of independent amplicons in 14 breast cancer cell lines by 
CGH microarray 

Location 

1 P 13 
lq21 
Iq22 
3pl4 

7pl2.i-7plU 
7q31 
7q32 

8q2UI-6q2l.I3 
Sq2t.3 

8q23.3-8q24.14 
8q24.22 
9pl3 

I3q22-q3l' 
16q22 
I7qll 

I7ql2-q2l.2 
I7q2l32-q21.33 
17q22-q23.3 
17q23.3-q24J 
19ql3 
20qll-22 
20ql3.12 
20ql3.l2-ql3.l3 
20ql3.2-qU.32 



Ci.fi SUM 


Fnrf 'MM 
cikj \mu) 


Size 'MM 


132.79 


132.94 


0.2 


173.92 


17705 


3.3 


179.28 


179.57 


0.3 


71.94 


74.66 


2.7 


55.62 


60.95 


5.3 


125-73 


130.96 


5.2 


140.01 


140.68 


0.7 


86.45 


92.46 


6.0 


98.45 


103.05 


4.6 


129 88 


142.15 


123 


151.21 


152.16 


1.0 


38.65 


39.25 


0,6 


77.15 


8IJ8 


42 


86.70 


87.62 


0.9 


29.30 


30.85 


1.6 


39.79 


42.80 


3.0 


52.47 


55.80 


33 


63.81 


69.70 


5.9 


69.93 


74.99 


5.1 


40.63 


41.40 


0.8 


34.59 


35.85 


1.3 


44.00 


45.62 


1.6 


46.45 


49.43 


3.0 


51.32 


59.12 


7.8 



CGH were validated, with iq21, 17qlZ— q2l.2, 17q22-q23, 20ql3.1, 
and 20ql3.2 regions being most commonly amplified. Furthermore, 
the boundaries of these amplicons were precisely delineated. In ad- 
dition, novel amplicons were identified at 9pl3 (38.65-39.25 Mb), 
and 17q21.3 (52.47-55.80 Mb). 

Direct Identification of Putative Amplification Target Genes. 
The cDNA/CGH microarray technique enables the direct correla- 
tion of copy number and expression «!ata on a gene-by-gene basis 
throughout the genome. We directly annotated high-resolution 
CGH plots with gene expression data using color coding. Fig. 2C 
shows that most of the amplified genes in the MCF-7 breast cancer 
cell line at lp!3, I7q22-q23, and 20ql3 were highly overex- 
pressed. A view of chromosome 7 in the MDA-468 cell line 
implicates £GFR as the most highly overexpressed and amplified 
gene at 7pl l-pl2 (Fig. 3^4). In BT-4 74, the two known amplicons 
at I7ql2 and 17q22-q23 contained numerous highly overex- 
pressed genes (Fig. 35). In addition, several genes, including the 
homeobox genes HOXB2 and HOXB 7, were highly amplified in a 
previously undescribed independent amplicon at 17q21.3. HOXB7 
was systematically amplified (as validated by FISH, Fig. 3B, inset) 
as well as overexpressed (as verified by RT-PCR, data not shown) 
in BT-474, UACC812, andZR-75-30 cells. Furthermore, this novel 



extended to include neighboring nonamplificd clones (ratio, <1.5). The am- 
plicon size determination was partially dependent on local clone density. 

FISH. Dual-color interphase FISH to breast cancer cell lines was done as 
described (17). Bacterial artificial chromosome clone RPU-361K8 was la- 
beled with SpectnimOrangc (Vysis, Downers Grove, IL), and Spectrum- 
Orange-labeled probe for EGFR was obtained from Vysis. SpectrumGreen- 
labeled chromosome 7 and 17 centromere probes (Vysis) were used as a 
reference. A tissue microarray containing 612 formalin-fixed, paraffin-embed- 
ded primary breast cancers (17) was applied in FISH analyses as described 
(18). The use of these specimens was approved by the Ethics Committee of the 
University of Basel and by the N1H. Specimens containing a 2-fold or higher 
increase in the number of test probe signals, as compared with corresponding 
centromere signals, in at least 10% of the tumor cells were considered to be 
amplified. Survival analysis was performed using the Kaplan-Meier method 
and the log-rank test. 

RT-PCR. The HOXB7 expression level was determined relative to 
GAPDH. Reverse transcription and PCR amplification were performed using 
Access RT-PCR System (Promega Corp., Madison, Wl) with 10 ng of mRNA 
as a template. HOXB7 primers were 5 '-G AGC AG AGGG ACTCGGACTT-3 ' 
and 5'-GCGTCAGGTAGCGATTGTAG-3'. 

RESULTS 

Global Effect of Copy Number on Gene Expression. 13,824 
arrayed cDNA clones were applied for analysis of gene expression 
and gene copy number (CGH microarrays) in 14 breast cancer cell 
lines. The results illustrate a considerable influence of copy number 
on gene expression patterns. Up to 44% of the highly amplified 
transcripts (CGH ratio, >2.5) were overexpressed (i.e.. belonged to 
the global upper 7% of expression ratios), compared with only 6% for 
genes with normal copy number levels (Fig. 1A). Conversely, 10.5% 
of the transcripts with high-level expression (cDNA ratio, >10) 
showed increased copy number (Fig. \B\ Low-level copy number 
increases and decreases were also associated with similar, although 
less dramatic, outcomes on gene expression (Fig. 1). 

Identification of Distinct Breast Cancer Amplicons. Base-pair 
locations obtained for 1 1,994 cDNAs (86.8%) were used to plot copy 
number changes as a function of genomic position (Fig. 2, Supple- 
ment Fig. A). The average spacing of clones throughout the genome 
was 267 kb. This high-resolution mapping identified 24 independent 
breast cancer amplicons, spanning from 0.2 to 12 Mb of DNA (Table 
1). Several amplification sites detected previously by chromosomal 
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Fig. 3. Annotation of gene expression data on CGH microarray profiles. A, genes in the 
7pl 1 -pi 2 amplicon in the MDA-468 cell line are highly expressed (red dots) and include 
the EGFR oncogene. B. several genes in the 17ql2, 17q21.3, and 17q23 amplicons in the 
BT-474 breast cancer cell line are highly overexpressed {red) and include the HOXB7 
gene. The data labels and color coding arc as indicated' for Fig. 2C. Insels show 
chromosomal CGH profiles for the corresponding chromosomes and validation of the 
increased copy number by interphase FISH using EGFR (red) and chromosome 7 
centromere probe (green) to MDA-468 (A) and rYGXfl7-speciftc probe {red} and chro- 
mosome 1 7 centromere (green) lo BT-474 cells (B). 
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Fig. 4. List of 50 genes with a statistically 
significant correlation (a value <0.05) between 
gene copy number and gene expression. Name, 
chromosomal location, and the a value for each 
gene arc indicated. The genes hove been ordered 
according to their position in the genome. The color 
maps on the right illustrate the copy number and 
expression ratio patterns in the 14 cell lines The 
key to the color code is shown at the bottom of the 
graph. Gray squares, missing values. The complete 
list of 270 genes « shown in supplemental Fig. B. 




amplification was validated to be present in 10.2% of 363 primary 
breast cancers by FISH to a tissue microarray and was associated 
with poor prognosis of the patients (P = 0.001). 

Statistical Identification and Characterization of 270 Highly 
Expressed Genes in Amplicons. Statistical comparison of expres- 
sion levels of all genes as a function of gene amplification identified 
270 genes whose expression was significantly influenced by copy 
number across all 14 cell lines (Fig. 4, Supplemental Fig. B). Accord- 
ing to the gene ontology data® 91 of the 270 genes represented 
hypothetical proteins or genes with no functional annotation, whereas 
179 had associated functional information available. Of these, 151 
(84%) arc implicated in apoptosis, cell proliferation, signal transduc- 
tion, and transcription, whereas 28 (16%) had functional annotations 
that could not be directly linked with cancer. 



DISCUSSION 

The importance of recurrent gene and chromosome copy number 
changes in the development and progression of solid tumors has been 
characterized in > 1000 publications applying CGH 9 (9, 10), as well 
as in a large number of other molecular cytogenetic, cytogenetic, and 
molecular genetic studies. The effects of these somatic genetic 
changes on gene expression levels have remained largely unknown, 
although a few studies have explored gene expression changes occur- 
ring in specific amplicons (15, 19-21). Here, we applied genome- 
wide cDNA microarrays to identify transcripts whose expression 
changes were attributable to underlying gene copy number alterations 
in breast cancer. 

The overall impact of copy number on gene expression patterns was 
substantial with the most dramatic effects seen in the case of high- 



* Internet address: htro://www.geneontology.org/. 



° Internet address: http^/www.ncbi.njm.nih.gov/entrct 
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level copy number increase. Low-level copy number gains and losses 
also had a significant influence on expression levels of genes in the 
regions affected, but these effects were more subtle on a gene-by-gene 
basis than those of high-level amplifications. However, the impact of 
tow-level gains on the dysregulation of gene expression patterns in 
cancer may be equally important if not more important than that of 
high-level amplifications. Aneuploidy and low-level gains and losses 
of chromosomal arms represent the most common types of genetic 
alterations in breast and other cancers and, therefore, have an influ- 
ence on many genes. Our results in breast cancer extend the recent 
studies on the impact of aneuploidy on global gene expression pat- 
terns in yeast cells, acute myeloid leukemia, and a prostate cancer 
model, system (22-24]i. 

The CGH microarray analysis identified 24 independent breast 
cancer amplicons. We defined the precise boundaries for many am- 
plicons detected previously by chromosomal CGH (9, 10, 25, 26) and 
also discovered novel amplicons that had not been detected previ- 
ously, presumably because of their small size (only 1-2 Mb) or close 
proximity to other larger amplicons. One of these novel amplicons 
involved the homeobox gene region at I7q21.3 and led to the over- 
expression of the HOXB7 and HOXB2 genes. The homeodomain 
transcription factors are known to be key regulators of embryonic 
development and have been occasionally reported to undergo aberrant 
expression in cancer (27, 28). HOXB7 transfection induced cell pro- 
liferation in melanoma, breast, and ovarian cancer cells and increased 
turaorigenictty and angiogenesis in breast cancer (29-32). The pres- 
ent results imply that gene amplification may be a prominent mech- 
anism for overexpressing HOXB7 in breast cancer and suggest that 
HOXB7 contributes to tumor progression and confers an aggressive 
disease phenotype in breast cancer. This view is supported by our 
finding of amplification of HOXB7 in 10% of 363 primary breast 
cancers, as well as an association of amplification with poor prognosis 
of the patients. 

We carried out a systematic search to identify genes whose 
expression levels across ail 14 cell lines were attributable to 
amplification status. Statistical analysis revealed 270 such genes 
(representing -2% of all genes on the array), including not only 
previously described amplified genes, such as HER-2, MYC t 
EGFR, ribosqmal protein s6 kinase, and AIB3, but also numerous 
novel genes such as NRAS-related gene (lpl3), syndecan-2 (8q22), 
and bone morphogenic protein (20ql3.1), whose activation by 
amplification may similarly promote breast cancer progression. 
Most of the 270, genes have not been implicated previously in 
breast cancer development and suggest novel pathogenetic mech- 
anisms. Although we would not expect all of them to be causally 
involved, it is intriguing that 84% of the genes with associated 
functional information were implicated in apoptosis, cell prolifer- 
ation, signal transduction, transcription, or other cellular processes 
that could directly imply a possible role in canceT progression. 
Therefore, a detailed characterization of these genes may provide 
biological insights to breast cancer progression and might lead to 
the development of novel therapeutic strategies. 

In summary, we demonstrate application of cDNA microarrays 
to the analysis of both copy number and expression levels of over 
12,000 transcripts throughout the breast cancer genome, roughly 
once every 267 kb. This analysis provided: (a) evidence of a 
prominent global influence of copy number changes on gene 
expression levels; (b) a high-resolution map of 24 independent 
amplicons in breast cancer; and (c) identification of a set of 270 
aeries, the ovcrexpression of which was statistically attributable to 
gene amplification. Characterization of a novel amplicon at 
1 7q2 1.3 implicated amplification and overexpression of the 
HOXB7 gene in breast cancer, including a clinical association 



between HOXB7 amplification and poor patient prognosis. Overall, 
our results illustrate how the identification of genes activated by 
gene amplification provides a powerful approach to highlight 
genes with an important role in cancer as well as to prioritize and 
validate putative targets for therapy development. 
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Contributed by Patrick O. Brown, August 6, 2002 

Genomic DNA copy number alterations are key genetic events in 
the development and progression of human cancers. Here we 
report a genome-wide microarray comparative genomic hybrid- 
ization {array C6H) analysis off DMA copy number variation in 
a series of primary human breast tumors. We have profiled DMA 
copy number alteration across 6,691 mapped human genes, in 44 
predominantly advanced, primary breast tumors and 10 breast 
cancer cell lines. While the overall patterns of DNA amplification 
and deletion corroborate previous cytogenetic studies, the high- 
resolution (gene-by-gene) mapping of amplicon boundaries and 
the quantitative analysis of amplicon shape provide significant 
improvement in the localization of candidate oncogenes. Parallel 
microarray measurements of mRNA levels reveal the remarkable 
degree to which variation in gene copy number contributes to 
variation in gene expression in tumor cells. Specifically, we find 
that 62% of highly amplified genes show moderately or highly 
elevated expression, that DNA copy number influences gene ex- 
pression across a wide range of DNA copy number alterations 
(deletion, low-, mid- and high-level amplification), that on average, 
a 2-fold change in DNA copy number Is associated with a corre- 
sponding 1.5-f old change in mRNA levels, and that overall, at least 
12% of all the variation in gene expression among the breast 
tumors is directly attributable to underlying variation in gene copy 
number. These findings provide evidence that widespread DNA 
copy number alteration can lead directly to global deregulation of 
gene expression, which may contribute to the development or 
progression of cancer. 

Conventional cytogenetic techniques, including comparative 
genomic hybridization (CGH) (1), have led to the identifi- 
cation of a number of recurrent regions of DNA copy number 
alteration in breast cancer cell lines and tumors (2-4). Whfle 
some of these regions contain known or candidate oncogenes 
[e.g. FGFR1 (8pll), MYC (8q24), CCND1 (llq!3), ERBB2 
(17ql2), and ZNF217 (20ql3)] and tumor suppressor genes 
[RBI (13ql4) and TP53 (17pl3)]> the relevant gene(s) within 
other regions (eig., gain of lq, 8q22, and 17q22-24, and loss of 
8p) remain to be identified. A high-resolution genome-wide 
map, delineating the boundaries of DNA copy number alter- 
ations in tumors, should facilitate the localization and identifi- 
cation of oncogenes and tumor suppressor genes in breast 
cancer. In this study, we have created such a map, using 
array-based CGH (5-7) to profile DNA copy number alteration 
in a series of breast cancer cell lines and primary tumors. 

An unresolved question is the extent to which the widespread 
DNA copy number changes that we and others have identified 
in breast tumors alter expression of genes within involved 
regions. Because we had measured mRNA levels in parallel m 
the same samples (8), using the same DNA microarrays, we had 
an opportunity to explore on a genomic scale the relationship 
between DNA copy number changes and gene expression. From 

www.pnas.org/cgi/doi/IO.1073/pnas.l62471999 



this analysis, we have identified a significant impact of wide- 
spread DNA copy number alteration on the transcriptional 
programs of breast tumors. 

Materials and Methods 

Tumors and Cell Lines. Primary breast tumors were predominantly 
large (>3 cm), intermediate-grade, infiltrating ductal carcino- 
mas, with more than 50% being lymph node positive. The 
fraction of tumor cells within specimens averaged at least 50%. 
Details of individual tumors have been published (8, 9), and 
are summarized in Table 1, which is published as supporting 
information on the PNAS web site, www.pnas.org. Breast cancer 
cell lines were obtained from the American Type Culture 
Collection. Genomic DNA was isolated either using Qiagen 
genomic DNA columns, or by phenol/chloroform extraction 
followed by ethanol precipitation. 

DNA Labeling and Microarray Hybridisations. Genomic DNA label- 
ing and hybridizations were performed essentially as described 
in Pollack et al (7), with slight modifications. Two micrograms 
of DNA was labeled in a total volume of 50 microliters and the 
volumes of all reagents were adjusted accordingly. 4 ^^ M JP N * 
(from tumors and cell lines) was f luorescenth/ labeled (Cy5) and 
hybridized to a human cDNA microarray containing 6,691 
different mapped human genes (i.e., UniGene clusters). The 
"reference" (labeled with Cy3) for each hybridization was nor- 
mal female leukocyte DNA from a single donor. The fabrication 
of cDNA microarrays and the labeling and hybridization of 
mRNA samples have been described (8). 

Data Analysis and Map Positions. Hybridized arrays were scanned 
on a GenePix scanner (Axon Instruments, Foster City, CA), and 
fluorescence ratios (test/reference) calculated using scanalyze 
software (available at http://rana.lbi.gov). Fluorescence ratios 
were normalized for each array by setting the average log 
fluorescence ratio for all array elements equal to 0. Measure- 
ments with fluorescence intensities more than 20% above back- 
ground were considered reliable. DNA copy number profiles 
that deviated significandy from background ratios measured in 
normal genomic DNA control hybridizations were interpreted as 
evidence of real DNA copy number alteration (see Estimating' 
Significance of Altered Fluorescence Ratios in the supporting 
information). When indicated, DNA copy number profiles are 
displayed as a moving average (symmetric 5-nearest neighbors). 
Map positions for arrayed human cDNAs were assigned by 



Abbreviation: CGH. comparative genomic hybridization. 
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identifying the starting position of the best and longest match of 
anv DNA sequence represented in the corresponding UniGene 
cluster (10) against the "Golden Path" genome assembly 
(http://genorne ucsc.edu/; Oct 7, 2000 Freeze). For UniGene 
clusters represented by multiple arrayed elements, mean fluo- 
rescence ratios (for all elements representing the same UniGene 
cluster) are reported. For mRNA measurements, fluorescence 
ratios are "mean-centered" (i.e., reported relative to the mean 
ratio across the 44 tumor samples). The data set described here 
can be accessed in its entirety in the supporting information. 

Results 

We performed CGH on 44 predominantly locally advanced, 
primary breast tumors and 10 breast cancer cell lines, using 
cDNA microarrays containing 6,691 different mapped human 
eenes (Fig. la; also see Materials and Methods for details of 
microarray hybridizations). To take full advantage of the mi- 
proved spatial resolution of array CGH, we ordered (fluores- 
cence ratios for) the 6,691 cDNAs according tor the Golden 
Path" (http://gcnome.ucsc.edu/) genome assembly of the draft 
human genome sequences (11). In so doing, arrayed cDNAs not 
only themselves represent genes of potential interest (e.g.. 
candidate oncogenes within amplicons), but also provide precise 
genetic landmarks for chromosomal regions of amplification and 
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deletion. Parallel analysis of DNA from cell lines containing 
different numbers of X chromosomes (Fig. lb), aswe did before 
(7) demonstrated the sensitivity of our method to detectsuigle- 
Spy loss (45, XO), and 1.5- (47.XXX) . * W«XX) , or 
2.5-fold (49.XXXXX) gains (also see Fig. 5, which is published 
as supporting information on the PNAS web site), fluorescence 
ratios were linearly proportional to copy number ratios, which 
were slightly underestimated, in agreement with previous ob- 
servations (7). Numerous DNA copy number alterations were 
evident in both the breast cancer cell lines and primary tumors 
(Fig la), detected in the tumors despite, the presence of euploid 
non-tumor cell types; the magnitudes of the observed changes 
were generally lower in the tumor samples. DNA copy-number 
alterations were found in every cancer cell line and tumor, and 
on every human chromosome in at least one sample. Recurrent 
regions of DNA copy number gain and toss were readtty iden- 
tifiable. For example, gains within lq. 8q, 17q, and 20q were 
observed in a high proportion of breast cancer ceD lines/rumors 
(90%/69%, 100%/47%, 100%/60%, ^ 90%/44% respechvp- 
ry), as were losses within lp, 3p, 8p, and 13q (80%/24%, 
80%/22%, 80%/22%, and 70%/18%, respectively), consistent 
W iih published cytogenetic studies (refs. 2-4; a complete ^hsting 
of gains/losses is provided in Tables 2 and 3, which are published 
as supporting information on the PNAS web site). The total 
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number of genomic alterations (gams and losses) was found to 
be significantly higher in breast tumors that were high grade (P = 
0-008), consistent with published CGH data (3), estrogen recep- 
tor negative (P « 0.04), and harboring TP53 mutations (P = 
0.0006) (see Table 4; which is published as supporting informa- 
tion on the PNAS web site). 

The improved spatial resolution of our array CGH analysis is 
illustrated for chromosome 6\ which displayed extensive DNA 
copy number alteration in our series. A detailed view of the 
variation in the copy number of 241 genes mapping to chromo- 
some 8 revealed multiple regions of recurrent amplification; 
each of these potentially harbors a different known or previously 
uncharacterized oncogene (Fig. 2a). The complexity of amplicon 
structure is most easily appreciated in the breast cancer cell line 
SKBR3. Although a conventional CGH analysis of 8q in SKBR3 
identified only two distinct regions of amplification (12), we 
observed three distinct regions of high-level amplification (la- 
beled 1-3 in Fig. 2b). For each of these regions we can define the 



boundaries of the interval recurrently amplified in the tumors we 
examined; in each case, known or plausible candidate oncogenes 
can be identified (a description of these regions, as well as the 
recurrently amplified regions on chromosomes 17 and 20, can be 
found in Figs. 6 and 7, which are published as supporting 
information on the FN AS web site). 

For a subset of breast cancer cell lines and tumors (4 and 37, 
respectively), and a subset of arrayed genes (6,095), mRNA 
levels were quantitatively measured in parallel by using cDNA 
microarrays (8). The parallel assessment of mRNA levels is 
useful in the interpretation of DNA copy number changes. For 
example, the highly amplified genes that arc also highly ex- 
pressed are the strongest candidate oncogenes within an ampli- 
con. Perhaps more significantly, our parallel analysis of DNA 
copy number changes and mRNA levels provides us the oppor- 
tunity to assess the global impact of widespread DNA copy 
number alteration on gene expression in tumor cells. 

A strong influence of DNA copy number on gene expression 
is evident in an examination of the pseudocolor representations 



Pollack ef a/. 



WAS | October 1, 2002 | vol.99 \ no. 20 | 12965 



' BT«74 
1470 

NORWAY 27 
NORWAY 7 
NORWAY JOB 

NORWAY» 
NORWAY 109 
NORWAY lit . 

norway 17- 

CTANFOgO» 
STANFORD 36 

MORwmriia 

STANFORD? 

— 

S 

29 

^n IVU 101 

-NORWAY.4S 

NORWAY 41 
NORWAY 66 
NORWAY 18 
STANFORD 33 
NORWAY "IS 

STANFORD 1ft 

STANFORD 21 
-STANFORD ">7 
NORWAY 104 
NORWAY 10 
NORWAY tl 
NORWAY K» 




ratio: 



.■'5 : .f :» 



I 




iw 



i7y 



1M» 



17ptef 



17qler 



are Illustrated for breast cancer cell lines and tumor*. Breast cancer cell l.nes and tu »"^^ 
■dentkalsampleorder^ 

and mRNA levels were determined, are ordered by position along the chromosome selected genes are im»w« 
Fluorescence ratios (test/reference) are depicted by separate tog 2 pseudocolor scales (indicated). 



of DNA copy number and mRNA levels for genes on chromo- 
some 17 (Fig. 3). The overall patterns of gene amplification and 
elevated gene expression are quite concordant; i.e., a significant 
fraction of highly amplified genes appear to be correspondmgty 
highly expressed. The concordance between high-level amplifi- 
cation and increased gene expression is not restricted to chro- 
mosome 17. Genome-wide, of 117 high-level DNA amplifica- 
tions (fluorescence ratios >4, and representing 91 different 
genes), 62% (representing 54 different genes; see Table 5, which 
is published as supporting information on the PNAS web site) 
are found associated with at least moderately elevated mRNA 
levels (mean-centered fluorescence ratios >2), and 42% (rep- 
resenting 36 different genes) are found associated with compa- 
rably highly elevated mRNA levels (mean-centered fluorescence 
ratios >4). , 

To determine the extent to which DNA deletion and lower- 
level amplification (in addition to high-level amplification) are 
also associated with corresponding alterations in mRNA levels, 
we performed three separate analyses on the complete data set 
(4 cell lines and 37 tumors, across 6,095 genes). First, we 
determined the average mRNA levels for each of five classes 
of genes, representing DNA deletion, no change, and low-, 
medium-, and high-level amplification (Fig. 4a), For both the 
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breast cancer cell lines and tumors, average mRNA levels 
tracked with DNA copy number across all five classes, in a 
statistically significant fashion (P values for pair-wise Student s 
i tests comparing adjacent classes: cell lines, 4 X 10" ,1 x 10 . 
5 x 10-', 1 x 10- J ; tumors, 1 x 10"« 1 x 10~™ 5 x 10"<', 
1 X 10 -4 ). A linear regression of the average log(DNA copy 
number), for each class, against average log(mRNA level) 
demonstrated that on average, a 2-fold change in DNA copy 
number was accompanied by 1.4- and 1.5-fold changes in mRNA 
level for the breast cancer cell lines and tumors, respectively (Fig. 
4a regression line not shown). Second, we characterized the 
distribution of the 6,095 correlations between DNA copy num- 
ber and mRNA level, each across the 37 tumor samples (Fig. 4b). 
The distribution of correlations forms a normal-shaped curve, 
but with the peak markedly shifted in the positive direction from 
zero. This shift is statistically significant, as evidenced in a plot 
of observed vs. expected correlations (Fig. 4c), and reflects a 
pervasive global influence of DNA copy number alterations on 
gene expression. Notably, the highest correlations between DNA 
copy number and mRNA level (the right tail of the distribution 
in Fig. 4b) comprise both amplified and deleted genes (data not 
shown). Third, we used a linear regression model to estimate the 
fraction of all variation measured in mRNA levels among the 37 
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luraors that could be attributed to underlying variation in DNA 
copy number. From this analysis, we estimate that, overall, about 
7% of all of the observed variation in mRNA levels can be 
explained directly by variation in copy number of the altered 
genes (Fig. Ad). We can reduce the effects of experimental 
measurement error on this estimate by using only that fraction 
of the data roost reliably measured (fluorescence intensity/ 
background >3); using that data, our estimate of the percent 
variation in mRNA levels directly attributed to variation in gene 
copy number increases to 12% (Fig. Ad). This still undoubtedly 
represents a significant underestimate, as the observed variation 
in global gene expression is affected not only by true variation in 
the expression programs of the tumor cells themselves, but also 
by the variable presence of non-tumor cell types within clinical 
samples. 

Discussion 

This genome-wide; array CGH analysis of DNA copy number 
alteration in a series of human breast tumors demonstrates the 
usefulness of defining amplicon boundaries at high resolution 
(gene-by-gene), and quantitatively measuring amplicon shape, to 
assist in locating and identifying candidate oncogenes. By ana- 
lyzing mRNA levels in parallel, we have also discovered that 
changes in DNA copy number have a large, pervasive, direct 
effect on global gene expression patterns in both breast cancer 



cell lines and tumors. Although the DNA microarrays used in our 
analysis may display a bias toward characterized and/or highly 
expressed genes, because we arc examining such a large fraction 
of the genome (approximately 20% of all human genes), and 
because, as detailed above, we are likely underestimating the 
contribution of DNA copy number changes to altered gene 
expression, we believe our findings are likely to be gencralizablc 
(but would nevertheless still be remarkable if only applicable to 
this set of -6,100 genes). 

In budding yeast, aneuploidy has been shown to result in 
chromosome-wide gene expression biases (13). Two recent 
studies have begun to examine the global relationship between 
DNA copy number and gene expression in cancer cells. In 
agreement with our findings, Phillips et aL (14) have shown that 
with the acquisition of tumorigenicity in an immortalized pros- 
tate epithelial cell line, new chromosomal gains and losses 
resulted in a statistically significant respective increase and 
decrease in the average expression level of involved genes. In 
contrast, Platzer et aL (15) recently reported that in metastatic 
colon tumors only -4% of genes within amplified regions were 
found more highly (>2-fold) expressed, when compared with 
normal colonic epithelium. This report differs substantially from 
our finding that 62% of highly amplified genes in breast cancer 
exhibit at least 2-foJd increased expression. These contrasting 
findings may reflect methodological differences between the 
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studies. For example, the study of Platzer et of. (15) may have 
systematically under-measured gene expression changes. In this 
regard it is remarkable that only 14 transcripts of many thousand 
residing within unarnplified chromosomal regions were found to 
exhibit at least 4-fold altered expression in metastatic colon 
cancer. Additionally, their reliance on lower-resolution chromo- 
somal CGH may have resulted in poorly delimiting the bound- 
aries of high-complexity arnplicons, effectively overcalling re- 
gions with amplification. Alternatively, the contrasting findings 
for amplified genes may represent real biological differences 
between breast and metastatic colon tumors; resolution of this 
issue will require further studies. 

Our finding that widespread DNA copy number alteration has 
a large, pervasive and direct effect on global gene expression 
patterns in breast cancer has several important implications. 
First, this finding supports a high degree of copy number- 
dependent gene expression in tumors. Second, it suggests that 
most genes arc not subject to specific autoregulation or dosage 
compensation. Third, this finding cautions that elevated expres- 
sion of an amplified gene cannot alone be considered strong 
independent evidence of a candidate oncogene's role in tumor- 
igenesis. In our study, fully 62% of highly amplified genes 
demonstrated moderately or highly elevated expression. This 
highlights the importance of high-resolution mapping of arnpli- 
con boundaries and shape [to identify the "driving" gene(s) 
within amplicons (16)], on a large number of samples, in addition 
to functional studies. Fourth, this finding suggests that analyzing 
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the genomic distribution of expressed genes, even within existing 
microarray gene expression data sets, may permit the inference 
of DNA copy number aberration, particularly aneuploidy (where 
gene expression can be averaged across large chromosomal 
regions; see Fig. 3 and supporting information). Fifth, this 
finding implies that a substantial portion of the phenorypic 
uniqueness (and by extension, the heterogeneity in clinical 
behavior) among patients* tumors may be traceable to underly- 
ing variation in DNA copy number. Sixth, this finding supports 
a possible role for widespread DNA copy number alteration in 
tumorigenesis (17, 18), beyond the amplification of specific 
oncogenes and deletion of specific tumor suppressor genes. 
Widespread DNA copy number alteration, and the concomitant 
widespread imbalance in gene expression, might disrupt critical 
stochiometric relationships in ceil metabolism and physiology 
(e.g., proteosome, mitotic spindle), possibly promoting further 
chromosomal instability arid directly contributing to tumor 
development or progression. Finally, our findings suggest the 
possibility of cancer therapies that exploit specific or global 
imbalances in gene expression in cancer. 
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Each year, over 182,000 women in the United States are 
diagnosed with breast cancer, and approximately 45,000 die 
of the disease. 1 Incidence appears to be increasing in the 
United States at a rate of roughly 2% per year. The reasons 
for the increase are unclear, but non-genetic risk factors appear 
to play a large role. 2 

Five-year survival rates range from approximately 65%- 
85%, depending on demographic group, with a significant 
percentage of women experiencing recurrence of their cancer 
within 10 years of diagnosis. One of the factors most predic- 
tive for recurrence once a diagnosis of breast cancer has been 
made is the number of axillary lymph nodes to which tumor 
has metastasized. Most node-positive women are given adju- 
vant therapy, which increases their survival. However, 20%- 
30% of patients without axillary node involvement also 
develop recurrent disease, and the difficulty lies in how to iden- 
tify this high-risk subset of patients. These patients could 
benefit from increased surveillance, early intervention, and 
treatment. 

Prognostic markers currently used in breast cancer recur- 
rence prediction include tumor size, histological grade, steroid 
hormone receptor status, DNA ploidy, proliferative index, and 
cathepsin D status. Expression of growth factor receptors and 
over-expression of the HER-2/neu oncogene have also been 
identified as having value regarding treatment regimen and 
prognosis. 

HER-2/neu (also known as c-erbB2) is an oncogene that 
encodes a transmembrane glycoprotein that is homologous 
to, but distinct from, the epidermal growth factor receptor. 
Numerous studies have indicated that high levels of expres- 
sion of this protein are associated with rapid tumor growth, 
certain forms of therapy resistance, and shorter disease-free 
survival. The gene has been shown to be amplified and/or 
overexpressed in 1 0%-30% of invasive breast cancers and in 
40%-60% of intraductal breast carcinoma. 3 

There are two distinct FDA-approved methods by which 
HER-2/neu status can be evaluated: immunohistochemistry 
(IHC, HercepTest™) and FISH (fluorescent in situ hybridiza- 
tion, PathVysion™ Kit). Both methods can be performed on 
archived and current specimens. The first method allows visual 
assessment of the amount of HER-2/neu protein present on 
the cell membrane. The latter method allows direct quantifi- 
cation of the level of gene amplification present in the tumor, 
enabling differentiation between low- versus high-amplifica- 
titfh. At least one study has demonstrated a difference in 



recurrence risk in women younger than 40 years of age for 
low- versus high-amplified tumors (54.5% compared to 
85.7%); this is compared to a recurrence rate of 16.7-% for 
patients with no HER-2/neu gene amplification. 4 HER-2/neu 
status may be particularly important to establish in women with 
small (< 1 cm) tumor size. 

The choice of methodology for determination of HER- 2/ 
neu status depends in part on the clinical setting. FDA approval 
for the Vysis FISH test was granted based on clinical trials 
involving 1549 node-positive patients. Patients received one 
of three different treatments consisting of different doses of 
cyclophosphamide, Adriamycin, and 5-fluoro'uracil (CAF). 
The study showed that patients with amplified HER-2/neu 
benefited from treatment with higher doses of adriamycin- 
based therapy, while those with normal HER-2/neu levels did 
not. The study therefore identified a sub-set of women, who 
because they did not benefit from more aggressive treatment, 
did not need to be exposed to the associated side effects. In 
addition, other evidence indicates that HER-2/neu amplifica- 
tion in node-negative patients can be used as an independent 
prognostic indicator for early recurrence, recurrent disease at 
any time and disease-related death. 5 Demonstration of HER- 
2/neu gene amplification by FISH has also been shown to be 
of value in predicting response to chemotherapy in stage-2 
breast cancer patients. 

Selection of patients for Herceptin® (Trastuzumab) mono- 
clonal antibody therapy, however, is based upon demonstra- 
tion of HER-2/neu protein overexpression using HercepTest™. 
Studies using Herceptin® in patients with metastatic breast 
cancer show an increase in time to disease progression, 
increased response rate to chemotherapeutic agents and a small 
increase in overall survival rate. The FISH assays have not yet 
been approved for this purpose, and studies looking at response 
to Herceptin® in patients with or without gene amplification 
status determined by FISH are in progress. 

In general, FISH and IHC results correlate well. However, 
subsets of tumors are found which show discordant results; 
i.e., protein overexpression without gene amplification or lack 
of protein overexpression with gene amplification. The clini- 
cal significance of such results is unclear. Based on the above 
considerations, HER-2/neu testing at SHMC/PAML will uti- 
lize immunohistochemistry (HercepTest 0 ) as a screen, fol- 
lowed by FISH in IHC-negative cases. Alternatively, either 
method may be ordered individually depending on the clini- 
cal setting or clinician preference. 
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HER-2/neu via IHC 

88342 (including interpretive report) 

HER-2/neu via FISH 

88271 x2 Molecular cytogenetics, DNA probe, each 
88274 Molecular cytogenetics, interphase in situ hybrid- 
ization, analyze 25-99 cells 
8829 1 Cytogenetics and molecular cytogenetics, interpre- 
tation and report 

Procedural Information 

Immunohistochemistry is performed using the FDA-approved 
DAKO antibody kit, Herceptest^ The DAKO kit contains 
reagents required to complete a two-step immunohisto- 
chemical staining procedure for.routinely processed, paraffin- 
embedded specimens. Following incubation with the primary 
rabbit antibody to human HER-2/neu protein, the kit employs 
a ready-to-use dextran-based visualization reagent. This re- 
agent consists of both secondary goat anti-rabbit antibody 
molecules with horseradish peroxidase molecules linked to a 
common dextran polymer backbone, thus eliminating the need 
for sequential application of link antibody and peroxidase 
conjugated antibody. Enzymatic conversion of the subse- 
quently added chromogen results in formation of visible 
reaction product at the antigen site. The specimen is then coun- 
terstained; a pathologist using light-microscopy interprets 
results. 

FISH analysis at SHMC/PAML is performed using the 
FDA-approved PathVysion™ HER-2/neu DNA probe kit, pro- 
duced by Vysis, Inc. Formalin fixed, paraffin-embedded breast 
tissue is processed using routine histological methods, and then 
slides are treated to allow hybridization of DNA probes to the 
nuclei present in the tissue section. The Pathvysion™ kit con- 
tains two direct-labeled DNA probes, one specific for the 
alphoid repetitive DNA (CEP 1 7, spectrum orange) present at 
the chromosome 17 centromere and the second for the HER- 
2/neu oncogene located at I7ql 1 .2-12 (spectrum green). Enu- 
meration of the probes allows a ratio of the number of copies 
of chromosome 17 to the number of copies of HER-2/neu to 
be obtained; this enables quantification of low versus high 
amplification levels, and allows an estimate of the percentage 
of cells with HER-2/neu gene amplification. The clinically 
relevant distinction is whether the gene amplification is due 
to increased gene copy number on the two chromosome 17 
homologues normally present or an increase in the number of 
chromosome 17s in the cells. In the majority of cases, ratio 
equivalents less than 2.0 are indicative of a normal/negative 
result, ratios of 2.1 and over indicate that amplification is 
present and to what degree. Interpretation of this data will be 
performed and reported from the Vysis-certified Cytogenet- 
ics laboratory at SHMC. 
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ABSTRACT , the consistent cytogenetic translocation of 
chronic myelogenous leukemia (the Philadelphia chromosome, 
Ph 1 ) has been observed in cells of multiple hematopoietic 
lineages/This translocation creates a chimeric gene composed 
of breakpoint-cluster-region (bcr) sequences from chromosome 
22 fused to a portion of the abl oncogene on chromosome 9. The 
resulting gene product (P210 c ' abl ) resembles the transforming 
protein of the Abelson murine leukemia virus in its structure 
and tyrosine kinase activity, P210 c " abl is expressed in Ph 1 - 
positive cell lines of myeloid lineage and in clinical specimens 
with myeloid predominance. We show here that Epstein-Barr 
virus-transformed B-lymphocyte lines that retain Ph 1 can 
express P210 c " abl , The level of expression in these B-cell lines is 
generally lower and more variable than that observed for 
myeloid lines. Protein expression is not related to amplification 
of the abl gene but to variation in the level of bcr-abl mRNA 
produced from a single Ph 1 template. 



Chronic myelogenous leukemia (CML) is a disease of the 
pluripotent stem cell (1). In greater than 95% of patients, the 
leukemic cells contain the cytogenetic marker known as the 
Philadelphia chromosome, or Ph 1 (2). This reciprocal 
translocation event between the long arms of chromosomes 
9 and 22 has been used as a disease-specific marker for 
diagnosis and evaluation of therapy. Multiple hematopoietic 
lineages, including myeloid and B-iymphbid, contain Ph 1 in 
early or chronic phase, as well as in the more acute accel- 
erated and blast crisis phases of the disease. 

One molecular consequence of Ph 1 is the translocation of 
the chromosomal arm containing the c-abl gene on chromo- 
some 9 into the middle of the breakpoint-cluster region (bcr) 
gene on chromosome 22 (3-6). Although the precise 
translocation breakpoints are variable, an RN A-splicing 
mechanism generates a very similar 8-kilobase (kb) mRNA in 
each case (5-9). The hybrid bcr-abl message encodes a 
structurally altered form of the abl oncogene product, called 
P210 c_abl (10-13), with an amino-terininal segment derived 
from a portion pf the exons of bcr on chromosome 22 and a 
carboxyl-terminal segment derived from a major portion of 
the exohs of the c-abl gene on chromosome 9. The chimeric 
structure of bcr-abl and the resulting P210^" abl is similar to the 
structure of the Abelson murine leukemia virus gag-abl 
genome and resulting P160 v ~ abI transforming gene product. 
Both proteins have very similar tyrosine kinase activities (10, 
11, 14) which can be distinguished by their relative stability 
to denaturing detergents and by their ATP requirements from 
the recently described tyrosine kinase activity of the c-abl 
gene product (15). 
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In concert with structural modification of the arnino- 
terminal portion of the abl gene, increased level of expression 
has been implicated in activation of c-abl oncogenic poten- 
tial. Myeloid and erythroid cell lines and clinical samples 
derived from acute-phase CML patients contain about 10- 
fold higher levels of the 8-kb bcr-abl mRNA and P210 coW than 
the c-abl mRNA forms (6 and 7 kb) and P145 c * abI gene product 
(5, 8, 9, li), The higher level of expression of the chimeric 
bcr-abl message in acute-phase cells is not likely to be solely 
due to the presence of the bcr promoter sequences at the 5' 
end of the gens, since the normal 4.5-kb and 6.7-kb bcr- 
encoded mRNA species are expressed at an even lower level 
than the normal c-abl messages (5, 6). 

We have analyzed a series of Epstein-Barr virjis-immor- 
talized B-lymphpid cell lines derived from CML patients (16). 
With such in vitro clonal cell lines, we can evaluate whether 
the presence of Ph 1 always results in synthesis of the chimeric 
bcr-abl message and protein, and whether the quantitative 
expression varies for cells of B-lymphoid lineage as com- 
pared to previously examined myeloid cell lines. Our results 
show that cell lines that retain Ph* do express bcr-abl message 
and protein, but that the level is generally lower and more 
variable' than previously seen for myeloid cell lines. The 
demonstration that the Ph 1 chromosomal template can vary 
in its level of expression of P210 c abl suggests that secondary 
mechanisms, beyond the translocation itself, contribute to 
the regulation of the bcr-abl gene in different cell types or 
subclones that derive from the affected stem cell. 

MATERIALS AND METHODS 

Cells and Cell Labelings. Epstein-Barr virus- transformed 
B-lymphoid cell lines were established from peripheral blood 
samples of chronic- and acute-phase C!ML patients as report* 
ed (16). The cell lines are designated according to patient 
number, karyotype, and lineage.. For example, SK- 
CML7Bt(9,22)-33 refers to CML patient 7, B-lymphoid cell 
line, 9;22 translocation (Ph 1 ), cell line 33; and SK-CML7BN- 
2 refers to B-cell line 2 with a normal karyotype derived from 
the same patient. Repeat karyotype analysis was performed 
to verify ihe retention of Ph 1 just prior to analysis for a bl 
protein and RNA. Cells were maintained in RPMI 1640 
medium with 20% fetal bovine serum. We have not observed 
any consistent pattern of in vitro growth rate that correlates 
to the stage of disease at the time of transformation with 
Epstein-Barr virus. Ceils (1.5 x 10 7 ) were washed twice with 
Dulbecco's modified Eagle's medium lacking phosphate and 
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supplemented with 5% dialyzed fetal bovine serum. Cells 
were then resuspended in 2 ml of the minimal medium. 
Labeling was started with the addition of [ 32 P]orthophos- 
phate (1 mCi/ml; ICN; 1 Ci = 37 GBq) and continued at 37°C 
for 3-4 hr. 

Immunoprecipitation and Immunoblotting. Immunoprecip- 
itations were carried out as described (10). Cells (1.5 x 10 7 ) 
were washed with phosphate-buffered saline and extracted 
with 3-5 ml of phosphate lysis buffer (1% Triton X-100/0.1 
NaDodSO 4 /0.5% deoxycholate/10 mM Na 2 HP0 4 , pH 7.5/ 
100 mM NaCl) with 5 mM EDTA and 5 mM phenylmethyl- 
sulfonyl fluoride. Extracts were clarified by centrifugation 
and precipitated with normal or rabbit anti-abl sera (anti- 
pEX-2 or anti-pEX-5) (17)! The precipitated proteins were 
electrophoresed in a NaDodS0 4 /8% polyacrylamide gel. 
32 P-labeIed proteins were detected by autoradiography. 
Alternatively, abl proteins were detected by immunoblotting. 
Extracts from unlabeled ceils were clarified, and proteins 
were concentrated by immunoprecipitation with rabbit anti- 
sera against aW-encoded proteins [anti-pEX-2 and anti-ppX- 
5 combined (17)] and then fractionated in 8% acrylamide gels. 
The proteins were transferred from the gel to nitrocellulose 
filters, using protease-facilitated transfer (18). The abl- 
encoded proteins were detected using murine monoclonal 
antibodies as a probe and peroxidase-conjugated goat anti- 
mouse second stage antibody (Bio-Rad) for development. 
Rabbit antisera and mouse monoclonal antibodies to abl 
proteins were prepared using bacterially expressed regions of 
the v-abl protein as immunogens (17, 19). Anti-pEX-2 anti- 
bodies react with the internal tyrosine kinase domain and 
anti-pEX-5 antibodies react with the carboxyl-termihal seg- 
ment of the abl proteins. 

RNA Analysis. RNA was extracted from 10 8 cells by the 
NaDodS0 4 /urea/phenol method (20). Polyadenylylated 
RNA was purified by oligo(dT) affinity chromatography. 
Samples were electrophoresed in a 1% agarose/formalde- 
hyde gel and transferred to nitrocellulose, abl RNA species 
were detected by hybridization with a nick-translated v~abl 
fragment probe (21). 

DNA Analysis. DNA was prepared from 5 x 10 7 cells of 
each cell line and processed for Southern blots with a v-nW 
probe as described (21). 

RESULTS 

Variable Levels of P210 cabl Are Detected in Ph ! -Positive Cell 
Lines. Ph l -positive and Ph 1 -negative, Epstein-Barr virus- 
transformed B-lymphocyte cell lines derived from the same 
patient were examined for P210°" abl synthesis by immuno- 
precipitation of [ 32 P]orthophosphate-iabeled cell extracts 
with anti-abl sera (Fig. 1). The normal c-aW protein P145 c ' abl 
was detected at a similar level in multiple Ph^positive and 
Ph^negative cell lines. P210 cabl was only detected in the 
Ph^positive cell lines because the bcr-abl chimeric gene 
which encodes P210 c abl resides on the Ph 1 (4, 5, 11, 13). The 
level of P210 c abl was about 4- to 5-fold higher than the level 
of P145 c abl in the SK-CML7Bt-33 cell line (Fig. 1A, +). The 
Ph^positive erythroid-progenitor cell line K562 (C) showed 
a level of P210 cabI about 10-fold higher than P145" M . 
However, the level of P210 c " abl was about one-fifth that of 
p 145 c.abi in the Prepositive SK-CML16Bt-l cell line (Fig. IB, 
+). Comparison of different autoradiographic exposures 
roughly indicated that the level of P210 cabl varies over a 
20-fold range between these Ph^positive B-cell lines. Anal- 
ysis of four additional Ph^positive B-cell lines demonstrated 
that the level of P210 c abl fell into two general classes; some 
cell lines had a level of P210 c abl similar to SK-CML7Bt-33 
and others had the low level similar to SK-CML16Bt-l (Table 
1). This differs from previous studies with Ph l -positive 
myeloid cell lines and patient samples derived from acute- 
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Fig. 1. Detection of variable levels of P210 c '* bI in PhSpositive 
B-cell lines. Production of P145 ceM and P2W* hi in Epstein-Barr 
virus^transformed B-cell lines derived from a blast-crisis (A) and a 
chronic-phase (£) CML patient was examined by metabolic labeling 
with [ 32 P]orthophosphate and immunoprecipitation. Ph'-negative 
(-) and Ph^positive (+) cell lines derived from each patient were 
analyzed. The Ph^negative cell line in A,- is SK-CML7BN-2 and in 
B y - is SK-CML16BN-1. The Ph^positive cell line in A,+ is 
SK-CML7Bt-33 and in is SK-CML16Bt:i. The K562 cell line, a 
Ph^positive erythroid progenitor cell line spontaneously derived 
from a blast-crisis patient (33), is represented in C. Cells (1.5 x 10 7 ) 
were metabolically labeled with 2 mCi of [ 32 P]orthophosphate for 3-4 
hr and then were extracted and clarified by centrifugation. Samples 
were i mm u no precipitated with control normal serum (lanes 1), 
anti-pEX-2 Oanes 2), or anti-pEX-5 (lanes 3) and analyzed by 
NaDodS0 4 /8% PAGE followed by autoradiography with an inten- 
sifying screen (3 days for A and C, 10 days for B). 

phase CML patients, in which P210 Crab! was detected at a 
10-fold higher level than P145 c ' abl (refs. 10 and 11; Table 1). 
There was no large difference in level of chimeric mRNA and 
P210 c * abl expressed in four myeloid/erythroid-lineage Ph l - 
positive cell lines (K562, EM2, EM3, CML22, and BV173; 
refs. 9 and 11), despite a 4- to 5-fold amplification of 
a^/-related sequences in the K562 cell line. 

Detection of different levels of P210 c abI in Fig. 1 could be 
due to decreased phosphorylation of P210 c ?bl , a lower level 
of P210^" abl synthesis, or altered stability, of the protein. To 
help distinguish among these possibilities, the steady-state 
level of P210 c ' abl in the cell lines was assayed by immuno- 
blotting. The results show that SK-CML7Btr33 (Fig. 2A t +) 
had a higher level of P210 c abI than P145, similar to the results 
with metabolic labeling (Fig. 1). We did not detect P210 c abI 
by immunoblotting with 2 x 10 7 cells of line SK-CML8Bt-3 
(Fig. 2B, + ). Reconstruction experiments using dilutions of 
cell extracts showed that we could detect about 5-10% the 
level of P210 c abI expressed in the K562 cell line (data not 
shown). We infer that the steady-state level of P210 c ' abI in 
SK-CML8Bt-3 is lower than the level in SK-CML7Bt-33 by 
a factor of at least 10. The level of P210 c ab! detected in these 
assays correlated with the amount of P210 c abl tyrosine kinase 
activity that could be detected in vitrp (data hot shown). 

Different Levels of P210^ aW Are Reflected in the Amount of 
Stable bcr-abl mRNA. To identify the basis for detection of 
variable levels of P210 CiabI , we examined the production of 
the abl RNA. RNA blot hybridization analysis using a v-abl 
probe (Fig. 3) showed that the normal 6- and 7-kb c-abl 
mRNAs were present at a similar level in Ph^positive and 
-negative cell lines derived from different patients. However, 
the 8-kb mRNA that encodes P210 c " ab! was detected at a 
10-fold higher level in SK-CML7Bt-33 (Fig. 3A, +) than in 
SK-CML16BM (£, 4-), which correlated with the relative 
level of P210 c abl detected in each cell line; Analysis of 
additional cell lines demonstrated that the level of 8-kb RNA 
directly correlated with the level of P210 c abl (Table 1). The 
variation in level of 8-kb RNA detected in these cell lines was 
not due to loss or gain of Ph 1 ,. because cytogenetic analysis 
confirmed the presence of Ph 1 in these cell lines (ref. 16 and 
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Table 1. Relative levels of bcr-abl expression in Epstein-Barr 
virus-immortalized B-cell lines and myeloid CML lines 



8-kb 



Cell line* 


CML phase* 


Ph 1 * 


P2105 


ihRNA^ 


SK-CML7BN-2 


BC 






■ - 


SK-CML8BN-10 


Chronic 


- 


- 


. - 


SK-CML8BN-12 


Chronic . 








SK-CML16BN-1 


Chronic 








SK-CML35BN-1 


Chronic 








SK-CML7B5-33 


BC 


+ 


+ + + 


++ + 


SK-CML21BM 


Acc 


+ ' 


+ + + 


+ + + 


SK-CML21Bt-6 


Acc 


+ 


+++ 


+++ 


SK-CML8Bt-3 


Chronic 


+ 




± 


SK-CML16BM 


Chronic 


+ 




+ 


SK-CML35Bt-2 


Chronic 


+ 


+ 


+ 


K562 


BC 




+ + + +.+ 


+++ + + 


BV173 


BC 


+ 


+ + +++ 


+ + + + + 


EM2 


BC 


+ 


+ + ++ + 


+ + +++ 



*Cell lines derived from CML patients by transformation with 
Epstein-Barr vims as described (16). Names of cell lines indicate 
patient number and Ph l status: SK-CML7Bt indicates a cell line 
derived from patient 7 that carries the 9;22 Ph 1 translocation; N 
indicates a normal karyotype. Myeloid-erythroid cell lines (K562, 
EM2, and BV173) are described in previous publications (9, 11, 22, 
33). 

T Status of patient at the time cell line was derived. BC, blast crisis; 
Acc, accelerated phase. 

^Presence (+) or absence (-) of Ph 1 as demonstrated by karyotypic 
or Southern blot analysis. 

*P210 c_llM detected as described in legend to Fig. 1. B-cell lines 
derived from blast-crisis and accelerated-phase patients had levels 
of P210 3- to 5-fold higher (+ + +) than levels of P145. Chronic- 
phase-derived cell lines had P210 levels lower than or just equivalent 
(+) to the level of P145. Myeloid and erythroid lines had levels of 
P210 5- to 10-fold higher than P145 (+ + + + +). 

^Eight-kilobase bcr-abl mRNA detected as described in legend to 
Fig. 2. Symbols: ±, borderline detectable; .+ + + ++, level of 8-kb 
mRNA 5- to iO-fold higher than that of the 6- and 7-kb c-abt mRNA 
species; +++, level of 8-kb mRNA 3- to 5-fold higher than that of 
the 6- and 7-kb species; + , a level approximately equivalent to that 
of the 6- and 7-kb messages. 

data not shown). There was no difference in the copy number 
of aM-related sequences as judged by Southern blot analysis 
(Fig. 4). Only the K562 cell line control showed an amplifi- 
cation of abl sequences, as previously reported (22, 23). 
These combined data suggest that differential bcr-abl mRNA 
expression from a single gene template is responsible for the 
variable levels of P210 c ab! detected. This could be mediated 
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Fig. 2. Analysis of steady-state abl protein levels by immuno- 
b lot ting. Ceil extracts prepared from 2 x 10 7 ceUs of lines SK- 
CML7BN-2 (A,-), SK-CML7Bt-33 (A,+), SK-CML8BN40 (5,-). 
and SK-CML8Bt-3 (B,+) were concentrated by immunbprecip- 
itation with anti-pEX-2 plus anti-pEX-5. Samples were then electro- 
phoresed in a NaDodS04/8% polyacrylamide gel and transferred to 
nitrocellulose, using protease : facilitated transfer (18). abl proteins 
were detected using a mixture of two monoclonal antibodies directed 
against the pEX-2 and pEX-5 aW-protein fragments produced in 
bacteria (19) as a probe and a peroxidase-conjugated goat anti-mouse 
second-stage antibody (Bio-Rad) for development. 
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Fio. 3. Comparison of abl RNA levels in Ph l -positive and 
-negative B-cell lines. The levels of the normal 6- and 7-kb c-abl 
RNAs and the 8-kb 6cr-oWRNA were analyzed by blot hybridization 
using a v-dbt probe. RNA was extracted from Ph'-negative lines 
SK-CML7BN-2 (A,-) and SK-CML16BN-1 {B-), from Ph'-pos- 
itive lines SK-CML6Bt-33 (A,+) and SK-CML16Bt-3 (B,+), and 
from line K562 (C;+) by the NaDodS0 4 /urea/phenol method (20). 
Polyadenylylated RNA was purified by oligo(dT) affinity chroma- 
tography, and 15 /xg of each sample was electrophoresed in a 1% 
agarose/formaldehyde gel and then transferred to nitrocellulose. The 
blotted RNAs were hybridized with a . nick-translated v-aM fragment 
probe (21) and then autoradiographed for 4 days. 

by factors influencing the transcription rate of the bcr-abl 
gene or the stability of the mRNA. 

DISCUSSION 

Several lines of evidence suggest that formation of Ph 1 is not 
* the primary event that affects the stem cell in CML. Patients 
have been identified that present with the clinical picture of 
CML but only later develop Ph 1 (1): This observation, 
coupled with studies of G6PD (gIucose-6-phosphate dehy- 
drogenase)-heterozygous females with CML that demon- 
strate stem-cell clbnality by isozyme analysis among cell 
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Fig, 4. Southern blot analysis of abl sequences in Ph'-positive 
and -negatiye B-cell lines. High molecular weight DNA (15 Mg) was 
digested with restriction endonuclease BamHl, separated in a 0\S% 
agarose gel, and then transferred to nitrocellulose. The blotted DNA 
fragments were hybridized with a nick-translated, 2.4-kb Bgl II v-abl 
fragment (1.5 x 10 8 cpm/jag; ref. 21) and exposed for 4 days. (A) 
Autoradiogram of o6/-specific fragments in cell lines HL-60 (lane 1), 
EM2 (lane 2), K562 Gane 3), SK-CML7Bt-33 (lane 4), SK-CML8Bt-3 
(lane 5), SK-CML16BM (lane .6), SK-CML21Bt-6 Cane 7) ( SK- 
CML35Bt-2 (lane 8), SK-CML7BN-2 (lane 9), SK-CML8BN-2 (lane 
10), and SK-CML35BN-1 Oane 11). {B) Ethidium bromide staining of 
agarose gel prior to transfer to nitrocellulose, showing the level of 
variation in amount of DNA loaded per lane. . 
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populations that lack the Ph 1 marker, supports a secondary 
or complementary role for Ph 1 in the progression of the 
disease (24, 25). This chromosome marker is found in 
chronic, accelerated, and blast-crisis phases of the disease. It 
is likely that Ph 1 confers some growth advantage, since cells 
with the marker chromosome eventually predominate the 
marrow and peripheral blood even in chronic phase. During 
the phase of blast crisis, many patients develop additional 
chromosome abnormalities, including duplication of Ph 1 , a 
variety of trisomies, and complex translocations (26). This 
is suggestive evidence for Ph 1 being a necessary but not 
sufficient genetic change for the full evolution of the 
disease. 

The realization that one molecular result of Ph 1 is the 
generation of a chimeric bcr-abl protein with functional 
characteristics and structure analogous to the gag-abl trans- 
forming protein of the Abelson murine leukemia virus 
strengthens the argument for an important role of Ph 1 in the 
pathogenesis of CML. Although the Abelson virus is gener- 
ally considered a rapidly transforming retrovirus, its effects 
can range from overcoming growth factor requirements, to 
cellular lethality, to induction of highly oncogenic tumors in 
a number of hematopoietic cell lineages (27, 28). Even in the 
transformation of murine cell targets, there are several lines 
of evidence that suggest that the growth-promoting activity of 
the \-abl gene product is complemented by further cellular 
changes in the production of the malignant-cell phenotype 
(29-31). 

The regulation of bcr-abl gene expression is complex 
because the 5' end of the gene is derived from the non-aW 
sequences, bcr f normally found on chromosome 22 (6). The 
level of stable message for the normal bcr gene and the 
normal abl gene are both much lower than the level of the 
bcr-abl message and protein from cell lines and clinical 
specimens derived from myeloid blast-crisis patients (5, 6, 
11). Therefore, the high level of bcr-abl expression cannot 
simply be attributed to the regulatory sequences associated 
with bcr. Possibly, creation of the chimeric gene disrupts the 
normal regulatory sequences and results in a higher level of 
expression. Variation in bcr-abl expression may result from 
secondary changes in the structure of the chimeric gene or 
function of tran j-acting factors that occur during evolution of 
the disease. Our analysis of P210 c abI and the 8-kb mRNA in 
Epstein-Barr virus-transformed Ph^positive B-cell lines 
demonstrates that stable message and protein levels from the 
bcr-abl gene can vary over a wide range. This variation does 
not result from a change in the number of bcr-abl templates 
secondary to gene amplification but more likely from changes 
in either transcription rate or mRNA stability. We suspect 
this range of bcr-abl expression is not limited to lymphoid 
cells. Analysis of peripheral blood leukocytes derived from 
an unusual CML patient who has been in chronic phase with 
myeloid predominance for 16 years showed a level of 
P210 c " abl one-fifth that of FUS*'** 3 *, as detected by metabolic 
labeling with [ 32 P]orthophosphate and immunoprecipitation 
(S.C., O.N. W., and P. Greenberg, unpublished observa- 
tions). Lower levels of expression of the chimeric mRNA 
have been demonstrated in clinical samples from chronic- 
phase CML patients compared to acute-phase CML patients 
(9). Others have reported chronic-phase patients with vari- 
able but, in some cases, relatively high levels of the bcr-abl 
mRNA (32). The sampling variation and the heterogenous 
mixture of cell types in clinical samples complicate such 
analyses. Further work is needed to evaluate whether there 
is a defined change in P210 c abI expression during the pro- 
gression of CML. It is interesting to note that among the 
limited sample of Pi^-positive B-cell lines we have examined 
(Table 1), we have seen higher levels of P210 c abl in those 
derived from patiu v. • at more advanced stages of the disease. 



It will be important to search for cell-type-specific mecha- 
nisms that might regulate expression of bcr-abl from Ph 1 . 
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Hiah-throuqhput technologies, such as proteomic screening and DNA micro-arrays produce vast 
amou^ To? d£ requiring comprehensive anaiytica. methods to decipher the **^«*™« 
r«ui« One approach would be to manually search the biomedical literature; however. th,s would be 
an ^ arduous tes~ We^eveloped an automated literature-mining tool, termed MedGene wh.cn 
S^S^-^martai -nd estimates the relative strengths, of all human gene-disease 
relTttonsWps fn Medline. Using MedGene. we analyzed a novel micro-array expression dates* 
7l^no breast «ncer and normal breast tissue in the context of existing knowledge. We found no 
3 o^ee7^ength of the literature association and the magnitude of the difference In 
expSsion £Z wnen considering changes as high as 5-fo.d; however a signincan, cwMor .was 
observed (r = 0.41; p = 0.05) among genes showing an expression difference of 10- old or more. 
^eSl this onlj held true for estrogen receptor (ER) positive tumor*, not ER negafve. MedGene 
idenSed a set of relatively understudied, yet highly expressed genes in ER negative tumors worthy of 
further examination. 

Keywords: bioinformatics . micro-array . text mining . gene-disease association . breast cancer 



Introduction 

At its. current pace, the accumulation of biomedical literature 
outpaces the ability of most researchers and clinicians to stay 
abreast of their own immediate fields, let alone cover a broader 
range of topics. For example, to follow a single disease, e.g.. 
breast cancer, a researcher would have had to scan 130 different 
journals and read 27 papers per day in 1999.' This problem is 
accentuated with high-throughput technologies such as DNA 
micro-arrays and proteomics. which require the analysis of 
large datasets involving thousands of genes, many of which are 
unfamiliar to a particular researcher. In any microarray experi- 
ment, thousands of genes may demonstrate statistically sig- 
nificant expression changes, but only a fraction of these may 
be relevant to the study. The ability to interpret these datasets 
would be enhanced if they could be compared to a compre- 
hensive summary of what is known about all genes. Thus, there 
is a need to summarize existing knowledge in a format that 
allows for the rapid analysis of associations between genes and 
diseases or other specific biological concepts. 

One solution to this problem is to compile structured digital 
resources, such as the Breast Cancer Gene Database 1 and the 
Tumor Gene Database. 2 However, as these resources are hand- 
curated, the labor-intensive review process becomes a rate- 
limiting step in the growth of the database. As a result, these 
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databases have a limited scale and the genes are not selected 
in a systematic fashion. 

An alternative approach is automated text mining: a method 
which involves automated information extraction by searching 
documents for text strings and analyzing their frequency and 
context. This approach has been used successfully in several 
instances for biological applications. In most cases, it has been 
applied to extract information about the relationships or 
interactions that proteins or genes have with one another, in 
the literature or by functional annotation. 3 " 7 Thus far. few 
publication have applied text-mining to examine the global 
relationships between genes and diseases. Perez-Iratxeta et al. 
automatically examined the GO (Gene Ontology) annotation 
of genes and their predicted chromosomal locations in order 
to identify genes linked to Inherited disorders. 8 

To obtain a more global understanding of disease develop- 
ment, it would be valuable to incorporate information regarding 
all possible gene-disease relationships, including biochemical, 
physiological, pharmacological, epidemiological, as well as 
genetic. This information would enable comprehensive com- 
parisons between large experimental datasets and existing 
knowledge in the literature. This would accomplish two things. 
First, it would serve to validate experiments by demonstrating 
that known responses occur as predicted. Second, it would 
rapidly highlight which genes are corroborated by the literature 
and which genes are novel in a given context. We have utilized 
a computational approach to literature mining to produce a 
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comprehensive set of gene-disease relationships. In addition, 
we have developed a novel approach to assess the strength of 
each association based on the frequency of citation and co- 
citation. We appiied this tool to help interpret the data from a 
large micro-array gene expression experiment comparing 
normal and cancerous breast tissue. 



Methods 

MedGene Database. MedGene is a relational database, stor- 
ing disease and gene information from NCBI. text mining re- 
sults, statistical scores, and hyperlinks to the primary lit- 
erature. MedGene has a web-based user interface for users to 
query the database (http://Wpseq.med.haavard.edu/MedC^e/). 

Text Mining Algorithms. MeSH files were downloaded from 
the MeSH web site at NLM (Nation Library of Medicine) (http:/ / 
wvw.nlm.nlh.gov/mesh/meshhome.html) and human disease 
categories were selected. LocusUnk files were downloaded from 
the LocusLink web site at NCBI (http://www.ncbi.nih.gov/ 
LocusLink/). Official/preferred gene symbol, official/preferred 
gene name, and gene alternative symbols and names, all 
relevant annotations and URLs for each LocusLink record, were 
collected. Gene search terms were used for literature searching 
and included all qualified gene names, gene symbols, and gene 
family terms. Primary gene keys, predominantly qualified gene 
family terms and gene official/preferred symbols, were used 
to index Medline records. If the official/preferred gene symbols 
did not meet the standards to be an index, then qualified gene 
official/preferred names were used. A local copy of Medline 
records (up to July. 2002) was pre-selected. 

A JAVA module examined the MeSH terms and then indexed 
each Medline record with the appropriate disease terms. A 
separate JAVA module was used to examine the titles and 
abstracts for gene search terms and then to index the gene- 
related Medline records with the relevant primary gene key(s). 

Statistical Methods. For every gene and disease pair, we 
counted records that were indexed for both gene and disease 
(double positive hits), for disease only (disease single hits), for 
gene only (gene single hits), and for neither gene nor disease 
(double negative hits) to generate a 2 x 2 contingency table. 
On the basis of the contingency table-framework, we applied 
different statistical methods to estimate the strength of gene- 
disease relationships and evaluated the results. These methods 
included chl-square analysis. Fisher's exact probabilities, rela- 
tive risk of gene, and relative risk of disease 16 (http:// 
hipseq.mediiarvard.edu/MedGene/). In addition, we computed 
the "product of frequency", which is the product of the 
proportion of disease/gene double hits to disease single hits 
and the proportion of disease/gene double hits to gene single 
hits. To obtain a normal distribution, we transformed all the 
statistical scores using the natural logarithm. We selected the 
log of the product of frequency (LPF) to validate MedGene and 
to use for the analysis with the micro-array data. Spearman 
rank-correlation coefficients were used to assess the linear 
relationship between LPF and micro-array fold change in 
expression level. 

Global Analysis. Diseases with at least 50 related genes were 
selected for clustering analysis, and the LPF scores were 
normalized with total score for each disease. Hierarchical 
clustering was done with the "Cluster" software and the 
clustering result was visualized using TreeViewer" (http:// 
rana.lbl.gov/EisenSoftware.htm) . 
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Breast Tissue Micro-Arrays. Eighty-nine breast cancer 
samples (79% ER-positive) and 7 normal breast tissue samples 
were selected from the Harvard Breast SPORE frozen tissue 
repository and were representative of the spectrum of histo- 
logical types, grades, and hormone receptor immurto-pheno- 
types of breast cancer. Blotlnylated cRNA. generated from the 
total RNA extracted from the bulk tumor, was hybridized to 
Aflymetrix U95A oligonucleotide micro-arrays. These micro- 
arrays consist of 12 400 probes, which represent approximately 
9000 genes. Raw expression values were obtained uslng'GENE- 
CHIP software from Afrymetrix, and then further analyzed using 
the DNA-Chip Analyzer (dChip) custom software. 

Results 

Automated Indexing of Medline Records by Disease and 
Gene. To study the gene-disease associations in the literature, 
we first compiled complete lists for human diseases and human 
genes. To index all Medline records that were relevant to 
human diseases, the Medical Subject Heading (MeSH) index 
of Medline records was utilized. MeSH is a controlled medical 
vocabulary from the National library of Medicine and consists 
of a set of terms Or subject headings that are arranged in both 
an alphabetic and an hierarchical structure. Medline records 
are reviewed manually and MeSH terms are added to each with 
software assistance. 910 Twenty-three human disease category 
headings along with all of their child terms (see the Supporting 
Information, Supplemental Table 1, or visit http://hipseq. 
med.harvard.edu/MedGene/publication/s_Table l.html) were 
selected from the 2002 MeSH index creating a list of 4033 
human diseases. 

No index comparable to the MeSH index exists for genes, 
and thus, it was necessary to apply a string search algorithm 
for gene names or symbols found In Medline text A complete 
list of genes, gene names, gene symbols, and frequently used 
synonyms were collected from the LocusLink database at 
NCBI, ,W * which contains 53 259 independent records keyed 
by an official gene symbol or name (June 18 th , 2002). For the 
purposes of this study, no distinction was made between genes 
and their gene products. Authors often use the same name for 
both, differentiating the two only by the use of italics, if at all. 
For the intended use of this study, this lack of distinction is 
unlikely to have a large effect and may In fact be beneficial. 

Initial attempts to search the literature using these lists 
revealed several sources of false positives and false negatives 
(Table I). False positives primarily arose when the searched 
term had other meanings, whereas false negatives arose from 
syntax discrepancies necessitating the development of filters 
to reduce these errors. The syntax issues were readily handled 
by including alternate syntax forms in the search terms. The 
false positive cases, caused by duplicative and unrelated 
meanings for the terms, were more difficult to manage. Where 
possible, case sensitive string mapping reduced inappropriate 
citations. In many cases, however, this was not sufficient and 
the terms had to be eliminated entirely, thereby reducing the 
false positive rate but unavoidably under-representing some 
genes. 

For the purposes of data tracking, a primary gene key was 
selected to represent all synonyms that correspond to each 
gene. Medline records were indexed with a primary gene key 
when any synonym for that key was found in the title or 
abstract. Case-insensitive string mapping was used for all 
searches except as noted above. No additional weight was 
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gene symbol/name 
Is not unique 



gene symbol is 

unrelated abbreviation 
gene symbol/name 

has language meaning 
nonstandard syntax ^ 
unofficial gene name/symbol 
nonspedHed gene name 



false positive 

false positive 

false negative 
false negative 
false negative 



MAG-myeltn 

associated glycoprotein 
MAC-^aUgnancy-associated 

protein 

PA— pallid homologue (mouse). 

pallldin (also abbrev. for Pennsylvania) 
IMS-Wiskott-Aldrich Syndrome 

(also the word "was") 
BAG-i instead of BAG! 
P53 instead of TP53 
estrogen receptor instead of 

Estrogen receptor 1 



eliminate this term 

case-sensitive string search 

add dash term 

add all gene nicknames 

add family stem term 



^T^t4. ^rtt« ^^ sensmvu/even a, the expense of specificity if needed. 

added for multiple occurrences of a term or the co-occurrence 
of multiple synonyms for the same gene key. 

Medline records were searched with all qualified gene 
identifiers, such as the official/preferred gene symbol, the 
official/preferred gene name, all gene nicknames and all syntax 
variants. In situations where there are several members of a 
gene family or splice variants, some authors prefer to use a 
shortened gene family name. e.g.. estrogen receptor instead of 
estrogen receptor 1 (ESM). creating a source of false negatives 
For this reason, gene family stem terms were created for £1 
E enes that have an alpha or numerical suffix (e.g.. IL2RA,TGFp, 
ESR1 etc ) and then used to search the literature. The family 
stem' terms were handled separately from the specific gene 
names so that it would be clear when linkages were made to 
the gene family versus a specific member in that family. 

To improve performance and accuracy, some pre-selection 
was applied to the records that were scanned. First, review 
articles were eliminated to avoid redundant treatment of 
citations. Second. non-English journals were removed because 
the natural language filters were only relevant to English 
publications. Finally, journals unlikely to contain primary data 
about eene-dlsease relationships were also removed (e.g.. Int. 
J Health Educ. Bedside Nurse, and J. Health Econ.). Together, 
these filters reduced the 12 198 221 Medline publications (July 

2002) by 37%. . 

Ranking the Relative Strengths or Gene-Disease Associa- 
tions. In total, there were 618 708 gene-disease co-citations 
in which 16% (8297) of all studied genes had been associated 
to a disease and 96% (3875) of all diseases had been associated 
to at least one gene. To rank the relative strengths of gene 
disease relationships, we tested several different statistical 
methods and examined the results. With the exception of the 
relative risk estimates, the methods provided similar results 
with respect to the rank order of the gene-disease association 
strengths. However, after comparing the results to other 
databases and after consulting disease experts, the log of the 
product of frequency (LPF) was selected for further analysis 
because it gave the best results overall. 

Validation of MedGene. In developing this tool, it was 
Important to minimize the number of missed genes (false 
negatives) and miscalled genes (false positives). However, in 
situations when these goals were in conflict, inclusiveness was 
prioritized. To determine the false negative rate in MedGene 
breast cancer was used as a test case because it was associated 
with more genes than any other human disease and because 




Figure 1. Estimation of the false negative rate by comparison 
with hand-curated databases. The breast cancer-related genes 
identified by MedGene were compared with those listed In 
several other databases Including the Tumor Gene Database 
(TGD) 1 the Breast Cancer Gene Database(BCG), 1 GeneCards 
(GC)" and Swissprot.' 8 Genes were considered false negatives 
if they were represented in at least one of these other databases 
and not In MedGene and their link to breast cancer was sup- 
ported by at least one literature reference. All literature references 
were verified by manual review to confirm their validity. The 
number of genes In each database or shared by more than one 
database is indicated. The false negative rate was calculated by 
genes missed at MedGene (26)/total number of nonoverlapping 
genes in other databases (285). 

there were several public databases that link genes to breast 
cancer. We compared the list of breast cancer-related genes 
from MedGene to these databases, illustrated in Figure 1. 
Among the 285 distinct breast cancer-related genes that were 
supported by at least one literature citation in these hand- 
curated databases. 26 were absent from MedGene. suggesting 
a false negative rate of approximately 9%. To determine why 
these were missed, all literature references for these genes (80 
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papers) were reviewed manually (see the Supporting Informa- 
tion Supplemental Table 2. or visit http://hipseq.med. 
harvard.edu/MedGene/publicaUon/s_Table 2.html). Among 
these papers, most false negatives were caused by nonstandard 
gene terms or gene terms eliminated by our specificity filters. 
Few genes were missed because they were only mentioned in 
review papers (0.4%) or they appeared only in the body of the 
manuscript but not the abstract or title (1.1%). Of note, 
MedGene identified approximately 2000 additional breast 
cancer-related genes not listed in any other database. 

To assess the false positive error rate, two complementary 
approaches were used; a detailed analysis of one disease and 
a global examination of 1000 diseases. The detailed approach 
examined the false positive error rate and its sources, whereas 
the global approach tested whether the overall results made 
biomedical sense. 

Using the LPF. 1467 genes related to prostate cancer were 
assembled in rank order. We then retrieved approximately 300 
Medline records each for the highest ranked 100 and the lowest 
ranked 200 genes and manually reviewed the tides and 
abstracts to determine the verity of the association. Nearly 80% 
of the highest ranked 100 genes fell into one of the five 
categories that reflect meaningful gene-disease relationships 
(see the Supporting Information. Supplemental Table 3. or visit 
http://hlpseq.med.harvard.edu/MedGene/publicatlon/ 
s Table 3Jitmi). Among the lowest ranked 200 genes, ap- 
proximately 70% reflected true relationships. Of the 600 records 
reviewed, there were only two in which the association between 
the gene and the disease was described as negative. Both were 
genes with very low scores. In both cases, the authors did not 
argue the absence of any relationship, but rather that a 
particular feature of the gene or protein was not shown to be 
related to human prostate cancer."*" 

The coincidence of some gene symbols with medical ab- 
breviations, chemical abbreviations and biological abbrevia- 
tions resulted in most of the false positives (see the Supporting 
Information. Supplemental Table 4. or visit http://hipse- 
q.med.harvard.edu/MedGene/pubIication/s„Table 4.html), em- 
phasizing the importance of the filters that were added in the 
search algorithm fTable 1). Without the filters, the false positive 
rate more than doubled, and the false negative rate rose 
dramatically (data not shown). For example, among the papers 
about breast cancer, there were only 12 MedUne records that 
referred to ESR1 and 10 to ESR2 % whereas almost 2000 papers 
mentioned estrogen receptor without specifying ESR1 or ESRZ 
this latter group was detected by the family stem term filter. 

To further validate these results, a global analysis of the gene- 
disease relationships described by MedGene was performed. 
For this experiment, it was reasoned that the more closely 
related the diseases are to one another, the more they will be 
related to the same gene sets. Thus, if the relationships defined 
by MedGene accurately reflected the literature, then an unsu- 
pervised hierarchical clustering of the gene data should group 
diseases in a manner consistent with common medical think- 
ing. Conversely, if the clustered diseases do not make sense 
biologically or medically, it may reflect excessive false positives, 
false negatives, or inappropriate scoring of the data. 

To execute this experiment, the gene sets and the corre- 
sponding LPF values for 1000 randomly selected diseases (each 
with at least 50 gene relationships) were used as a dataset for 
clustering the diseases. A review of the results showed that the 
resulting disease clusters were indeed logical based upon 
common medical knowledge (see the Supporting Information, 
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Supplemental Figure 1. or visit http://hipseq.med.harvard.6du/ 
MedGene/publication/s.Figure l.html). For example, in one 
such cluster shown in Figure 2. diabetes and its complications 
grouped together and were also closery linked to diseases 
associated with starvation states. 

The number of genes associated with a given disease can 
be estimated by adjusting the MedGene number up by thcCalse 
negative rate (-9%) and down by the false positive rate (-26% 
on average). Using this, the average disease has 103.7 ± 45.3 ,. 
(mean ± s.d.) genes associated with it. although the rtfige is 
quite broad with 2359 genes related to breast cancer. 2122 
genes related to lung cancer and no genes related to a number 
of diseases. 

Applying MedGene to the Analysis of Large Datasets. Access 
to a comprehensive summary of the genes linked to human 
diseases provided an opportunity to analyze data obtained from 
a high-throughput experiment. We compared the MedGene 
breast cancer gene list to a gene expression data set generated 
from a micro-array analysis comparing breast cancer and 
normal breast tissue samples. Micro-array analysis identified 
2286 genes that had greater than a 1-fold difference in mean 
expression level between breast cancer samples and normal 
breast samples. Using MedGene. we sorted the 2286 genes into 
four classes: 555 genes directly linked to breast cancer in the 
literature by gene term search (first-degree association by gene 
name); 328 genes directly linked by family term search (first- 
degree association by family termj; 1021 genes linked to breast 
cancer only through other breast cancer genes (second-degree 
association); and 505 genes not previously associated with 
breast cancer. (See the Supporting Information, Supplemental 
Figure 2, or visit http://hipseq.med.harvard.edu/MedGene/ 
publication/s_Figure 2.html.) Among the 505 previously un- 
related genes. 467 were either newly identified genes or genes 
that had not previously been associated with any disease. 
Among the remaining 38 genes. 9 had been related to other 
cancers, specifically esophageal, colon, uterine, skin, and cervix. 

To determine whether the genes highlighted by the micro- 
array analysis were more likely to have been previously linked 
to breast cancer in the literature, we created a two-dimensional 
plot of the fold change of expression level between breast 
cancer and normal tissue versus the literature score (LPF) 
(Figure 3A). There was a broad spread of expression changes 
among the genes directly linked to breast cancer ranging from 
less than 1-fold change (68%) to over 40-fold (0.3%). Notably, 
the majority of genes with greater than 10-fold expression 
changes were linked to breast cancer by first-degree associa-, 
tion. 

Among all 754 genes directly linked to breast cancer in the 
literature, there was no correlation between LPF and micro- 
array fold change (r- 0.018. p-value = 0.62). However, when 
we stratified the analysis based on the magnitude of the fold 
change, we observed an increasing trend in correlation (Figure 
3B) suggesting that genes with a more substantial change in 
expression level were more likely to have a stronger association 
in the literature. For genes that had 10-fold change or more in 
expression level, the correlation increased to 0.41 (p-value = 
0.05). 

When we evaluated the micro-array data separately for ER 
positive and ER negative tumors, the trend in correlation 
between fold change and literature score was highly dependent 
on estrogen receptor status. Interestingly, there was a similar 
trend in correlation for ER positive tumors, but no trend in 
correlation for ER negative tumors. 
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Figure 2. Global validation by clustering analysis. 2(A). The gene sets and the corresponding LPF values for '™ d™"*^ with 
a Ueast 50 gene relationships, were used in an unsupervised clustering of the diseases based on tagmi^m 
them A sample of the data is shown here. 2(B). One of the resulting clusters is shown that corresponds to blood sugar states. Diabetes 
erms S he ine) and starvation states terms (under the iine) clustered together. Within these groups, ^^^^ 
diabetic small vessel complications, altered serum chemistries, nutritional disorders, etc. (Supplemental Figure 1: http7/h,pseq.med. 
harvard.edu/MedGene/publication/s_Figure 1.html). 

disease unrelated to breast cancer. As expected, we did not 
observe an increasing trend in correlation for hyperten- 
sion. 



Finally, to validate our findings, we computed similar cor- 
relations between the breast cancer expression data and 
LPF scores generated by MedGene for hypertension, a 
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expression level. Furthermore, the majority of genes that ^^^^ SSTSSnWsoore (LPF) and the fold change 

estrogen receptor negative tumors only (purple). 
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Discussion 

The Human Genome Project heralded a new era in btologteal 
research where the emphasis on understanding specific path- 
os has expanded to global studies of genomic option 
anJ biologiLl systems. High-throughput technologies can 
provide novel Insight into comprehensive^ biological function 
but also introduces new challenges. The utility of these 
technologies Is limited to the ability to generate, analyze, and 
Jl large gene lists. MedGerie. a relational database 
dS by fling the information in Medline, was oeated to 
address this need. MedGene users can query for a rank-ordered 
teSuman gene-disease relationships (Table 2) for one or 
more diseases. Each entry is hyperlinked to the origina papers 
^porting each association and to other relevant databases. 

MedGene is an innovative extension of previous textmining 
approaches. Perez-lr*txeta et al. used the GO annotation and 
ft* chromosomal locations to predict genes that may con- 
tribute to inherited disorders.' MedGene takes a broader view 
and includes all diseases and all possible 
ships Furthermore. MedGene utilizes co-citation to ndicatc , a 
eEonship rather than GO annotation, which ^im^tothe 
subset of genes that have GO annotation. Our approach is 
comSementary to that taken by Chaussabel and Sher . who 
used the frequency of co-cited terms to cluster genes into a 
hierarchy of gene-gene relationships. 8 

A unique aspect of this tool is the ability to assess the relative 
strengths or gene-disease relationships based on the frequency 
oS co-dtation and single citation. This presupposes that 
most co-citations describe a positive association, often referred 
to as publication bias" and is supported by our observations 



that negative associations are rare (Supplemental Table 3: 
http//hlpseq.med.harvard.edu/MedGene/publication/s_Ta- 

ble 3 html) Of course, relationships established by frequency 
of co-citation do not necessarily represent a true biological link: 
however, it is strong evidence to support a true relationship. 

Another Important feature of MedGene Is the implementa- 
tion of software raters that substantially reduced the error rate 
We estimate that less than 10% of all associations were mussed 
and at least 70% of even the weakest associations were real. 
For this study, all of the filters that we applied were general 
ones. e.g.. expanding the list of all gene names to address the 
different syntax forms used by different journals, eliminating 
eene names that correspond to common English words, etc. 
The majority of the remaining search term ambiguities were 
idiosyncratic and difficult to identify systematically without 
causing a significant rise in false negatives. Alternative ap- 
proaches, such as the examination of the nearest neighbor 
terms, need to be considered to further reduce the false positive 

rate. . 

It is not uncommon to see expression changes in micro- 
array experiments as small as 2-fold reported in tHe literature. 
Even when these expression changes are statistically significant. 
It is not always clear If they are biologically meaningful. When 
comparing expression levels of disease to normal tissue, one 
expects an enrichment of known disease-related genes to 
appear in the altered expression group. MedGene provided a 
unique opportunity to test this notion in the context of existing 
knowledge on a novel breast cancer micro-array dataset. For 
genes displaying a 5-fold change or less in tumors compared 
to normal, there was no evidence of a correlation between 
altered gene expression and a known role in the disease. This 
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Table 3. Genes with Large Expression Cringes in ER- but 
Not in ER+ Breast Tumors 



gene symbol 



fold change (ER+) 



fold change (ER-) 



KRTHB1 

BRS3 

DKK1 

ZICJ 

TLR1 

KIAA0680 

CDKN3 

EBI2 

GZMB 

STK18 

GPR49 

mow 

LAD1 

P0LE2 

HMG4 

BCL2L11 

LRP8 

CCNB2 

CCNE2 

FOB 

KNSL6 

H1F5 

SERPINH2 

YAP1 

LPHB 

TCEA2 

TFF1 

C0L17A1 
.POPS 

BPAG1 

PDZKl 

VEGFC 

MUC6 

SERPINA5 

MEIS1 

CA12 



1.0 
1.2 
1.2 
1.9 
1.0 
2.6 
1.0 
4.0 
3.8 
4.7 
1.0 
1.6 
-1.0 
4.2 
4.4 
-1.2 
2.9 
1.0 
4.0 
-4.3 
2.9 
3.0 
4.6 
1.0 
-1.3 
-1.1 
1.3 
-4.1 
1.1 
-4.6 
-1.1 
-2.8 
-1.4 
-1.0 
-1.6 
2.4 



610.8 
89.4 
69.8 
59.6 
38.5 
33.2 
30.6 
27.9 
21.9 
18.6 
14.6 
14.4 
13.5 
13.0 
12.9 
12.3 
12.2 
11.8 
11.6 
11.1 
10.9 
10.2 
10.2 
10.0 
-10.4 
-10.8 
-11.4 
-15.7 
-16.2 
-22.3 
-36.8 
-51.5 
-64.9 
-83.1 
-85.9 
-150.3 



Tables. ^^X^ n ^R^ 
weak association except those marked with an . 

reflects the many genes whose role in breast cancer may not 
involve large changes in expression in sporadic tumors (e.g.. 
BRCA1 and BRCAQ and genes whose modest changes in 
expression may be unrelated to the disease. Strikingly, among 
eenes with a 10-fold change or more In expression level there 
was a strong and significant coirelation between expression 
level and a published role in the disease, providing the first 
global validation of the micro-array approach to identifying 
disease-specific genes. 

The results derived from MedGene have two implications. 
First, a careful hunt for corroborating evidence of a role n 
breast cancer should precede any further study of genes with 
less than 5-fold expression level changes. Second, any genes 
with 10-fold changes or more are likely to be related to breast 
cancer and warrant attention. It is likely that this threshold will 
change depending on the disease as well as the experiment. 

Interestingly, the observed correlation was only found among 
ER-positive tumors, not ER-negative. This may reflect a bias 
in the literature to study the more prevalent type of tumor in 
the population. Furthermore, this emphasizes that caution 
must be taken when interpreting experiments thar may contain 
subpopulations that behave very differently The MedGene 
approach identified a set of relatively understudied, yet highly 
expressed genes in ER-negative tumors that are worthy of 
further examination (Table 3). 
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In conclusion, we have developed an automated method of 
summarizing and organizing the vast biomedical literature. To 
our knowledge, the resulting database is the most comprehen- 
sive and accurate of its kind. By generating a score that reflects 
the strength of the association, it provides an important tool 
for the rapid and flexible analysis of large datasets from various 
high-throughput screening experiments. Furthermore. it-can 
be used for selecting subsets of genes for functional studies, 
for building disease-specific arrays, for looking at genes com- 
mon to multiple diseases and various other high-throughput 
applications. In the future, it will be possible to enhance the 
utility of the MedGene database by building links between 
genes and other MeSH terms as well as other biological 
processes arid concepts, such as cell division and responses to 
small molecules. 
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sive catalog was prepared of the gene 
expression changes that occur during 
morphologic maturation. To do this, 3'- 
end differential display, oligonucleotide 
chip array hybridization, and 2-dimen- 
sional protein electrophoresis were used. 
A large number of genes whose mRNA 
levels are modulated during differentia- 
tion of MPRO celts were Identified. The 
results suggest the involvement of sev- 
eral transcription regulatory factors not 



previously Implicated In this process, but 
they also emphasize the importance of 
events other than the production of new 
transcription factors. Furthermore, gene 
expression patterns were compared at 
the level of mRNA and protein, and the 
correlation between 2 parameters was 
studied. (Blood. 2001;98:513-524) 
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Although the mature neutrophil Is one of 
the better characterized mammalian cell 
types, the mechanisms of myeloid differ- 
entiation are incompletely understood at 
the molecular level. A mouse pro myelo- 
cytic cell line (MPRO), derived from mu- 
rine bone marrow cells and arrested devcl- 
opmentally by a dominant-negative 
rettnolc acid receptor, morphologically 
differentiates to mature neutrophils In the 
presence of 10 p.M retinoid acid. An exten- 

Introduction „ 

Studies of norma! myeloid maturation from many laboratories have 
identified genes that may play critical voles in myeloid differentia- 
tion^ Current studies suggest thai these events are dependent on a 
cascade of molecular changes that involve complex modulation of 
mRNA transcription. Furthermore, studies of acute leukemia have 
suggested that the disease arises from the accumulation of myeloid 
precursors arrested at early stages of differentiation and associated, 
in many cases, with chromosomal rearrangements that alter the 
structure of specific transcription factors. 5 Nevertheless, the molecu - 
lar events underlying the production of mature myeloid cells arc 
not wel! understood and appear to use interacting 'pathways and 
networks, the elucidation of which requires an extensive descrip- 
tion of the molecular components available to the myeloid cell. 

An extensive body of information is accumulating with respect 
to gene expression profiles of mammalian cells. However, much of 
the information available in public databases has been accumulated 
by the use of techniques such as single oligonucleotide chips or 
cDNA arrays that measure fewer than 6000 of potentially 30 000 to 
1 20 000 transcripts. The more l imited range of analyses reported by 
the serial analysis of gene expression (SAGE) 6 * 7 technique accu- 
rately estimates changes in levels of the more abundant mRNAs but 
requires extensive redundant analyses to measure changes in the 
patterns of expression of scarce mRNAs. We have used a modified 
polymerase chain reaction (PCR)-based cDNA differential display 
(DD) method in which single restriction fragments derived from 
the 3' end of cDNAs are separated on a sequencing gel.*- 9 Bands 
from the gel can be identified initial ly by sequencing, but then 
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comparison of patterns from different: samples can be made without 
further sequencing. This sensitive and reproducible method detects, 
in principle, most cDNAs regardless of whether they are repre- 
sented in existing databases. 

Systematic analysis of the function of genes can also be 
performed at the protein level. This approach has the advanUige of 
being closest to function, because proteins perform most of the 
reactions necessary for the cell. The most common method of 
protcome analysis is the combination of 2-dimensional gel electro- 
phoresis (2DE) to separate and visualize protein and mass spectrom- 
etry (MS) for protein identification.'" Several such analyses of 
yeast and of normal or malignant mammalian cells have been 
performed. To date, however, there have been few studies in which 
both mRNA and protein have been compared by applying analyses 
to the same samples. The studies of Anderson 11 and Gygi 12 showed 
that there is not a good correlation between mRNA and protein 
levels, in yeast or human liver ceils. However, other analyses 
disagree with this conclusion (Greenbaum et al, manuscript 
submitted, and Futcher et al 14 ). Furthermore, global correlations 
between changes in mRNA and protein levels have not been 
examined during ihe execution of any developmental program. 

The MPRO cell line was derived by transduction of a dominant- 
negative reiinoic acid receptor construct into normal mouse bone 
marrow cells. It is a granulocyte-macrophage colony-stimulating 
factor (GM-CSF)-dependent line arrested ax a promyelocyte stage 
of development lS - 16 After treatment with vlUrans retinoic acid 
(ATRA) most of the cells acquire the morphology of mature 
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neutrophils and begin to produce neutrophil lactoferrin and gelati- 
nase, 2 proteins characteristic of neutrophil secondary granules. 17 
As such, it offers a valuable model for studying neutrophil 
differentiation in vitro. 

We now report the analysis of mRNA expression changes 
during the process of MPRO cell maturation to neutrophils and 
compare the results with a limited analysis of cellular protein 
composition. mRNA expression changes were studied by combin- 
ing the use of oligonucleotide arrays and DD. A database (dbMC) 
with comprehensive genomic information tor myeloid differentia- 
tion program was constructed (accessible at http://www.bioinfo.mbb. 
yate.edu/expression/neutrophil). We have grouped the changes in 
mRNA levels of a large number of genes into 6 patterns, with 
implications for the genetic program of myeloid differentiation. 

We also compared 2-dimensional high-resolution gel eiectro- 
phoretograms from control cells and cells differentiated for 72 
hours in the presence of ATRA. Fifty protein spots whose relative 
intensity changed prominently during differentiation were exam- 
ined by mass spectrometry. The results suggest a poor correlation 
between mRNA expression and protein abundance, indicating 
that it may be difficult to extrapolate directly from individual 
mRNA changes to corresponding ones in protein levels (as 
estimated from IDE). 



Materials and methods 

Cell lines 

MPRO cells asd HM-5 cells provided by Dr Schickwann Tsai (Fred 
Hutchinson Cancer Research Center, Seattle, WA) 15 were used throughout 
the study. The cells proliferated continuously us a GM-CSF-<Icpcnden! cell 
line at 37 n C in Tscoves modified Dulhecco medium (Gibco BRL, Grand 
Island, NY) supplemented with 5% to 10% fetal calf scrum (Gibco BRL) 
and 10% HM-5-conditio»ed medium as a source of GM-CSK Morphologic 
differentiation of the blocked MPRO promyelocyte* was induced by 
treatment with 10 u.M ATRA (Sigma, St Louis, MO). Controls were 
cultured in the absence of ATRA but with the same volume of ve- 
hicle tcth&nol). 

RNA Isolation and differential display 

After exposure to 10 p.M ATRA for 0.. 24, 4S : or 72 hours, total cellular 
RNA was isolated from MPRO cells using TRIzol reagent (Life Technolo- 
gies, Gaiihersburg, MD). cDNA was then synthesized using a T-7 Sal-Oligo 
d(T) 32 primer as described previously." '* The double-stranded cDNA was 
digested with 1 of 9 different restriction enzymes (Apal, 5g/ll, JJomHI, 
Eagl, EcgR\> r/f'mUlI» Xbal, Kpn\ y and Sphl) and ligated to Y-shaped 
adaptors with a complementary overhang. DNA fragments were then 
amplified by PCR as described previously*'"' PCR products were separated 
on a sequencing gel ot 6% polyacrylauiide with 7 M urea. The gel was dried 
and exposed to x-r»y film. Genes from differential display gels, whose 
maximum intensity chauges equaled 2+ on a scale of 1 1 to B + , were 
recorded as significantly changed.' 9 Individual DNA bands were recovered 
from the gels, amplified by PCR, and sequenced. 

Oligonucleotide chip analysis of RNA samples 

Ten micrograms total RNA from each sample (0, 24 1 48, or 72 hours) was 
used to prepare cDNA. This cDNA was transcribed with T7 RNA 
polymerase lo prepare 2 fluDrcsccntly labeled probc. 20 ^ 1 Each sample was 
hybridized to mouse array chip (MulIK Array; Afiymcuix, Santa Clara, 
CA) containing oligonucleotide probe sets corresponding to approximately 
7000 known genes or ESTs represented by UniGcne clusters.- 1 cDNAs 
were considered present if their probe se t results were rated as such by the 
GeneChip software ( Asymetrix) and if the average difference (AD) 
between perfect match and mismatch probe pairs was not less 100 U. If a 



gene was represented by more than one array probe set, the average of all 
probe sets for the gene was taken. Genes Willi AD values between 1 00 and 
200 were considered unchanged because of their low expression levels. 
Those genes with AD values equal to or more than 200 U al one time point 
were further studied by rescaJing, threshold, and normalization methods 
described in the MIT Center for Genome Research Web site. 1 s A value of 20 
was assigned to any gene wife an AD below 20 at seme time point, 

Bioinformatics and database development 

AH the sequences cr gene fragments were searched using Blast against 
GenBank and TIGR gene indices. A database of genes or ESTs whose 
expression levels changed during myeloid differentiation was constructed 
containing information for each band or gene. This included GenBank 
matches. Locus Link or Unigcne clusters, nprcftsion patterns^ tissue 
distribution, synonym(s) protein name, gene name(s), notations of possible 
functions, poly A signal and sequence quality, and hyperlinks to the 
database searches, sequence trace tiles, and related references. All gene data 
were then gathered into o cluster file. Supplementary information is 
available at http://biotnfo.mbb.yale.edti/expression/heutrophil. 

Classification and analysis of DNA fragments 

Sequences from differential display analyses were classified as representing 
known genes, ESTs, genomic sequences, or novel genes as described. 1 *- 23 
Known genes from both differential display and arrays were clustered into 
27 functional categories and searched against SWISS-PROT (http:// 
www.cxpasy.cbi-.nrc.ca/cgi-biii/sprot-seaich-nil) or PIR (http://www.pjr. 
georgctown.edu/). Information such as function, subcellular location, 
family and super family c I assifi cation, map position, similarity, synonym^) 
protein name, gene namefs), and so on was recorded in a variety 
of databases. 

Northern blot analysis 

Thirty micrograms total cellular R*JA per lane fr«m time-course MPRO 
cells were loaded onto 1.2% foimaldehyde-agarose gels, then transferred to 
Hybon<l-N+ membranes (Amcrsham Pharmacia Biotech, Uppsala, Swe- 
den). Attcr standard prehybridization, membranes were hybridized over- 
night et 65°C with radiolabeled cDNA probes (ordered from Research 
Genetics according to their dbEST Image ID). Membranes were washed at a 
final stringency of 60 C C in 0.1 X SSC. 

Immobilized pH gradient 2 -dimensional gel electrophoresis 
and mass spectrometry 

mduced MPRO cells collected at 0 and 72 hours were lysed with lysis 
buffer (540 mg urea, 20 mg dithiolhrcitol, 20 pL Pharmalytc [3 - 1 0], 1 .4 m« 
phenyhncthylsulfonyl fluoride, I u.g each aprotinin, leupeptin, pepstatin A, 
and antipain 50 p.g TLCFC, and 100 p.g TPCK/i mL). We apphed 100 \iL 
each MPRO cell lysate (2.5 X I0 s ceUs/i00 jiL) to immobilized pH 
gradient (IPG) strips (pH 3-10 L; Amersham Pharmacia Biotech), and IPG 
clecuoplioresis was conducted for 16 hours (20 100 Vb) using an immobi- 
line Drystrtp Kit (Amersham Pharmacia Biotech). Electrophoresis in the 
second dimension was carried out in a 12% sodium dodccyl sulfaic- 
polyacrylamide. gel electrophoresis (SDS-PAGE) gel with the Laemmlt- 
SOS continnous system m a Protean II xi 2-D cell (Bio-Rad) run at 40 mA 
coustant current for 4.5 hours. Proteins were delected by Brilliant Blue 
G-coIloida) staining. 24 Protein spots were excised from the gel and digested 
with trypsin. ACTH clip (aveiagc fM+Ii] 2466.70) and bradykinin 
(average [M+H] 1 06 1 .23) were used for calibration of peptide masses. One 
microliter sample digest was mixed with 1.0 pL a-cyano-4 -hydroxy 
cinnamic acid (4.5 mg/mLin 50% CTT 3 Ch\ 0.05% TFA) matrix solution and 
1 pL ealibrants (100 finol) each. The spectra of the peptides were acquired 
in reflect or/delayed extraction mode on a Voyager- DE STRmass speedom- 
eter (Perseptivc Biosystcms, Poster City, CA). Peptides were identified 
using the Propound search engine. 19 
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Results 

Differentiation of M PRO cells 

Figure 1 illustrates the morphologic changes in an MPRO cell 
population representative of those used lor RNA expression 
analysis. Undifferentiated MPRO cells resembled promyelocytes 
under Ihc light microscope (Figure I A), After induction with ATRA 
for 24 hours, the cells morphologically differentiated into metamy- 
elocytes (Figure IB). At 48 hours, die cells further developed 
into metamyelocytes and band neutrophils (Figure iC). At 72 
hours, nearly 100% of MPRO cells became mature neutrophils 
(Figure ID). 

Identification of mRNAs by differential display assay 

MPRO cellular mRNA was analyzed at 0, 24, 48, and 72 hours after 
ATRA treatment. Nine restriction en2ym.es were used in a 3 '-end 
DD approach. During MPRO differentiation, 1109 fragments 
corresponding to 837 transcripts were found to change substan- 
tially in expression levels (Figure 2). These represented approxi- 
mately 279 known genes, 112 ES'ls, and 59 putative new genes, 
each with a perf ect or fair polyadenylation signal at an appropriate 
distance from the oligo-dT priming site. The gene information 
detected by DD was collected in database dbMCd. 

Identification of mRNAs by oligonucleotide chip assay 

We used an oligonucleotide chip containing 13 179 probe sets 
corresponding to approximately 7000 murine genes to analyze 
patterns of mRNA expression in the same RNA samples used for 
DD. The information obtained by oligonucleotide arrays was 
collected in the da tahase dbMCa. 

We clustered the genes by their simdarity to idealized 
expression patterns. For instance, the expression pattern of an 
ideal gene that is overexpresscd (high) at time 0 and undercx- 
pressed (low) at 24, 48, and 72 hours, would be high-low-Iow- 
Jow (HLLL). Overall we have (2^-2) idealized patterns exclud- 
ing HHHH and LLLL. Pearson correlation was used as the 




Figure 1. Morphology of MPRO cells during differentiation. MPRO cells ware 
Induced as described in 'MateKais and methods,' concentrated by cytospirt, and 
WrKjht-Giemsa stained. (A) Uninduced MPRO cells. (3) MPRO cells induced with 
ATRAfcr 24 hours. (C) MPRO celis Induced wfih ATRA for 4 8 hours. (D) MPRO ceils 
Induced with ATRA for 72 hours. 
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Figure 2. Distribution of genes obtained by DD assay. MPRO oelJ mRNA was 
analyzed at 0. 24. 48, and 72 hours after ATRA treatment; 1109 fragments 
corresponding to e37 transcripts were found to change substantially in expression 
levels, me total 337 transcripts were classified into 6 categories according to the 
bloinforrnatic onalysis. Percentages show the geno distributions !n these 6 catego- 
ries. Information for each transcript was collected In daiabase dbMCd. 



measure of similarity of each gene expression pattern, 
* ~ (x iy x 2t x iw x 4 ) to each of the 14 ideated patterns 
y s (y^'i-ysy*)- The 4 entries of x and y corresponded to the 
4-dimensional gene expression levels at 0, 24, 48, and 72 hours, 
respectively. Each gene was assigned to a cluster labeled by the' 
idealized pattern that had the maximal correlation with\hat 
gene. We selected only genes that hybridized well compared 
with the background (considered "present" by GeneChip soft- 
ware) and had maximal AD amplitude greater than 200 U in at 
least 1 of the 4 stages. We farther tabulated the 14 patterns 
according to whether the gene expression changed ot early 
(0-hour), intermediate (24- and 48-hour), and late (72-hour) 
time points and whether gene expression monotonically in- 
creased (up-regulated), monotomeally decreased (down-regu- 
lated), or was not raonotonic (transient). Table t shows 8 
clusters of 104 genes that had significant changes of mRNA 
levels, arranged according to the temporal stage and the 
monotonic/transient changes of expression levels. 

Principal component analysis determined whether vvc could 
comprehensively present multidimensional data (4-dimcnsional in 
our case) in a simple 2-dimensional graph. First, we found the 4 
principal components, which were the axes of the most compact 
4-dimensional ellipsoid that encompassed the 4-dxrncnsiona! cloud 
of data. Each axis was a different linear combination of the original 
4 variables. Then we verified that the first 2 principal components 
(the first 2 largest axes of the ellipsoid) captured most (95.2°/o) of 
the variation of the data. Therefore, the data could be faithfully 
projected (with a minor loss of information) into a 2-dimensional 
graph, with the 2 largest principal components as the x- and y-axes. 
As shown in Figure 3, genes tend to coalesce in clusters, according 
to their labels determined by their similarity to an ideal expression 
pattern. In summary, a genomic (global) picture of the distribution 
of genes according to their similarity to predetermined idealized 
multidimensional expression patterns is concisely displayed in a 
2-dimensional graph. 
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Table 1. Genes differently regulated during tho different stages of mouse promyelocyte cell lino differentiation t 



Timing 



Category 



Early 



Middle 



Up-regutation 



Down-regulation 



L>IHH (n = 10) 

MarfP2rxi ttgb2 H1r2 Lcn2 //prS 

CebpbH2-D Eloh;6Zyx 
HLLL (n = 11) 

Tcrg-V4 LyG4 Ctsg Spi?-1 McptQ 
Myc Myb Tlr4 Npml Erh Hsp60 



Late 



Transient 



LLHH (n 6) 

Phat Cybb Pfc PiraS Cd53 \fngr2 

HHLL(n = 1) 
Mpo 



LLHL (n = 9) 

Se3 Ktf2 PlraG Pirb Lst 1 Ltf Sema4 d Slate Mmp9 
LHHL (ri » 17) 

Ccbpa LyzB Fcgr3 Arf5 Lompl S:at3 Csf2ra Osl 
Actfl Sfpil Gpx3 Ptprc Prtn3 irf 1 Rps6kat 
LtMrMyln 



LLLH(n= 13) 

111a CsQrU C*s/S100a8 L-CCR Ctss 

Aldol Rac2 F P r1 Ctsd Ubb Ptmb4 
HHHL (n - 37) 

Actx trtt EL2 Rpl19 Actb LySe Affi Hist2 
Psrrta2 Gnas 2fp36'U4ta LforShfdgl 
Max RpsB Csf2rbl Slpi Tctexl Tpl 3!f3 
CwrGys3Slc10a1 Ctsb Seppl R:n3 
Ccnb2 S10059 Cf 1 1 Hlst5-2ax Rela 
Copa Gstml Gnb2-re1 Gm RPLB 



Arrays o! Affymetrix Mu1 k cohtamlng 13103 probe seta corresponding to 12002 GenBank accessions were used for hybridization. Arrays were hybridized with 
^ptav.duvphycoeiylhnn (Molecyfcr Probes, biotrn-labeled RNAand scanned. Intensity for each feature of the anay was captured using Genechip software (ArWtm) and 
^S^express^ 

IT .IS^T Each gene expression profile was categorized ss described in Tables 3. 4, end 5. For the 4 time points, the minimum AD of the 

relat^hlgher group (MIN-H) was d^ed by tho maximum AD of the relatively low gn>up (MAX-L). and those gen es whose MIN-H/MAX-L gLter than 2 wr^etodwl « 
meaningful* regulated. Genes were sorted In descending order based on the MIN-H/MAX-L Genes In boldface are those whose expression level was In the top ?C% fie 
mawmum AO of 4 tor« points greater than 3000), and genes in italic* are those in the bottom 20% <ie. maximum AD of 4 time points less than 300,. The differential period 
was grouped into 3 stages: earVfO-r^ * ' enuawm penoo 

AD indicates average difference; gene symbols are expanded In an Appendix at the end cf this article. 




Figure 3. Gens clusters In the first 2 principal componont spaces. Principal 
component analysis aliowed us to present the multidimensional data (!n this case, 
4-dimensiorjal data of each geno expression pattern) In a simple 2-dimensbnal 
graph. We derived the 4 principal components, which are a linear combination of the 
standardized expression intensities (zero mean and unit variance) at 0, 24, 48. and 
72 hours. The first 2 principal components captured most of the variation of tho data 
(approximately 85%). Therefore, the data can be displayed (with a minor loss of 
(".formation) « a 2-dimensbnat graph. The first and second principal components, cl and 
c2, ere given by the fir-ear combirratwns o, = 0.747 • n1 - 0.11 - n2 - 0.656 • n3 + 0 • 
n4 and = 0.278 - nl + 0.353 * n2 + 0.233 • n3 - 0.B63 - n4, where nl. n2, n3. 
and n4 are the rescaied aixl standardized expression Ievel3 at 0, 24, 48, and 72 
hours, respectively. The exes legends d and c2 stand for the first 2 principal 
components. In this paper we used the Pearson correlation to measure the simBarRy 
of each gene with the idealized expression patterns, ss opposed to the Et.*c&dean 
distance we used In a previous work. 19 because clusters were better separated using 
this measure. In both cases, we presented the data In the 2-d:mens tonal space of the 
lowest principal components. The data had a tendency to be drcularly distributed 
when we used the Pearson correlation es a distance measure. 



Correlation between array and DO analyses 

We have previously demonstrated a correlation coefficient of 0.93 
between visual estimates of changes in band intensity on DD and 
Phosphorirnager System (Molecular Dynamics, Sunnyvale, CA) 
estimates of band intensity and a correlation coefficient of 0.88 
between hybridization intensity changes of mRNA on Northern 
blot analyses and changes in band intensity on DD. 19 In a few cases 
there were clear discrepancies in the pattern of expression of a 
gene, as estimated by DD and by oligonucleotide chip analysis. We 
chose the 6 most extreme cases and examined the levels of mRNA 
change for these genes by Northern blot analysis (Figure 4). In S 
cases > the Northern blot results agreed with the results of the DD 
analysis, whereas Che results of Gnb2-rsl disagreed with the 
oligonucleotide array but duplicate bands from DD showed a 
relatively high level of expression in the 0 time sample that did not 
correlate with the Northern blot (Table 2). One possible explana- 
tion for these findings was the change in the relative use of different 
polyadenylation sites after the addition of ATRA to the MPRO cells. 

Constructing a database for mRNA level changes during 
myeloid differentiation 

Based on the data obtained ebove, an in-house database (dbMC) 
was constructed that included 2 subdatabases, dbMCd and dbMCa, 
for collecting gene information from DD or oligonucleotide arrays, 
respectively. Each entry in dbMC is accompanied by a so-called 
executive summary. The linkage between dbMCd and dbMCa was 
established by UniGene ID and cluster ID- dbMC contains the 
temporal expression patterns of genes during the MPRO cell 
differentiation process, including not only products represented in 
public databases but also novel transcripts. 
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Figure 4. Northern blot analysis of selected mRNAs. Equivalent amounts of RNA 
tram MPRO cells Induced byATRAat different time points (0 hour, 24 hours, 46 hours, 
and 72 hours) were resolved by formaldehyde^ arose gel electrophoresis, stained 
io verify the amount of loading. Eleven genes were separately probed on the RNA 
filters. The gene symbol of each probe was listed at the left of a related Northern blot 
resul Delated information on these 11 probes was listed in Table 5. One of the 
RNA-bbtted membrane photographs is shown with methylene blue-stained 23S and 
1SS RNA subunils demonstrating the quality and quantity of RNA loaded !n 
individual lanes. 



Analysis of gene expression patterns during MPRO 
differentiation 

Many of the genes identified in this study were found in myeloid 
ceils or were implicated in myeloid development for the first time. 
We detected 8 cytokines 25 and chemolcines whose mRNA levels 
changed more than 5-fold by arrays and 2-fold by DD during the 
maturation of MPRO cells (see our Web site, http://bioinfo.mbb. 

Table 2. Expression patterns of genes detected by Northern blot analysis 



yale.edu/expression/neutrophil). Among these were 2 members of 
the CC chemokine family. Interleukin-1 a (IL-la) was up-regulated 
at the late siage of dilferentiation (LLLH pattern, Table 1). 

mRNA for approximately 52 receptors was detected by one or 
the other method. A number of the receptors known to be present on 
mature neutrophils showed late induction of mRNA, and their 
levels of induction were high, indicating that the expression of. 
these products is a prominent event late in neutrophil maturation 
(Table 3). Rarely was mRNA for receptors down-regulated, 
consistent with myeloid maturation being accompanied by increas- 
ing responsiveness of the cell to a variety of external stimuli. 

Expression of mRNA for granule proteins 

Neutrophils contain several types of granules that develop at 
different stages of myeloid maturation. 3 - 1 W Levels of mRNAs 
encoding secondary granule proteins, such as lactoferrin, increased 
as the cells matured (Table 4). The level of mRNA for Mmp9, 
reported as a tertiary granule protein, increased markedly between 
24 and 48 hours after the induction of differentiation, whereas 
mRNAs for secondary granule proteins either increased less 
markedly or showed a maximum increase by 24 hours. mRNAs for 
several primary granule constituents, such as myeloperoxidase and 
cathepsin G, were present in unstimulated cells and decreased as 
the cells matured. There was a discrepancy in the measurements of 
proteoglycan mRNA by DD and oligonucleotide chips, but North- 
ern blots showed that it reached a peak at 48 hours and then 
declined (Figure 4). Cathepsin D is reported as a primary granule 
protein, but its pattern of mRNA expression more closely re- 
sembled that of secondary granule constituents. In addition to 
known granule components, mRNAs for several other cathepsius 
were up-regulated during myeloid differentiation, in parallel with 
or later than the tertiary granule protein mRNAs. 

mRNAs for transcription factors 

Transcription factor genes, including several identified at the sites 
of consistent chromosome rearrangements in acute myeloid leuke- 
mia, have been implicated in normal myeloid differentiation and in 
the expression of neutrophil proteins. 2 ' 5 * 27 However comprehensive 
information concerning the expression of these transcription fac- 
tors during myeloid development is not readily available. There- 
fore, we compared gene names and identifiers in our databases to 
those of the transcription factor database Trans fac (hrtp:// 



Gene 
symbol 


Gene 
accession 




AD value by array 








Intensity by DD 




Oh 


24 h 


48 h 


72 h 


Oh 


24 h 4B h 


72 h 


Cebpa 


M62362 


33 


212 


182 


4'. 










Cebpb 


X62600 


390 


1248 


1380 


1903 










Cefcpd 


X61B00 


157 


262 


168 


*30 










Cebpe 




















Myb 


Ml 2848 


BS2 


356 


230 


435 










Sipl 


U73004 


617 


501 


783 


402 


1 


2 


3 


3 


Prp3 


W45H34 


153 


259 


339 


345 


5 


i 


1 


2 


GnbZ-rel 


X?53t3 


4231 


3623 


3215 


3403 


4 


A 


1 


1 


Ly6e 


U0426B 


3061 


5321 


2344 


1262 


3 


2 


1 


1 


Up1 


M90316 


65 


376 


640 


28 


2 


3 


5 


6 


Actb 


K03765 


3095 


3588 


3976 


2<34 


1 


2 


3 


2 



Gens symbol and gene accession refer tc National Center for Biotechnology Information databases and, In particular, to Locus Link. AO value Is the average difference In 
the value of hybridization intensity between the set of perfect* matched oligonucleotides and the sot of mismatched oligonucleotide In the oligonucleotide array 3and 
intensities from DD were semiquaniified on a seals from 1 (+) to 8 + + These estimates are shown as boldface numbers in this table. 1 * Both AD value and 
intensity of genes were studied at 4 time points corresponding to MPRO ceils Induced for the indicated times. 

DD indicates differential deploy; MPRO, mouse prcmyelocytic cell Elne; for gene symbols, see the Appendix at the end of this article. 
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Table 3. Receptors expressed during myalold differentiation process 



Maximal fold change 



Less than 2 



2 or more, less than 3 



3 or more, less than 4 



4 or more, less than 5 



Gene symbol 


Gone accession 




AD value 


by array 




Oh 


24 h 


43 h 


72h 




D21207 


641 


658 


S81 


B87 


^mkar4 
-m - r 


X99581 


508 


447 


378 


684 


try . 


M34173 


433 


384 


506 


506 


Csf2rh1 


M34397 


31B 


345 


410 


241 


HtrSa 


21B273 


166 


272 


273 


339 


W6pr 


X64058 


536 


409 


408 


649 


MPPtR 


AA1 16789 


232 


B4 


63 


3B1 


TCRGB 


M2SD53 


165 


212 


244 


299 


Tnfrsfla 


M5S377 


0 


1 




1 


Cmkbrl 


U2B404 


221 


244 


. 504 


638 


Crhr 


X72305 


121 


200 


250 


355 


Csf2ra 


MB 5078 


171 


372 


402 


254 


IEW3 


AF013114 


187 


270 


426 


148 


Gfidl 


D10171 


128 


164 


150 


257 


Ifngr 


J0S265 


141 


' 263 


327 


251 


M2rg 


U21795 


205 


184 


231 


477 


Ldfr 


X64414 


13S9 


1653 


1665 


3968 


P40-8 


J02B70 


849 


677 


381 


640 


Plaur 


X62701 


312 


443 


476 


734 


Rorg 


M34476 


102 


113 


114 


218 


Srbl 


U37799 


126 


232 


132 


258 


Cr2 


M29281 


83 


138 


243 


77 


Csf2rt>2 


M2SB55 


209 


249 


437 


111 


Kcerlg 


J05020 


23S8 


2766 


3365 


6751 


Fccr2b 


X04B4B 


1703 


1652 


1431 


4605 


Ifngr2 


U69599 


1 


2 


2 


3 


Nr4e1 


X 16995 


96 


168 


202 


401 


I11r2 


X59769 


482 


1796 


2672 


3816 


C5rt 


LQ5630 


185 


434 


BOB 


1078 


Drd2 


K55B74 


0 


0 


0 


2:9 


Fccr3 


M14215 


1 


1 


1 


2 


Fprl 


U2161 


0 


69 


141 


671 


GCR 


AA240711 


2 


0 


0 


0 


L-CCR 


AA034646 


48 


trs 


314 


2056 


NMDARG8 


AAB2021 1 


2 


2 


0 


0 


P2nct 


XB489B 


79 


346 


530 


744 


Piral 


U96652 


0 


43 


172 


378 


PiraS 


U9S6B6 


274 


391 


954 


1B74 


PiraG 


U9G687 


122 


635 


2014 


1716 


Pirb 


U3B689 


191 


445 


966 


747 


Sell 


M25324 


46 


104 


570 


20 


. Tcnj-V4 


M5499B 


1650 


7B 


65 


31S 



1«. EStT W T ^ "T™ 5 ^ VatU£S wera mnre ,han ° r equaI t0 200 U in thi6 GeneB sorted by their expression patterns as follows: 

SL^-EZ^.^ Tl » . ^ * V3,UCS Abbreviat,on3 Cf °™ — « «•»"• "™ Q-» symbols Hated in the locus Link portbn of the Natlona: 
Center for ^technology Inforw.t.nn database available. Numbers in hold denote those gene expression patterns Attained by deferential dTsplay .attar than by 

oligonucleotide errsy assays. Th a other information is presented as in the legend to Table 2. ^ 
AD Indicates average difference; gene symbols are expanded in an Appendix at the end cf this article. 

www.transfac.gbf-bratinscliweig.den'RANSFAC) and determined 
which factors contained in this database were present at detectable 
levels in MPRO cell mRNA, using Asymetrix software for the 
criteria for inclusion of mRNAs from approximately 200 murine 
transcription factors probe sets on the oligonucleotide chip. Of 
these, SA were oppressed and 13 showed changes of 3-fold or more 
in chip signal (Table 5). 

The changes in certain traiiscription factors, such as the moderate 
dovvn-regulation of myb and myc and the up-regulation of (he Max 
ditnerizalion protein MAD, were consistent with the shift of the cells 



from a proliferative to a differentiated state. 2 * Some changes are more 
difficult to explain, such as the up-regulation of DPI , a partner for E2f 
factois in the regulation of S-phase genes, and the mild up-regulation of 
the Id genes, commonly associated with an inhibition of differentiation 
by competition with bHLH traiiscriplional activators. 29 

The C/EBP family has been extensively studied with respect to 
myeloid differentiation. 2 * 30 Absolute levels of the C/EBP a and 6 
mRNAs were low, probably at the borderline of significance for the 
oligonucleotide chip assay, whereas the level of C/EBP p appeared 
higher. In addition, there were discrepancies between the chit) 



BLOOO.IAUGUSTlOOI.VOLUMEW.NUMBaa GENE AND PROTEIN EXPRESSION IN MYELOPOIESIS 519 

Table 4. Granula constituents oxpressed during mouse promyelocytic eel! line cell differentiation 



Granule constituent 



Gene symbol 



AD value by array 



Azurophil (prima ry) granulrss 



Possible granule proteins 



Specific secondary granules 



Tertiary granules 



Man2c1 

Ctsb 

Ctsd 

Ctsg 

E12 

Ela? 

Gus-s 

Lyzs 

McptB 

Mpo 

Prg 

ctsc 

Ctse 

ash 

Ctsl 

CtS3 

Cpa3 

Cd36l2 

Cnlp 

Cyhb 

Ear2 

Fpr1 

Hgb2 

Lcn2 

Ltf 

M9P 

Mmp13 

nop 

Mmp9 



Gene accession 


Oh 


24 h 


43 h 


72 h 


AA16ie63 


178 


134 


99 


164 


M65270 


442 


480 


595 


389 


X528S6 


214 


1087 


1828 


2784 


M96801 


1509 


405 


46 


235 


U04S62 


658 


1273 


843 


157 


AA589D16 


47 


159 


134 


163 


M63B36 


544 


226 


266 


254 


M21050 


0 


1 


i 


3 


X78545 


831 


268 


66 


491 


X1537B 


3788 


3009 


776 


692 


X16133 


2621 


2653 


2920 


9659 


AA144BB7 


252 


194 


342 


576 


X97399 


1 


3 


4 


5 


U06119 


45 


124 


195 


156 


XGB086 


16 


11 


31 

83 


237 


AA089333 


12 


g 


463 


J05116 


621 


270 


90 


601 


ABO0B553 


113 


93 


157 


187 


X94353 


80 


479 


704 


626 


U433B4 


6 


24 


91 


128 




0 


1 


1 


2 


L22181 


178 


220 


235 


845 


X14951 


0 


2 


4 


2 


W13166 


916 


3513 


3931 


6036 


JQ3298 


19 


162 


333 


136 


W45B34 


5 


1 


1 


2 


X66473 


44 


43 


72 


178 


L377.97 


2661 


4782 


2311 


6912 


Z27231 


0 


1 


2 


2 



vh.fi H h^i h ?r^ fl ™^ r ° lein <*> NA f verted on the oligonucleotide arrays, sorted by their expression patterns asfoHows: first by 
An n' granu!e JJ*. and ,asI b * lhe 3, Pha*etic3l onler of gene symbols. Data are presented as described in the legend to Table 3 
,nd,cates averag B difference; gene symbols are expanded in an Appendix at ihe end of this article. 



the average difference AD 



estimates and the mRNA levels observed by Northern blotting with 
specific probes for these genes. In particular, the latter method, 
more sensitive and specific, showed that C/EBP a began to decline in 
the most mature cells, whereas C/EBP 6 mRNA declined progressively 
beginning at 24 hours after the onset of differentiation. 

C/EBP e is a more recently cloned C/EBP family member. Previous 
studies indicated it is expressed in a large amy of human leukemia cell 
lines blocked at various stages of dilTerentiation and that it is up- 
regulated during granulocytic differentiation. 31 A C/EBP c probe was 
not included in lhe oligonucleotide chips, and this mRNA was not 
detected by DD. Therefore, we examined the C/EBP c expression 
patterns by qiianlitative PCR and Northern blot analysis (Figure 4). 
OEBP c exon I was PCR amplified from MPRO RNAs using primers 
RY48 (AG0C(X<XACACO7ITGArGA) and RY49 (TGGCACACT- 
GCGGGCAGACAG). 52 The results showed that C/EBP € is expressed 
throughout myeloid differentiation, with expression levels increased 
moderately in the later stages. 

We detected a number of other transcription factors that arc 
broadly expressed or that have been reported in other studies of 
hematopoiesis (Table 5). Some of the factors that were most 
strongly induced during differentiation have been studied in other 
contexts but not previously implicated in hematopoiesis, such- as a 
mammalian homologue to the DrosophiJa enhancer of split gene, a 
transcriptional silencer. The mammalian gene is expressed at 
relatively high levels as measured by the oligonucleotide chip and 



is a candidate for mediation of the silencing of growth-related 
genes in the maturing neutrophil. Another candidate transcriptional 
silencer, Tiflb, may serve as a corepressor for the ICRAB domain 
family of zinc finger transcription factors and also may mediate 
binding of the heterochromatin protein HP1 to DNA. 33 

Hiere were 26 transcription factors whose mRNAs showed no 
significant changes by oligonucleotide chip analysis and were not 
identified as differentially regulated genes by differential display 
assays. PU.l, a factor necessary for the production of neutrophils 
and the expression of several neutrophil genes,- 14 showed less than a 
3-fold increase in mRNA, below the threshold for a significant 
change. Other candidate hematopoietic transcription factors, such 
as PEBPlaB2 (AMLI). GATA-1, and SP-2, were represented on 
the oligonucleotide chips, but their mRNA levels were so low that 
they were reported as absent in this study. The possibility that small 
changes in the levels or ratios of some transcription factors could 
produce marked changes in transcription potentially limits the 
ability of data generated by present methods to explain transcrip- 
tional changes during differentiation. 

Protein expression patterns of MPRO cells during 
ATRA induction 

We visually compared the 2DE patterns from MPRO cells at the 
same time points used for mRNA analysis. In most cases the 
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Table 5. Transcription modulators presented during myeloid differentiation 



Maximal fold change 



Gene symbol 



Gene accession 



AO value by array 



24 h 



Less than 2-foid 



2 or more, less than 3 



3 orrrwe. lass than 4 



4 or more, less than 5 



2fp11-5 

8tf3 

Gata2 

Hmgl 

Idb1 

Max 

Nfatc2 

Pm1 

Rarg 

Reia 

Sox15 

Ybxl 

Zfp162 

Cebpd 

!db2 

Jundl 

Lyl1 

Nfe2 

Nfkbl 

Pbx1 

stpn 

Tiflb 

Trp53 

Usf2 

Ybx3 

2fp2l6 

Irfl 
Ktf2 
Wyb ' 
Stat3 
Tfdpl 

Cobpb 
Stre14 

Cebpa 
Grg 
.Mad 
Myc 
Bohi6 
TBX1 



AB020542 

W13502 

AB000D96 

J04179 

M31BSS 

M63903 

AAS60093 

U33626 

M34476 

M61909 

WS3527 

M62B57 

Y12B38 

X61800 

M69293 

W20355 

X57687 

L09600 

L28117 

AF0201P6 

A34693 

U67303 

P10361 

U12263 

L35S49 

AA510137 

M21065 
U2SC96 
Ml 2348 
AA396023 
Q08639 

X626C0 
Y07836 

M62362 
X73359 
X831CB 
L0DO39 
IV83667 
AA542220 



2630 
3 

562 
337 
455 
256 
2313 
173 
102 
297 
419 
643 
671 

157 
244 
1274 
399 
458 
953 
611 
375 
673 
259 
120 
96 
82 

85 
62 
692 
484 
307 

390 
223 



0 

314 
169 
0 



2959 
3 

770 
348 
787 
224 
3218 
261 
113 
260 
461 
469 
734 

262 
210 

2002 
342 
743 

2044 
303 
784 
659 
149 
185 
169 
151 

207 
66 
356 
1057 
560 

1248 
383 

212 
555 
111 
112 
3e6 
0 



2795 
2 

472 
177 
721 
312 
2396 
329 
114 
304 
484 
472 
720 

163 
310 
1434 
347 
1042 
1H76 
345 
991 
420 
125 
285 
210 
204 

278 
246 
230 
1012 
505 

1360 
510 

182 
916 
167 
62 
313 
1 



2515 
1 

730 
232 
637 
172 
2542 
305 
218 
244 
637 
496 
' 992 

430 
604 

3085 
891 
505 

2034 
212 
520 
663 
361 
192 
119 
106 



77 
435 
290 
1093 

1903 
935 



1005 
327 
173 

1003 
2 



Shown ere the tronscnpl.cn factors identified as present by the oligonucleotide array analysis whose maximal AD b-tween perfect match and mhWrh rfi^^M 
sets was greater than or equal to 200 U In this study. Data ore presented as described in the legend to Table 3 oligonucleotide 
AD Indicates average difference; oene symbols are expanded !n an Appendix at the end of this article. 



peptides identified for a given protein were deri ved from regions 
along the entire length of the protein, indicating the observed 
products were not the result of proteolytic degradation. These 
data must be considered with several caveats: membrane and 
other hydrophobic proteins and very basic proteins ore not well 
displayed by the standard 2DE approach, and proteins present at 
low levels will be missed. 35 In addition, to simplify MS analysis, 
we used a Coomassie dye stain rather than silver to visualize 
proteins , and this decreased the sensitivity of detection of minor 
proteins. The MS method we used was sufficiently sensitive lo 
identify proteins that could barely be visualized by colloidal 
blue staining. However, a limitation of the method for the mouse 
is that the current database lacks predicted amino acid sequences 
for a substantial fraction of murine genes. In addition, very 
small proteins give only a few peptides, making statistically 
confident identification difficult. 



Figure 5 shows the analytical colloidal blue-stained 2DE IPG 
reference maps of differentiated MPRO cells. Expression patterns 
of more than 500 protein spots were detected and observed through 
the entire series of gels. Protein spots could easily be cross- 
matched to each other, indicating the reproducibility of the method. 
As marked on the gel pictures (Figure 5), 50 proteins with a wide 
range of molecular weights ( I to 200 kd), isoelectric points (4 to 9), 
and abundances were subjected to MS protein identification. The 
results are presented in Table 6. 

Comparing the theoretical value of the molecular weight and pi 
of each protein to that of the observed value, we confidently 
identified 28 proteins in the expected position on the gels (spots I to 
28). Some of the other proteins with strong matches to the murine 
databases migrated to a somewhat unexpected pi position. Nine 
spots gave clear peptide peaks on mass spectroscopy but did not 
match any known gene. Their identification will require amino acid 
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Figure 5. 2DE clectrophorctograrns of MPRO cells. 
MPRO cell lysate (2.5 x 10^ cetl/saropte) was loaded for 
2DE analysis. Gete wens stained with btillianl blue G~cot- 
ioida! dye. (A) 2DE map of uninduced MPRO cefl (0 hour). 
(B) 2DE map of matured MPRO cells (72 hours). Protein 
spots marked In the maps were considered differentially 
expressed and were 6ub|ecterf to MS analysis. The 
resident protein information is fisted in Table 6. 



IPGs. 
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sequence analysis or availability of more extensive murine data- 
bases. We searched for the expression patterns of the genes cognate 
to the expressed proteins in dbMC (Table 6). Nineteen genes were 
found in dbMC, the mRNA for 5 genes was reported as absent, and 
13 genes were present during MPRO differentiation. Comparison 
of the expression patterns showed only 4 genes of 18 present on the 
oligonucleotide chips whose expression was consistent at the RNA 
level and protein level. None of these was on the list of the genes 

Table 6. Correlation of expression patterns between mRNA level and protein level 



that were differentially expressed significantly (5-fold or greater 
change by array or 2-fold or greater change by DD). 



Discussion 

We explored the temporal patterns of »ene expression during 
myeloid development A database has been developed to provide a 



Spot 



Protein defint:ion 



1 GRP78 

1 Artin, gamma, cytoplasmic 

3 RHO GDI 2 

4 Proliferating cell nuciea r antig en 

5 APS kinase 

6 Pyruvate kinase 3 

7 Melanoma X-actin 

B Glyceraidehyde-3-phosphate dahydrogenase 

9 Stetln 3 

1 0 Guanine nucleotide binding protein, beta-2, 

related sequence 1 

11 Trfose phosphate isomerase 

1 2 Testis -derived c-abl protein 

1 3 RNA binding mutif protein 3 

14 CoEopsin response mediator 

15 LaminA 

16 47-kd keratin 

17 std47of> 

18 MHC class H H2-IA-bsta-5 

10 Androgbn-binding protein; subunft alpha 

20 Neuronal apoptcsis inhibitory protein 

21 PAD type IV 

22 Human serum albumin homologous 

23 syr.crip 

24 Trartsarnidirtase 

25 PGK crier phcsphogiycerele 

26 Proliferation-associated gene A 

27 Putatuve peroxisomal antioxidant enzyme 

28 IgE chain C2 region 



The proteins (feted here are represented by the spots rnarfed in the e'ectrophoretograms shown in Figure 5 
Protein definition, Gi number, and predicted value refer to the protein nam*, accession number < ' 



Gi number 


Predicted 
vstue 


Percentage 
(%} ' 


2DE pattern 


cDNA expression 
pattern 




kd 


p' 


Oh 


72 h 


Oh 


72 h 


2505545 


72.4 


5.1 




1 


3 


1321 


1043.3 


N 


6752954 


41.77 


5.3 


40 


3 


6 


0 


2 


Y 


249470.3 


22.83 


4.S 


33 


3 


3 


341 


441.6 


Y 


7242171 


. 26.77 


4.7 


42 


1 


0 


544 


430.9 


Y 


4038346 


69.8 


7.1 


24 


2 


1 


43 


50.7 


N 


6755074 


57.9 


7.2 


48 


6 


4 


3047 


saeo.a 


N 


6571509 


41.72 


5.3 


39 


1 


3 


2539 


341.3 


N 


6579937 


35.79 


8.7 


39 


B 


T 


3073 


5742.3 


N 


461911 


10.59 


5.9 


4B 


0 


4 


N/A 


N/A 




6580047 


35.06 


7.9 


21 


4 


2 


139 


303.1 


N 


6678413 


23.69 


6.9 


26 


3 




3312 


. 2630.1 


Y 


1196524 


17.19 


7 


51 


2 


3 


152 


126.9 


N 


794S121 


16.59 


6.8 


25 


1 


0 


626 


812.4 


N 


6681019 


62.16 


6.4 


36 


2 


0 


Absent 


Absent 


N 


220474 


47.52 


6.6 


35 


2 


0 


Absent 


Absent 


N 


52783 


35.62 


4.e 


29 


3 


0 


Absent 


Absent 


N 


5931555 


31.3 


6.7 


30 


1 


2 


Absent 


Absent 


N 


3159652 


28.6 


7.1 


39 


1 


2 


N/A 


N/A 




739346 


8.04 


6.4 


ee 


0 


2 


Absent 


Absent . 


N 


5932010 


1587 


6 


17 


1 


0 


N/A 


N/A 




6755019 


74.46 


7.2 


21 


■ 1 


3 


N/A 


N/A 




3212625 


65.45 


5.7 


24 


0 


6 


N/A 


N/A 




6576615 


62.53 


7.2 


33 


2 


1 


N/A 


N/A 




1730203 


48.22 


7.2 


31 


3 


1 


N/A 


N/A 




1730519 


44.54 


8.3 


47 


5 


4 


1083 


1402.3 


N 


6754976 


22.16 


8.6 


53 


. 3 


1 


N/A 


N/A 




3913065 


17 


7,6 


55 


0 


3 


N/A 


N/A 




2137430 


12.1 


5.2 


38 


0 


1 


N/A 


N/A 
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reference for later research on the molecular mechanisms underly- 
ing normal myeloid development. 

The MPRO cell system morphologically mimics normal myeloid 
differentiation and biochemically proceeds further toward mature neutro- 
phils than most other in vitro systems. Because the arrest in differentia- 
tion of MPRO cells growing in the absence of ATRAis not physiologic, 
there is a theoretical risk that gene expression in these ceiis is noi 
coordinated in the way that it is in normal differentiatiori. It is 
encouraging that, for (he most part the timing of expression of genes for 
proteins of the various neutrophil granules is consistent with the timing 
of the morphologic and biochemical appearance of these granule 
components during normal myeloid differentiation- 

The DD technique provides certain advantages for detecting 
and comparing mRNA levels in different samples. First, the method 
is, in principle, similar to competitive RT-PCR, and, with the use of 
stringent PCR conditions, is expected to be about as reliable. 
Second, display patterns are reproducible. Third, the method 
detects the levels not only of RNAs already represented in the 
database but also of unknown RNA species that may represent 
"new" genes. Fourth, closely related genes can be distinguished 
regardless of cross-hybridization, provided there are some single 
nucleotide differences in the 3' end sequence. Limitations associ- 
ated with this technique are that numerous gels are necessary to get 
complete information and that comparison of the levels of different 
mRNAs is only approximate because of the differential amplifica- 
tion of bands of different s ize or sequence. 

Oligonucleotide chip analysis is a fast and effective means of 
accessing mRNA expression patterns. 20 Cluster analysis of groups 
of samples by this approach is effective. However, the present 
results indicate that alternative methods of verification aTe desir- 
able before the data on an unexpected change in a particular gene 
are definit iveiy acc epted . 

To obtain the broadest range of information from the myeloid 
differentiation process, both differential display and oligonucleotide 
chip techniques were applied in the current study. As a result, 65.3% of 
the observed changes in mRNA levels came from the differential display 
method and 41 .5% came from oligonucleotide chip assays. 

Our data showed in general that changes in expression pattern 
by the 2 methods agreed qualitatively but that there was some 
quantitative variation. Our results indicate that DD may be a more 
accurate way to detect changes in levels of gene expression than the 
oligonucleotide chip assay. However, improvements m the types of 
oligonucleotides used in arrays may close this gap in the nature. 

ITie mRNAs for a limited number of transcription factors vary in a 
pattern correlating with that of the mRNAs for primary or secondary 
granule proteins. However, more detailed information is needed, and the 
underlying mechanisms of granule gene regul ation remain unclear. The 
number of potential positive and negative regulatory factors found here 
is sufficiently small as to make it feasible to perform in vivo studies, 
such as chromatin immunoprecipitation. 

The oligonucleotide chip used in this study focused on known 
genes, whereas the DD method samples all polyadenylated tran- 
scripts. The latter method generated a large number of products not 
associated with known genes, in part because the mouse genome is 
not as well represented in the database as the human genome. 
However, our experience with DD and human mRNAs indicates 
that substantial fractions of the products represented as ESTs or not 
represented nt all in ihe public databases are cDNA copies from 
natrons. hnRNA, or other RNA with internal A runs. 

Approximately 59 sequences obtained from gel-display bands 
had significant changes in the level of expression and a sequence 
that did not match that for any named gene in the public databases. 



Of these, 38 had plausible or excellent polyA signals. This is only 
an approximate estimate of the number of new genes found 35 
because e fraction of the mRNAs for known genes still had poor 
polyA signals. In addition, the full 3' untranslated region is often 
not known for characterized genes, and in some cases these new 
genes may prove to be identical to products identified by the 
oligonucleotide chips when more complete sequences are obtained. • 
At the least, their presence indicates that a substantial fraction of 
the regulatory or functional circuitry of maturing myeloid cells 
remains unexplored and that valuable tools for their investigation 
will emerge from a combination of RNA expression studies and 
analysis of emerging genomic sequences. 

The desired end point for the description of gene expression in a 
biologic system is not only the analysis of mRNA tTanscript levels 
but also the accurate measurement of protein abundance. The 
developments in 2DE and new MS instrumentation make it 
possible to accomplish this work rapidly and efficiently. In this 
study, we attempted to identify a number of the proteins differen- 
tially expressed between uninduced and ATRA-differentiated MPRO 
cells and to examine the relation between mRNA and protein expression 
levels for these genes representing the same state. 

For protein levels based on estimated intensity of Coomassie dye 
staining in 2DE, there was poor correlation between changes in mRNA 
levels and estimated protein levels. Other groups have studied the 
correlation between mRNA and protein levels in yeast and liver 
cells. 11 * 12 ' 14 In the liver cell experiments, 1 U2 correlation coefficients of 
0.4 to less than 0.5 were observed. In an extensive study in yeast, 11 ' 12 the 
correlation coefficient was high if the most abundant mRNAs and 
proteins were corjsidered. If a handful of these products was omitted, the 
remaining correlation coefficient was 0.4 or less. However, one 
could restore some of the correlation by averaging individual 
data points into broad proteoniic categories. 17 

The discrepancies between mRNA and protein levels in MPRO cells 
appear to be substantially larger than those observed for yeast Possible 
causes for the discrepancies include translational regulation, differential 
expression of certain mRNA s at various stages of cell growth in vitro, 
post-translational protein modification that varies with the stage of 
maturation of the cells, and selective degradation or excretion of proteins 
in vivo. Furthermore, here we are focusing on a developmental 
time-course, whereas the yeast study concentrated on the organism in 
vegetative growth. New techniques, equipment, and bioinformatic 
analysis tools must be developed to make such systematic, global, and 
quantitative analyses feasible. 

The initial studies of protein expression presented here provide a 
cautionary note for eflbrts to interpret cell composition and function in 
relation to mRNA levels. Discrepancies we observed between gene 
expression and protein abundance suggest that selective post-transerip- 
tional controls may be at least as important as changes in mRNA levels 
in determining the protein composition of neutrophils and that they are 
phenomena less well explored than transcriptional control. Analysis of 
mRNA expression patterns is itself only a small begmning toward a 
genome-wide description of cellular components 
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Appendix _~ 

Gene symbols used in tables: Ad.b: actin, beta, cytoplasmic; Actg: actin, gamma, 
c>ioplasouc; Actx: melanoma X-actin; Aldol: aldolase 1, A isotomi; Mi5: 
ADP-iibosyiiiUon tactoi' 5; Att'i : at t'fvatiiig tnuisaiption factor I ; Ati2: activating 
Udiiscription factor 2; Btl3: basic tnmsaijytion iac^ar 3a; Bap: paiphearalrtype 
ben^odiazcpiiie receptor, C5: 1: complement coniprj!neiit5,itjceptor 1/G protein- 
coupled receptor (C5a); Ccnbi: cyciin B2; Cd36l2: CD36 antigen (coUagen type 
I receptor, thrombospondin fe^eptorVlike 2; C053: CD53 antigen; Cebpa: 
CCAAT/cnhanccr binding protein OBBP, alpha; Ccbpb: CCAAT/Vnhanccr 
binding protein (C/EB?), beta; Ccbpd: CCAAT/enhanccr binding protein fC 
EDP), delta; Ccbpc: CCAAT/enhanccr binding protein (C/EBP\ cpsilon; Cfll: 
cofilin I , nonmuscie; Cmkai4: chcraokbe (C-X-C) receptor 4; Cmkbr I : ehemo- 
lone (C-Q receptor J/tvlipla lrceptor; Cnlp: csthclin-likc protein; Cutf: ciliaiy 
neurotropic fecloiAdnc finger piotein PZF; Ccpa: coaUmicr protein complex 
snbuni! alpha; C|w3: cai-boxypcpUtlasc A3, mast cell; Ol: complement receptor 
2; Crhr: cinticolToptn releasing hormone rctcplor. Cny: complement rcecpav- 
rclauxl protein; CsOr: CSF I (M-CSF) ra:cptor/c-to'CDI 15; Csf2ra: CSF 1 
(GM-C^3F) rcixptor, alpha, low-aflmiry/CDl 1 6; CsfZibl: CSF 2 (GM-CSF) 
receptor, beta % bw-attintLy/U. 3 ixtcptfr-likc protein (AlC2ByCI>jvl31; 



Csflrb2: CSF 2 (GM-CSF) receptor, beta 2, low-affinity/I L-3 receptor (AIC2A); 
Ctsb: cathepsm B; Ctsc: catlicpsin C; Ctsd: cathepsin 1>, Ctsc: cathepsin fc; Osg: 
cathepsin G: Ctsh: cathepsin H; Osl: cathepsin L: Ctss: cathqxiin S; Cybb: 
cyi>xhiome b-245. beta; Chd2: dopamine receptoi- 2; E211: 1121" transcripiion 
factor 1; Eet2: eosuiophil~associate<i libonuclease 2; Ebi3: Epstein-Bair virus- 
induced gene 3/cytokine lTceptor-like molecule (EBB); E12: Balb/c neutrophil 
elastase; Ela2: elastase 2; Erh: enhancer of rudimentary homolog (Drosophita); 
Btohi6: cthanol induced fv'sicrol regulatory clancnt binding transcription factor I 
(SUEBFI) honmlog; F2rl2: coagulation factor FT (rfirumbin) rcccptor-likc 2; 
Fcer Ig: Fc receptor, IgE, high amnity I, gamma polypeptide; Fcgr2b: Fcax<rptor, 
IgG, lov; allinity lib; Fcgr3: Fc rcccpto^ IgG, low affinity III; Fprt: foraryl 
peptide receptor 1/1MJLP receptoi; Gabpbl: GA repeat binding protein (GABP- 
bctal subimit); Gata2: GATA-bmding protein 2; Gna^i: guanine nucleotide 
binding protein, alpha stimulating; Gnb2-rs 1 ; r^aaiiinc miclcolidc binding protein, 
beta- 2, related sequence 1; Gpx3: glutidhionc peroxidase 3; Grg: rclaurd to 
Dmsophiia groutito genu; Gridl : ghitamate receptor channel subimit delta 1 ; 
Gm: granatin; Gstml: glutathionc-S-Uriasfcrafk:, mu I; Gus-s: bcta-ghicurofi:- 
rlasc structural; Gys3: glycogci s>tiUwsc 3, brain; H2-D: hLsrjGco;upcUibiliry 2, D 
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region locus 1; Hist2: histone gene complex 2; Iiist5~2ax: H2A histone family, 
member X; Hmgi: high mobility group protein I; Hsp60: heat shock protein, 60 
kDa; HtrSa: 5-hydraxytryptamine (serotonin) receptor 5A; Idbl: inhibitor of 
DNA binding l/hetix-lnop-heJix DNA bimting protein regulator (Id); Idb2: 
inhibitor of DNA binding 2; Ifhgr. interferon gamma receptor; Imgr2: interferon 
gamma receptor 2; II: la-associated invariant chain; 111 a: ILl alpha; IIli2: IL1 
receptor, type H; H2rg: IL2 receptor, gamma chain; IWra: IL4 receptor, alpha; 
111 Orb: H. 10 receptor, beta; II 17n IL17 receptor, Irfl: interferon regulatory factor 
I; IrQ: interferon regulatory faetor-2; Itgb2; integrinbeta2 (Cdl8); Itpr5: inositol 
1,4,5-trisphosphate receptor (type 2); Jitndl: Jun rtroto-oncogene-related gene 
dl/transcription factor JUN-D; Kl£2; Kruppel-Iike factor LKLF; L-CCR: lipopoly- 
saccharidc inducible C-C ctuanokine receptor-related; Lcn2: lipocalin 2; Ldlr: 
low density lipoprotein receptor; Lspl: Lymphocyte-specific l/S37/pp52; Lstl: 
leucocyte-specific transcript I ; Ltb4r leukotriene B4 receptor; I.thn Lymphotoxin- 
beta receptor; Ltf: U^to transferrin; Ly64: lymphocyte antigen 64; Ly6e: lympho- 
cyte antigen 6 complex, locus E; Lyll : lymphoblastoma teukemla/bHLH factor; 
Lyzs: lysozyme; M6pn rnannafe-6-phosphate receptor, cation dependent; Mad: 
Max dimcrization protein; Man2cl: mannosidase, alpha, class 2C, member J; 
Max: Max protein; Max MYC-assoeiated zinc finger protein (purine-binding 
transcription factor); MBP: eosinophil granule major bask protein precursor; 
McptS: mast cell protease 8; Mil: myeloid/lymphoid or mixed-lineage leukemia; 
Mmp 13: matrix metfllloproteinase 1 3/cottagcuase; Mmp9: matrix metallo protein- 
ase 9/gelatinase B; Mpo: myeloperoxidase; Myb: myeloblastosis oncogene; 
Myb(2: myeloblastosis oncogene-like 2; Myc: myelocytomatosis oncogene; 
Myln: myosin light chain, alkali, nonmuscle; Nfatc2: nuclear feet or of activated T 
cells, cytoplasmic 2; Nte2: nuclear factor, erythroid-derived 2, 45 kDa; Nfkbl : 
NF-kappa-B (pl05); Ngp: neutrophilic granule protein; NMDRGB : N-methyl-D- 
flspartate receptor gtutamnte-bindtug chain homolog; Npml: miclcwphosmin 1; 
Nr4al: nuclear receptor subfiuniry 4. group A, member 1; Osi: oxidative stress 
induced; P2rxl: purinergic receptor P2X, ligand-gated ton channel, I; P2ry2: 
piirinergic receptor P2Y, G-protein-coiipled 2; P40-8: P40-8, nmctbnal/laminin 
receptor, Pbxl: pre B-cell leukemia transcription factor I; Pic: properdin factor, 
complement; Piral: paired-Ig-like receptor A I; PiraS: paired-Ig-like receptor 



A5; Pira6: paired-Ig-like receptor A6; Pirb: paired-Ig-like receptor B; Plaur: 
urokinase plasminogen activator receptor; PM1: putative receptor protein (SP: 
P17152 ); Pml: promyelocytic leukemia; Prg: proteoglycan, secretory granule; 
Prg3: proteoglycan 3/cosmophil major basic protein 2; Prtn3: proteinase 3; 
Psma2: proteasome (prosome, macropain) sub unit, alpha type 2; PtmM: prothy- 
mosin beta 4; Ptprc; protein tyrosine phosphatase, receptor type, C; Rac2: 
RAS-related C3 botulinum substrate 2; Rarg: retinoic acid receptor, gamma; 
Rela: avian iietktdoendotheUosis viral (v-rel) oncogene homolog A/NF-kappa-B 
p65; RpI19: ribosomal protein L19; RPL8: ribosomai protein L8; Rps6kal: 
ribosomal protein S6 kinase polypeptide 1; Rps8: ribosomal protein S8; Rtu3: 
reticulon 3; SlOOaS: S100 calcium binding protein A8 (ca^granulin A); Sl00a9: 
SI 00 calciitm-biudmg protein A9 (cajgranulm B); Sdfr2: stromal cell-derived 
tad or receptor 2; Sell: selectin L (lymphocyte adhesion molecule IX Sema4d: 
semaphoriti 4D; Scppl: selenoprotein P, plasma, I; Slpil: SFFV nroviral 
integration I; Shfdgl: split hand/foot deleted gene 1; Sic 10a 1: solute carrier 
tamiJy 10 (sodium/bite acid co transporter family), member 1; Slpi: secretory 
leukocyte protease inhibitor; SoxlS: SRY-box containing gene 15; Spi2-1: serine 
protease inhibitor 2-1 ; Srb I : scavenger receptor dass B I ; Stat3 : signal transducer 
and activator of transcription 3; Stat5a: signal transducer and activator of 
transcription 5 A; Stat6: signal transducer and activator of transcription 6; Stral4: 
basic-he iix-loop-helix protein-retinoic acid induced; Thxl : TDX1 protein/LPS- 
induccd TNF-alpha factor homolog; Tfcrgb: T-ceJI-receptor germline beta-chain 
gene constant region; Tcrg-V4: T-celL-receptor gamma, variable 4; Tctexl: 
t-complex testis expressed I; Tfdpl: transcription factor Dp I; Tiflb: transcrip- 
tional intermediary factor 1, beta; Tlr4: toll-like receptor 4; Tnfisfla: TNF 
receptor superfamily, member la; Tnftsflb: TNF superfamily, member lb; 
Tbmm70a: trenslocase of outer mitochondrial membrane 70 (yeast) homolog A; 
TpL triftsephosphate isomerase; Trp53: trmstbrmation-rebtcd protein 53; Ubb: 
ubtquilin B: Usf2: upstream transcription factor 2; Ybxl : Y box transcription 
factor; Ybx3: Y box binding protein; Zfpl l-<>: zinc finger protein s 1 1-6; Zfp 18: 
zinc finger protein 1 8 homolog; Zfp36: zinc finger protein 36; Zfpl 62: zinc finger 
protein 162; Zfp216: zinc finger protein 216; Zfpml: zinc finger protein, 
multitype I ; Znfhl al : zinc finger protein, subfamily LA, I (Ikaros); Zyx: zyxin. 
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Bacterial lipopolysaccharide (LPS) evokes several 
functional responses in the neutrophil that contribute 
to innate immunity. Although certain responses, such as 
adhesion and synthesis of tumor necrosis factor-a, are 
inhibited by pretreatment with an inhibitor of p38 mi- 
togen-activated protein kinase; others, such as actin as- 
sembly, are unaffected. The aim of the present study was 
to investigate the changes in neutrophil gene transcrip- 
tion and protein expression following lipopolysaccha- 
ride exposure and to establish their dependence on p38 
signaling. Microarray analysis indicated expression of 
13% of the 7070 Affymetrix gene set in nonstimulated 
neutrophils, and LPS up-regulation of 100 distinct 
genes, including cytokines and chemokines, signaling 
molecules, and regulators of transcription. Proteomic 
analysis yielded a separate list of up-regulated modula- 
tors of inflammation, signaling molecules, and cytoskel- 
etal proteins. Poor concordance between mKNA tran- 
script and protein expression changes was noted. 
Pretreatment with the p38 inhibitor SB203580 attenu- 
ated 23% of LPS-regulated genes and 18% of UPS-regu- 
lated proteins by 2=40%. This study indicates that p38 
plays a selective role in regulation of neutrophil tran- 
scripts and proteins following lipopolysaccharide expo- 
sure, clarifies that several of the effects of lipopolysac- 
charide are post-transcriptional and post-translational, 
and identifies several proteins not previously reported 
to be involved in the innate immune response. 



Lipopolysaccharide (LPS), 1 a component of the outer cell wail 
of Gram-negative bacteria, evokes a variety of functional re- 
sponses in the human neutrophil (PMN) after binding to a 
plasma membrane receptor complex that involves the Toll-like 
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receptors (TLRs) (1-5). These "immediate" functional re- 
sponses, including actin assembly, adhesion, activation of nu- 
clear factor-kappa B (NF-kB), and priming for an enhanced 
secretory response and for release of reactive oxygen interme- 
diates, appear to be central both to the innate immune re- 
sponse and to the pathogenesis of several inflammatory human 
diseases, including sepsis and the acute respiratory distress 
syndrome (6). p38 mitogen-activated protein kinase (p38 
MAPK) has been shown to mediate LPS-induced PMN adhe- 
sion, NF-kB activation, and TNF-a and IL-8 translation and 
release (7), and its blockade attenuates LPS-induced PMN 
accumulation in the airspace (8). However, other cascades al- 
most certainly lead to downstream effectors of the LPS signal; 
for example, actin assembly appears to be p38 MAPK-inde- 
pendent (9). An improved understanding of the transcriptional 
and translational responses of the neutrophil to LPS and the 
modulation of these responses by p38 MAPK might carry 
pathogenetic and therapeutic implications. 

Historically, it has been believed that the downstream PMN 
transcriptional response to LPS is static and that PMN func- 
tional responses to LPS that depend on de novo protein syn- 
thesis are primarily limited to the release of cytokines (10). 
However, recent studies indicate a robust transcriptional re- 
sponse (11). To date, most studies have relied upon and re- 
ported a short list of functional assays of the LPS-exposed 
PMN; therefore, no exhaustive investigation of either the tran- 
scriptional response or protein synthetic repertoire of the PMN 
has been reported. Although several techniques have been used 
to evaluate transcripts, the screening of global changes in 
mRNA by microarray analysis has only recently become possi- 
ble. In this way, thousands of genes can be screened in an 
unbiased fashion for transcript abundance. Such genomic 
screens in mammalian cells have previously been applied to 
define altered expression profiles in response to agonists (12) 
and to drug action (13) and during cell cycle progression (14). 

Although DNA microarray technology is expected to provide 
insight into the response of the human PMN to LPS (15), 
inliibition of LPS-stimulated IL-1 and TNF-a production by 
p38 MAPK inhibitors in THP-1 cells (16) and of TNF-or synthe- 
sis in human PMNs (9) occurs at a translational level and 
would therefore not be detected by DNA microarrays. Further- 
more, in other systems, such as yeast and human liver, mRNA 
and protein levels show poor correlation (17, 18). Proteomice is 
a complementary tool for assessing global changes in cellular 
protein expression, thereby providing additional insight into 
cellular signal regulation. A proteomic approach has proven 
useful in different systems for dissecting signal transduction 
cascades and describing their output (19, 20) and has even 
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recently been used to detect novel upstream messengers in- 
volved in LPS signal transduction (21). We have applied DNA 
microarrays and proteomics to define and compare transcrip- 
tional and post-transcriptional alterations in the LPS-exposed 
PMN and to establish the dependence of these alterations on 
p38 MAPK signaling. 

EXPE R1MKNTAL PROCEDURES 

Materials — Endotoxin-free reagents and plastics were used in all 
experiments. Aprotinin, leupeptin, AEBSF, E-64, pepstatin, and besta- 
tin protease inhibitors, spermine HC1, and a-cyano-4-hydroxycinnamic 
add (CHCA) were all purchased from Sigma Chemical Co. (St. Louis, 
MO). SB203580, a p38 MAPK inhibitor, was purchased from CaJbio- 
chem-Novabiochem Corp. (San Diego, CA). For two-dimensional PAGE, 
rehydration buffer, equilibration buffer a, vertical electrophoresis solu- 
tions, and 10% homogeneous poly aery I amide slab gels were purchased 
from Genomic Solutions, Inc. (GSI, Ann Arbor, Ml). Sequencing grade 
porcine trypsin was purcliased from Promega (Madison, WI). 

LPS Incubation — PMNs were isolated by the plasma Percoll method 
(22), a technique that yields less than 6% monocytic contamination, and 
resuapended at a concentration of 16.4 X 10°/ml in RPMI 1640 culture 
medium (BioWhittaker, Walkersville, MD) supplemented with 10 uim 
HEPES (pH 7.6) and 1% heat-inactivated platelet-poor plasma. After 
addition of 100 ngfonl Escherichia coli 0111*34 LPS (List Biological), 
incubation waa carried out with continuous rotation (4 h, 37 *C) both in 
the presence and absence of SB203580. Both AfFyinetrix analysis and 
proteomic analysts utilized 75 X 10 6 cells. For microarray analysis, 
noustimulated and 4-h-treated PMNa were collected from three sepa- 
rate donors. A more detailed time course following LPS exposure waa 
performed using polymerase chain reaction. For proteomic analysis, 
LPS incubations from separate donors (n = 6) were performed and then 
analyzed individually. Control and poat-LPS incubation PMNs were 
washed (0.84 M sucrose/1 mM EDTA/10 mM Tria) and then lysed in a 
modified rehydration buffer (GSI, Ann Arbor, MI) supplemented with 2 
M thiourea, 50 mM dithiothreitol (DTT), 22.6 mM spermine HC1, and a 
mixture of six protease inhibitors (10 ugfrnl aprotinin, 10 ^tg/nd leupep- 
tin, 2 mM AEBSF, 5 E-64, 1 jum pepatatin, 10 um bestatin). DNA was 
pelleted by centrifugation at 250,000 X g for 60 min (23). 

Asymetrix Oligonucleotide Ajray — Five micrograms of total RNA was 
isolated with TRIzol (Invitrogen) and RNeasy columns (Qiagen) and 
subsequently labeled with btotin as described by Asymetrix. Briefly, 
firsketnmd synthesis was accomplished with Superscript H reverse tran- 
scriptase (Invitrogen) using a T7-oJigo(dT). 24 primer for 1 h at 42 a C 
followed by second-strand synthesis using B. coli DNA polymerase I and 
RNase H (Invitrogen) at 16 °C for 2 h. Double-stranded DNA was used as 
a template for in vitro transcription with T7 RNA polymerase in the 
presence of biotm-labeted tJTP and CTP using die BioArray High Yield 
RNA transcript labeling kit (Enzo). Fifteen micrograms of cRNA was 
fragmented and used for hybridization to AfTymetrix HuGene 6800FL 
Genechipa. Each sample was hybridized initially using a Teat2 Genechip 
to test for sample degradation and full-length in vitro translation. Data 
were analyzed using Asymetrix Genechip software. Results from three 
separate donors were analyzed. 

Reverse Transtxiption and Polymerase Chain Reaction — cDNA was 
prepared by reverse transcription using 2 /jg total RNA, derived from 
20 X 10 6 cells that were treated as indicated. Polymerase chain reac- 
tions were performed using specific primers for Mx-i, TNF-ol, MCP-1, 
p6S, SJ00A4, and glyceraldehyde-^phosphate dehydrogenase. 

Two-dimensional PAGE— The protein concentration of the lyaatea 
was measured as described by Bradford et aL (24). Poor isoelectric 
focusing (IEF) results were encountered unless the polycationic sperm- 
ine was diluted (data not shown); therefore, lyBatee were diluted with 
rehydration buffer (GSI, Ann Arbor, MI) to achieve a final spermine 
concentration of 6 mM. Equal protein loads (1.6 mg) of control and 
LPS-stimulated neutrophils were used to rehydrate IEF gels overnight 
(18 cm, pH 8-10 nonlinear Immobiline Dry Strip IEF gels, Amersham 
Biosciences; Piscataway, NJ). IEF was performed at 20 °C to 100-kVh 
(Phaser, GSI) under mineral oil, followed by two 10-min SDS equilibra- 
tion steps (DTT and then iodoacetimide-containing equilibration buff- 
ers, GST) and then by vertical electrophoresis on 10% homogeneous 
polyacrylamide slab gels (GSI) at GOO V. Protein spota were visualized 
by agitation in colloidal Coomassie Brilliant Blue G-250 (16 h) (26), 
followed by destaining in deionized water (20 h). In separate experi- 
ments, control and LPS-stimulated PMN lysates from three donors 
were pooled and then analyzed by two-dimensional PAGE using over- 
lapping narrow isoelectric pomt (pi) ranges (18 cm, pH 5.0-6.0, 5.5- 



6,7, and 6-11, Ameraharn Biosciences, Piscataway, NJ). Identical IEF 
and vertical electrophoresis parameters were used for all gels. 

[mage Analysis of Two-dimensional Gels— Colloidal Coomassje- 
stained gels were digitized using a Powerlook II (UMAX Data Systems, 
Inc., Taiwan) flatbed Bcanner with 8-bit dynamic range and 150-dpi 
resolution. Biolmage (GSI, Ann Arbor, MI) 2D-Analyzer software was 
used to locate, quantitate, and match protein spots on the control and 
LPS gel images. Analysis was performed by assigning 50 common 
anchor spots between paired images; the remaining spots were com- 
pared by a constellation-matching algorithm. All data were then care- 
fully reviewed by the operator to account for any discrepancies. Protein 
loading between control and experimental gele may have varied be- 
cause of inconsistencies in rehydration of the different IEF gel strips; 
therefore, gel images were normalized so that the Bum of the integrated 
intensities of all matched spots on paired gels was made equal. Control 
and LPS-stimulated gel images from individual donor experiments 
were matched to generate composite images; composite images were 
then matched into a master composite image to track the LPS response 
of protein spots among different donora <26), Only those spots that were 
common (image-matched) to all original 12 (pH 3.0-10.0) gels were 
considered for further analysis. For these spots, the LPS-induced 
change in integrated intensity in the six experiments was subjected to 
statistical analysis with a two-tailed Student's t test, and those spots 
withp < 0.05 were identified by peptide mass fingerprinting (described 
below). For the narrow range (pH 5.0-6.0, 5.5-6.7, and 6-11) two- 
dimensional PAGE experiments using pooled donors, only those spots 
with concordant regulation exceeding 1.6-fold or that appeared denovo 
in the LPS gel in two repeat experiments were further analyzed. 

In-gel Tryptic Digestion — In-gel digestion of protein spots was per- 
formed with sequencing grade porcine-modified trypsin using the 
method of Hellman et al. (27). Tryptic peptides were then extracted (50 
pi of 50% Qcetonitrile/5% trifluoroacetic acid, 2 h), and the supernatant 
was taken to dryness in a vacuum centrifuge and then re dissolved in 
trifluoroacetic acid (20 u\, 0.6%). Peptides were then purified and con- 
centrated using ZipTip cle pipette tips (MilUpore, Bedford, MA). 

MALDI-TOF Mass Spectrometry— Analyses were performed on an 
Applied Biosyatems matrix-assisted laser desorption ionization time-of- 
Oight (MALDI-TOF) Voyager-DE PRO mass spectrometer (Framing- 
ham, MA) operated in delayed extraction mode. Samples (0.5 /xl) were 
spotted onto a sample plate to which matrix (0.5 of 10 mg/ml CHCA) 
was added The sample-matrix mixture was dried at room temperature 
and then analyzed in reflector mode. CHCA was also spotted alone as a 
negative control. Spectra were the stun of 100 laser shots, and those 
peaks with a signal-to-noise ratio of greater than 3:1 were selected for 
data base searcriing. Spectra were internally calibrated using autolytic 
trypsin peptides (mfz 842.51, 2211.10). 

Data Base Searching Algorithm — The monoisotopic masses for each 
protons ted peptide were: (a) entered into the program MS-Fit (available 
at prospector.ucsf.edu) for searches against the Swias-Prot, NCBI, 
and GenPept databases, and (b) entered into Mascot (available at 
matrixscience.com), an algorithm testing eta tistical significance of pep- 
tide mass fingerprinting identifications. For MS-Fit searches, masses 
derived from trypsin, CHCA, keratin, and Coomassie Brilliant Blue 
G-250 were excluded. Search parameters included a maximum allowed 
peptide mass error of 0. 1 Da (0.8 Da in the few instances in which linear 
mode was used), consideration of one incomplete cleavage per peptide, 
pi range of 3.0-10.0, and molecular mass range of 1-200 kDa. Accepted 
modifications included carbamidomethylation of cysteine residues 
(from iodoacetamide exposure following IEF) (28) and methionine oxi- 
dation, a common modification occurring during SDS-PAOE (29). Pro- 
tein identifications were assigned when three criteria were met: 1) 
statistical significance (p < 0.05) of the match when tested by Mascot 
(matrixsci eiJ.ce.com); 2) >20% sequence coverage by the tryptic pep- 
tides; and 8) concordance (±16%) with the molecidar weight and pi of 
the parent two-dimensional PAGE protein spot. The following Bpecial 
exceptions were considered: (a) protein identifications not fulfilling 
criterion 2 were still assigned if criteria 1 and 3 were fulfilled and no 
other Homo sapiens proteins with peptide mass-matched p values < 
0.05 were identified by Mascot; (6) if criterion 3 waa not fulfilled (lower 
than expected molecular weight), a cleavage product of the identified 
protein waa inferred, and the cumulative molecular weight of the tryptic 
peptides waa compared with that of the twa<Umenaional-PAGE spot to 
ensure that it was not exceeded; (c) if criterion 3 was not fulfilled (isolated 
discordance between theoretical and observed pi), post-tranalational mod- 
ification of an iinrecovered peptide waa inferred; and (d) if two or more/f. 
sapiens protein assignments with >4 mutually exclusive matching pep- 
tides were identified, a protein mixture in the two-dimensional PAGE 
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spot was inferred ami further analysis halted (quantitative conclusions 
regarding the irifUvuhial protein constituents could not be drawn). 

RESULTS 

Genes Differentially Expressed in LPS-stimulated Neutro- 
phils— Unman PMNe were left untreated or incubated in the 
presence of 100 ng/ml LPS for 4 h. As a control to confirm that 
the PMNs were quiescent at baseline and that LPS resulted in 
normal stimulation, mRNA was isolated, cDNA was prepared, 
and PCR for TNF-a was performed. Little TNF-a expression 
was seen in nongtimulated cells, whereas LPS treatment led to 
an increase in expression in each of the donors subsequently 
used for microarray analysis (data not shown). No macrophage- 
colony stimulating factor receptor transcript was detected by 
oligonucleotide microarray analysis, confirming there was no 
significant monocytic contamination. 

Human PMNs express a limited repertoire of mRNA tran- 
scripts at baseline but respond to LPS with differential expres- 
sion of genes in many families. Considering only those genes 
present by microarray analysis in all three donors, unstimu- 
lated PMNs expressed 13.0% (923 of 7070 genes) of the Af- 
fymetrix gene set. Gene classes represented at baseline include 
metabolic enzymes, structural proteins, receptors, signaling 
proteins, and transcription factors. By comparison, human 
monocytes expressed -40% and human fibroblasts —35% of the 
represented genes (data not shown). By the criterion of a >3- 
tbld increase in expression in all three donors on Afrymeferix 
oligonucleotide array analysis, exposure of PMNs to LPS for 4 h 
resulted in the up-regulation of 100 genes (Table I). 

Genes from several different functional classes were induced 
in PMNs following LPS exposure. Of interest, a number of 
transcriptional regulators were induced, including transcrip- 
tion factors of the NF-kB family. The transcriptional NF-kB 
complex ha3 previously been implicated in the regulation of the 
genes induced by LPS (11). The genes for several cytokines and 
ehemokines were also found to be up-regulated. These include 
Wa, IL-lfr IL-6, MCP-1, MIP-3a, and MIP-1& (Table D. 
PCR was performed to confirm the results from the inicroarray 
analysis. PCR analysis on selected genes indicates that the 
time course for changes can be rapid or delayed but parallel the 
changes found in the array at the 4-h time point (data not 
shown). Other up-regulated genes included those for metabolic 
enzymes, immune response molecules, kinases, phosphatases, 
signaling molecules, adhesion and cytoskeletal components, 
interferon-stimulated genes, and those with unknown or mis- 
cellaneous function (Table 1). 

LPS stimulation of PMN also resulted in the down-regula- 
tion of 56 genes (Table II). Down-regulated genes were identi- 
fied as transcriptional regulators, protein and lipid kinases and 
phosphatases, structural molecules, and signaling molecules. 
Genes for metabolic proteins were also evident, as were several 
uncharacterized genes. 

Two-dimensional PAGE and Image Analysis— la contrast to 
the limited number of transcripts found at baseline, PMNs 
were found to express a large number and variety of proteins in 
the nonstimulatcd state (Fig. 1, A and C, and Tobies III-V). 
Reproducible protein expression patterns were found on the pH 
3.0-10.0 gels, and the majority of proteins fell in the pH 5.0- 
7.0 range (Fig. 1A). The basic region (pH > 7.0) consistently 
exhibited poor resolution, precluding meaningful image analy- 
sis and further workup (data not shown). Depending on the 
spot-finding parameters (minimum spot intensity, filter width) 
selected on the image analysis software, spot-by-spot manual 
editing was found to be necessary to avoid over- and underde- 
tected spots; moreover, further manual editing was performed 
to 6creen for unmatched and mismatched spots following 
matching of paired control and LPS-stimulated gels. After spot 



editing, -1200 well-reBolved spots were evident on each pH 
3.0-10.0 gel. In an attempt to improve resolution of the pi 
range bearing the greatest number of well-resolved spots, over- 
lappingnarrow pH range gels (pH 5.0-6.0, 5.5-6.7, 6-11) were 
also run. Of interest, a similar number of well-resolved spots 
(-1200) were detected on the narrow pH range gels (Fig. 1, C 
and D). Assuming a detection limit for Copmassie of 15 ng (0.25 
pmol, or 1.5 X 10 11 molecules, for a 60-kDa protein) and a 
protein load per gel corresponding to 76 X 10 6 PMNs, we 
estimate a detection limit on our gels of 2000 molecules/cell for 
a 60-kDa protein. As investigators have suggested in other cell 
lines with the use of high resolution two-dimensional-PAGE 
methods (30), we estimate that >10,000 proteins are expressed 
in the resting PMN. 

Human PMNs respond to LPS with the differential expres- 
sion of a large number of proteins. In the six individual pH 
3.0-10.0 experiments, the number of protein spots that in- 
creased in integrated intensity by at least 50% following LPS 
exposure was 185, 122, 104, 104, 96, and 131, respectively. The 
number of protein spots that decreased by at least 50% follow- 
ing LPS exposure was 72, 151, 102, 98, 128, and 97, respec- 
tively. Although gel-to-gel regional variability in resolution was 
expected to account for individual spots not being well visual- 
ized on particular gels, only those spots that were matched to 
all. 12 original gels were analyzed further. Overall, the number 
of spotB matched to all 12 original gels was 125. The numbers 
of spots that were both matched to all 12 original gels and that 
increased by at least 50% in integrated intensity in the indi- 
vidual experiments following LPS exposure were 46, 13, 17, 27, 
22, and 20, respectively. The numbers of spots that were 
matched to all 12 gels and that decreased by at least 50% were 
6, 22, 17, 22, 34, and 28, respectively. The LPS-induced change 
in integrated intensity of the 126 spots that were matched to all 
12 original gels was subjected to statistical analysis with a 
two-tailed Student's t test, and those spots with statistically 
significant (p < 0.05) regulation among the six experiments 
were identified by peptide mass fingerprinting (Table III). 

Identification of LPS~regulated Proteins — Several proteins 
were consistently up-regulated on the pH 3.0-10.0 gels (Table 
HI), including regulators of inflammation (annexin III) and 
signaling molecules (Rab-GDP dissociation inhibitor 0). Sev- 
eral actin fragments were seen to be consistently up-regulated 
in the six experiments following LPS exposure (Table HI). Of 
interest, the proteasome 0 chain was also consistently un- 
regulated. Down-regulated proteins included other signaling 
molecules, such as Rho GTPase activating protein 1. 

On the pH 5.0-6.0 and 6.5-6.7 gels, several proteins were 
found to show increases of greater than 1.5-fold following LPS 
exposure (Tables IV and V), including cytoskeletal proteins, 
such as moesin, nonmuscle myosin heavy chain, and a putative 
phosphorylated form of nonmuscle myosin heavy chain, and 
signaling molecules, such as protein phosphatase 1 and P0 4 - 
stathmin. The putative phosphorylated form of nonmuscle my- 
osin heavy chain (spot #1101) was positioned 0.03 pH unit more 
acidic than the unmodified protein (spot #1102) (Fig. ID) and 
was distinguished by a tryptic peptide (mlz 1366.74) not pres- 
ent in the unmodified protein, consistent with phosphorylation 
of serine 685. Serine 685 is predicted by NetPhos 2.0 Prediction 
Server (available at www.cbs,dtu.dk/services/NetPhos/(31)) to 
be a high probability phosphorylation residue and by Scan- 
Prosite (www.expasy.ch/tpols/scnpsite.html) to be a substrate 
for protein kinase C. The tryptic phosphopeptide identified in 
P0 4 -stathmin, extending from residues 15 to 27 (1468.7 Da), is 
consistent with phosphorylation of either serine 16, a known 
substrate for Ca 3+ /calmodulin (CaM)-dependent kinases (32), 
or serine 25, a known substrate for p385 and ERK (Fig. 2A) 
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Table I 

Human neutrophil genes induced after 4 h of LPS exposure 



Description 



GenBunk™ no. 



Change- fold 



Transcriptional regulation 

. Pleiomorphic adenoma gene-like 2 

NFKH2 

NFKBIB 

pS5 

BCL3 

X-box binding protein 1 

Metal-regulatory transcription factor 1 

Ets-2 

c-Rel 

NFKB1 

Basic leucine zipper transcription factor. ATF-like 
1KB 

MAX dimerization protein 
DIF2 

Cytokines and receptors 
MCP-Jf 
MlP-lfl 

otHelix coiled-coil rod homolog 

GR03 (beta) 

TNF-a 

MIP-3« 

IL10RA 

IL-6 

GROa 

IIM74 

Immune response 
Orosomucoid 

Complement component C3 

Pintease inhibitor 9 

Complement component 3a receptor 1 

Protease inhibitor 3 

S7JF7/an tilaukoproteas e. 

ELAiVHS/clastase inhibitor 

CD58 

Complement component PFC 

Kinases 
CNKlFNKfPLK-like 
Cot 
Pim-2 
LIMK2 

Phosphatases 
PAC-1 1DVSP2 
DUSP5 
PHAt 

Signaling molecules 
mFAJPl/A2Q 
TRAF1 
RanBP2 
GNA15 
PTAFR 

Adhesion and cytoskeleton 
ICAM1 

CEACAMl (bilary glycoprotein) 
UMS1 

SJWactin bundling protein 

Gatectin-l/LGALSl 

MEMD/ALCAM 

CD44 

TSG-6 

Metabolic 
GTP ayclohydrolase I 
.VDLVV^/ubiqumone reductase 
P5AM6Aproteosbme iota) 
UDP-galactose transporter (SW35A2) 
PLAU (urokinase) 
/nWC//Lrkynurenine hydrolase 
AMPD3 

J/prolyl ^-hydroxylase 
y Glutamylcysteine synthetase 
ATP6D 
ATP6S1 



D83784 


16.8 


S76638 


12.3 


U91616 


11.5 


L19067 


8.4 


U05681 


7.7 


M31627 


7.5 


X78710 


7.4 


J04102 


7.4 


X75042 


6.2 


M58603 


5.8 


U15460 


4.7 


M69043 


3.8 


L06895 


3.6 


S81914 


3.1 



M69203 

M72885 

AF014958 

X04500 

M57731 

X02910 

U64197 

U00672 

Y00081 

X54489 

D10923 



X02544 
K02765 
U71364 
U28488 
L10343 
XCH470 
M93U56 
Y00636 
Kf 83652 



U56998 
D14497 
U77735 
D45906 

L11329 
U15932 
U73477 



M59465 
U19261 
D42063 
M63904 
D10202 



M24283 
XI 6354 
U09284 
U03057 
M57710 
U30999 

HG2981— HT3125 
M31165 

U19523 
M22538 
X59417 
D34454 
X02419 
U57721 
D12775 
M24486 
L35546 
J05682 
D16469 



78.7 
48.8 
20.8 
17.6 
17.3 
14.5 
8.1 
7.3 
6.3 
4 

3.3 



20.2 
12.8 
9.5 
6,1 
4.9 
4.7 
4.6 
3.8 
3.5 



16.2 
11.9 

9.5 
4.3 



11.8 

5.3 
3.4 



10 

6.2 
5.6 
5.2 
3.9 



22.4 
6.3 
6.1 
5.9 
4.7 
4.2 
3,9 
3.7 



13.5 
8.6 
8.4 
7.3 
6.4 
5.5 
5 

4.7 
4.5 
4.2 
4 
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Tabu; I — continued . 



Description 



GenBank 1 * no. 



Glycerol kinase 

FACIA 

AK3 

Interferon- inducible 

rsois 

Mxl 

mss 

INDO 
GBPJ 
PRKR 
IFIT4 
IFI54 
IFI58 
IFP35 

Other 
Gos2 

muciiAPi 

KIAA010S 

R2AA01I8 

SNAP23 

CASP5 

KIAA0113 

MAA0255 

Hepatoma-derived GF 

PTGS2 

CD48 

UNC119 homolog 

9UAA0151 

Rablb 

Annexin V77 

K1AA0110 

Adrenomedullin 

AIM! 

MAA0250 

P5-1 

Scavenger receptor expressed by endothelial cells 
VHL 



X68285 
L09229 
X60673 



M13756 
M33882 
M24594 
M34455 
M55542 
U50648 
U52513 
M14G60 
U34605 
U72882 

M72885 
U37546 
D14661 
D42087 
U55936 
U28015 
D30755 
D87444 
D16431 
D2B235 
M37766 
U40998 
D634B5 
XM035G60 
J04543 
D14811 
D14874 
U83115 
D87437 
X0617S 
D63483 
L15409 



Change-fold 



3.6 
3.5 
3.3 



22.5 
19.4 
12.1 
5.2 
4.3 
3.7 
3.6 
3.5 
3.5 
3 



48.8 
7.2 
5.1 
5 
5 

4.8 
4.8 
4.7 
4.7 
4.6 
4.3 
4.2 
3.9 
3.8 
3.7 
3.7 
3.7 
3.6 
3.2 
8.2 
3.2 
3.1 



(33). Assuming that no other multiply phosphorylated etath- 
min species had escaped detection, analysis of the integrated 
intensities of the P0 4 -stathmin and stathmin spots indicates 
that the percentage of the P0 4 form of total cellular stathmin 
increased from 11% to 38% with LPS stimulation (Fig. 2B). 
This is similar to a previous report of an increase from <10% to 
35-40% of the Ser^-phosphorylated form in Jurkat cells stim- 
ulated with anti-CD3 (34). 

Effect of SB203580 on LPS-stimulated Gene Expression— 
Gene expression analysis of PMNs stimulated with LPS indi- 
cated that the majority of genes induced by LPS were unaf- 
fected by prior treatment of PMN with SB203580. Of the 100 
genes up-regulated by LPS, the up-regulation of 23 was inhib- 
ited by greater than 40% (Table VI). The majority of these 
genes affected by SB203580 were inhibited by less than 60%. 
whereas only six were inhibited by greater than 80%, all of 
which represent previously identified interferon-stimulated 
genes. Induction of cytokine genes by LPS, with the exception 
of JX-6, was generally unaffected by SB203580. 

Effect ofSB2035S0 on LPS-stimulated Protein Expression^- 
Similar to the effect of SB203580 on LPS-stimulated gene 
expression, little effect of SB203580 was seen on expression 
levels for the majority of LPS-regulated proteins (Table VTD. 
Two exceptions are annexin III and a-enolase, for which LPS- 
stimulated expression was attenuated in the presence of the 
p38MAPK inhibitor. 

Comparison of Microarray and Proteomics Results— -Of the 
LPS-regulated proteins identified by peptide mass fingerprint- 
ing for which probes were present on the oligonucleotide mi- 
croarray, poor concordance was found at the mRNA level (Table 
VIII). For 13 LPS- up-regulated proteins, 2 corresponding 



mRNA transcripts were up-regulated, 1 was down-regulated, 5 
were unchanged, and 5 were not detected by the Affymetrix 
chip. For 5 down-regulated proteins, 3 corresponding tran- 
scripts were down-regulated, 1 was unchanged, and 1 was not 
detected. Varying patterns of LPS regulation emerge for those 
candidates detected at both the transcript and protein level. 
Proteasome /3 chain was up-regulated at both the transcript 
and protein levels (Table VH0, with no notable effect of 
SB203580 on expression at either level. Similarly, CAP1, Rho- 
GAP1, and ficolin 1 were down-regulated at both the mRNA 
transcript and protein level (Table VHI), with no notable effect 
of SB203580. Annexin III was down-regulated at the transcript 
level and up-regulated at the protein level, with an inhibitory 
effect of SB203580 seen only at the protein level (Tables VII 
and VIII). 

DISCUSSION 

Interaction of bacterial LPS with the human PMN repre- 
sents a model system for studying the activation and output of 
the innate immune system during infection and inflammation. 
A recent publication (35) describes the gene expression changes 
of a cultured monocytic cell line after infection by the Gram- 
positive bacterium Listeria monocytogenes. The cell wall com- 
ponents of Gram-positive bacteria, like Gram -negative-derived 
LPS (i.e. from E. coli), are known to signal through TLRs (36, 
37). Importantly, many of the expression changes found in 
LPS-stimulated PMNs in the present study were also described 
in the bacteria-exposed monocytic cells, indicating that many of 
the gene expression changes seen in bacterial infection are 
likely mediated by TLRs (38, 39) and that the LPS model 
system accurately reflects exposure of immune cells to infec- 
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Tables II 

Human neutrophil genes repressed (>4-fold) after 4 h of LPS exposure 



Description 



GenBuiik™ no. 



Change 



Kinases 
CAMK, //, gamma 
Diacylglycerol kinase, delta 
PRKCL2fVTWt, protein kinase C-like 2 
MAPICAPIC3 

Protein kinase Ht3 1, eAMP-dependent 
CAMKJI 

Transporters 
SLC25A5f6o\ute carrier family 25, member 5 
SLC19A1; folate transporter 
SLC2A3; facilitated glucose transporter 

Metabolic 
Carbonic anhydrase IV 
KNase A family, k6 
Glycogen phosphorylase; liver 
Inositol polyphosphate-5-phosphatase 
Inositol 1,3,4'trisphosphate 5/6-kinase 
Tranehetolase 

Protein phosphatase 4 S reg. eubunit 1 (clone 23B40) 

Cytidine deaminase 

MGAT1 

HMOXl 

MAN2A2 

Glycogenic (also represents U31525) 

Structural 
Fibrinogen-like protein (pT49 protein) 
H2AFZ 
PaxUlin 
Lamin B R 
Dynamin 2 
Actinin 1 
et-Tubuliti 

Tubulin, al % isoform 44 

Transcriptional regulators 
Lymplwblastic leukemut-derived sequence I 
MAX-interacting protein 1 
Nuclear factor crythroid 2 isoform f 
Transducer of ERBB2, 1 
WATC4 
ATF-2 iCRB-Bpa) 

Receptors 
Lymphotoxin. fi receptor 
Folate receptor 3 (gamma) 

Signaling 
Pix-a; cool-2 (IOAA0006) 

TNFSF10; TRAIL 

Ca fl+ binding 
ANXIT 
S100A4 
ANX1 

Other 

Protcclipid protein 2 

Protein phosphatase 1, a catalytic subunit 

TIMP2 

KTAA0199 

Lipin 2 (KIAA0249) 

r^MP(Jawl) 

CUGBP2 

Clone 23933 

PECAM1 

Delta sleep-inducing peptide 

DiGeorge syntl critical region gene 2 (KIAA01& 0 .) 
SBLPLG: CD162; selectin P ligand 



U50360 
D63479 
U33052 
U09578 

HG2167-HT2237 
L07044 



J02683 
UJ.75G6 
M20681 



L10955 
U64998 
M14636 
U67650 
U51336 
L12711 
U79267 
L27943 
M55621 
X06985 
L28821 

HG4334-HT4604 

Z36531 
M37583 
U14568 
L25931 
L36983 
M95178 
X01703 

HG226JM-IT2348 

M22633 
L07648 
S77763 
D38305 
L41067 
L05515 



L04270 
U08471 
U11875 

D25304 
M12174 
U37518 



L19605 
M805G3 
X05908 



1,09604 

HG1614-HT1614 

M32304 

D83782 

D87436 

U 10485 

U69546 

U79273 

L34G57 

250781 

D79985 

U25956 



•fad 

-4 

-4.2 

-4.3 

-6.3 

-8 

-9.8 



-4.2 
-4.4 

-5 



-4.4 
-4.5 
-4.6 
-4.6 

r-4.7 

-4.8 
-4.9 
-5.4 
-5.4 
-5.4 
-5.8 
-5.9 



-4.2 
-4.7 
-4.9 
-5.9 
-6.2 
-6.7 

-10 

-15 



-4.4 
-4.5 

-6 
-6,9 

-7.8 

-9.6 



-4.4 

-5 

-5.3 



-4.5 
-4.5 
-6.6 



-4.3 
-4.8 
-4.8 



-4.9 

-5 
-5.1 
-5.2 
-5.6 
-5.8 
-6.9 
-7 
-8 
-8.7 
-9 
-32 



tion. Nevertheless, the reliance upon DNA microarrays alone 
affords insight only into the transcriptional response without 
corroboration at the protein level. In the present study, appli- 



cation of both DNA microarray and proteomics technology to 
our model system provides unique insight into both the cellular 
biology of the activated PMN and the responsiveness and reg- 
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Ftg. 1. Two-dimensional PAGE of LPS-exposed human PMNs. 
A and B, colloidal Coomassie Blue-stained pH 3.0-10.0, two-dimen- 
sional PAGE gels (A, control; B, LPS- exposed) with up-regulated isolid 
arrows) and down-regulated (hatched arrows) proteins indicated. These 
results are representative of six separate experiments. C and D, colloi- 
dal Coomaseic Blue-stained pH 6.0-6.0, two-dimensional PAGE gels 
<C t control; D t LPS-expbsed) with up-regulated (solid arrows), new 
(solid arrow, open arrowhead^ and down-regulated (hatched arrows) 
proteins indicated. UPS-exposed PMNs from three blood donors were 
pooled. 

ulation of its transcriptional iind translational machinery. As 
will be discussed below, our study identifies, in particular, 
novel aspects of the LPS-stimulated PMN transcriptional reg- 
ulation, activity in the innate immune response, signaling, 
cytoekeletal reorganization, and priming for granule release. 

In the present study, the increase in NF-kB transcript abun- 
dance (Table I) detected by the microarrays corroborates the 
findings of other studies of PMNs and monocytes (40) and 
indicates a mechanism for the responsiveness and scope of the 
PMN transcriptional machinery following LPS exposure. NF- 
kB, recently described to be activated by LPS through the 
TLR/MyD8S/interleukin-l receptor-associated kinase pathway 
(1, 4), is the only transcriptional complex reported to be in- 
duced by LPS in the PMN. However, because the transcrip- 
tional NF-kB complex has been implicated in the regulation of 
only a portion of the genes induced by LPS in this study (data 
not shown), the importance of alternative transcriptional reg- 
ulators in the PMN is clear. Of interest, several other known 
and putative transcriptional regulators with less well defined 
functions were also up-regulated in the present study, includ- 
ing PLAGL2, a putative zinc-finger protein, XBP-1, MTF-1, 
Ets-2, B-ATF, and DIF-2. On the other hand, LPS-down-regu- 
lated genes include ATF-2 (a known target of p38), NFATC4, 
TOB-1, NF-E2, MXI-1, and LYL1. Although the exact role of 
these, gene products in regulating cell function is unknown, 



these data indicate that the range of transcriptional responses 
in the LPS-stimulated PMN is much broader than previously 
suggested and that the signaling capabilities of the PMN in the 
immune response are thereby likely extended in scope and 
specificity. 

As expected from the literature, the genes for several cyto- 
kines and chemokines, including JTL- 1 0, IL-6, and MlP-lp, were 
found to be up-regulated (Table I). On the other hand, the 
notable absence of up-regulated cytokines in the proteomics 
experiments reflects their removal in the post-LPS incubation 
wash performed prior to lysis for two-dimensional-PAGE. Up- 
regulation of these inflammatory mediators is well documented 
in PMNs exposed to LPS and in animal models of LPS-induced 
sepsis syndrome and acute respiratory distress syndrome, a 
PMN-mediated illness (41, 42). Several genes in this family 
were up-regulated that have not, to our knowledge, been de- 
scribed in LPS-stimulated cells, including MCP-l, GR03 t 
IL-10RA, and 8M74, an orphan G protein-coupled receptor 
with homology to chemokine receptors. The down-regulation of 
TNFSF10) lymphotoxin b receptor, and TNFAIP1 were also 
observed. The modulation of genes involved in cytokine signal- 
ing, including the adapter molecules TRAFl (LPS and TNF 
receptor signaling) and TNFAIPl (TNF receptor signaling) and 
several kinases and phosphatases, may indicate a change in 
cytokine responsiveness after LPS treatment. Relevant in this 
regard from the proteomics data are: 1) the up-regulation of 
protein phosphatase 1, which has been shown to regulate PMN 
NADPH oxidase activation and translocation (43, 44) and to 
regulate LPS-induced NF-kB activation (45); 2) the down-reg- 
ulation of Rho-GAPl, which has been shown to regulate 
NADPH oxidase activity in the PMN (46); and 3) the up- 
regulation of P0 4 -stathmin (Table IV), a phosphoprotein pos- 
tulated to function as a relayer and integrator of multiple 
signal transduction pathways (34). Several noncytokine, 
nonchemokine genes involved in the immune response were 
also up-regulated, including the complement pathway mem- 
bers C3, C3AR1, and PFC\ the protease inhibitors ELANH2 
(elastase inhibitor), SLPI, PL3, and PI-9; and the acute phase 
protein orosomucoid, LPS regulation of C3AR1 and orosomu- 
coid expression have not previously been reported. In the pro- 
teomics experiments, the down-regulation of ficolin-1 (Table III), 
a collectin-like cell surface protein reported to activate the com- 
plement system and to mediate adhesion and phagocytosis in 
monocytes but not previously reported in granulocytes (47), may 
represent negative modulation of the innate immune response. 
The finding that genes other than cytokines and chemokines are 
regulated by the PMN in response to LPS indicates that the PMN 
plays a more sophisticated role in host-defense and immunity 
than previously thought. 

Treatment of the PMN with LPS lead to the induction of a set 
of genes associated with the anti-viral Type I interferons, 
IFNa/0. This induction occurs independently of the release of 
IFN or another unidentified soluble factor. 2 Furthermore, the 
set of genes expressed is smaller than that induced by IFNo/0, 
as described by Der et al. (12). This may be due to differences in 
the scope of the signaling systems activated by LPS and 
IFNa/0, or the time course of analysis of genes in the LPS- 
stimulated PMN. The implication that LPS treatment of PMN 
allows PMN to express anti-viral activity is currently being 
tested. Of interest was the finding that induction of interteron- 
stimulated genes was blocked by pretreatment of PMNs with 
SB203580. Work from our laboratory has indicated that signal 
transducers and activators of transcription activation does not 
occur in response to LPS in PMNs. 2 In addition, interferon- 



2 K. C. Malcolm and G. S, Worthen, manuscript in preparation. 
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Tabus in 

Analysis of pH 3.0-10.0 two-dimensional PAGE gels 
Mcaii change(-foid) in expression, level among six PMN donors is reported. The change in expression for the proteins Hated was statistically 
significant {p < 0.05) as measured by a two-tailed Students t test. 



Identification [spot no.l 


Swiss-Prot no. 


M&L 


I iicorcticHi 
Af„/pl 


reyuues matched/ 
submitted 


Protein 
covered" 


Mean 
change 










% 


%' 


■fold 


Up-regulated 














Proteasome fS chain 1646] 


P28070 


. 27/5,7 


29.2/5.72 


9/12(75%) 


36% 


1.51 


Anncxin III [550] 


P12429 


31/5.7 


36.4/5.6 


14/18 (78%) 


42% 


1.37 _ 


Actin fragment [544|° 


P02570 


32/5.5 


141.7/5.29) 


13/15 (87%) 


(34%) 


1.74 


Actin fragment [59i] d 


P02570 


30/5.4 


(41.7/5.29) 


14/18 (7B%) 


(29%) 


1.60 


«-Enolase (380] 


P06733 


4175.7 


47.2/7.01 


9/10(90%) 


24% 


1.65 


Rab-GDP dissociation inhibitor ft (289] 


P50395 


50/6.1 


50.7/6.11 


10/11 (91%) 


25% 


1.24 


Glutathione S- transferase P [648] 


P09211 


23/5.5 


23.4/5.43 


6/8 (75%) 


41% 


1.54 


Pre-B-eell colony enhancing factor [1152] 


P43490 


53/7.0 


55.5/6.69 


12/16 (75%) 


25% 


1.29 


Down-regulated 














Adenylyl cyclase-associated protein 1 [256] 


Q01518 


55/7.3 


51.7/8.07 


16/22 (73%) 


34% 


0.53 


Rho-CMPl [283] 


Q07960 


. 50/5.8 


50.4/5.85 


7/9 (78%) 


22% 


0.67 


Ficolin 1 [511] 


000602 


33/6.5 


35/6.39 


10/12 (83%) 


25% 


0.74 



° The theoretical pi and A/ R .of native actin are indicated. Protein coverage indicates coverage of native actin. 

Table IV 

Analysis of pH 5.0.-6.0 two-dimensional PAGE gels 
Results are from pooled samples for control (n~3) and LPS-exposed (n, = 3) PMNs from human donors. Expression of the reported proteins was 
altered > 1.5-fold following LPS exposure in two repeat experiments. "New* designates proteins seen in the LPS gel in two repeat experiments but 
not detectable in the corresponding control gels. 



Identification; [spot no.) 


Swiss-Prot 
no. 


Estimated 

MM 


Theoretical 


Peptides matched/ 
submitted 


Protein 
covered 


Change 










% 


% 


■fold 


Up-regulated 














Protein-tyrosin e kinase 9- like [468] 


Q9Y3F5" 


34/5.81 


39.5/6.37 


10/14 (71%) 


34% 


1.8 


Protein phosphatase 1, catalytic summit, p isoform 


P37140 


38/5.73 


37.2/5.84 


7/10 (70%) 


22% 


2.0 


[378] 














PO^stathmin [577] 


P16949* 


18/5.36 


17.3/5.76 


9/12 < 75%) 


42% 


2.1* 


Konmusde myosin heavy chain [1102] 


189036* 


145/5.32 


145/5.23 


20/21 (95%) 


17% 


New 


Putative P0 4 -nonmnscle myosin heavy chain [1101]'' 


189036 ft -* 


145/5.29 


145/5.23 


14/16 (87%) 


13% 


New 


Leukocyte elastase inhibitor [318] 


P30740 


42/5.71 


42.7/5.9 


9/13 (69%) 


22% 


2.4 


Grancalcjn [10041 


P28676 


24/5.36 


24.0/5.02 


7/10 (70%) 


31% 


New 


Down-regulated 














Adenosylhomocysteinase [324] 


P23526 


48/5.82 


47.7/6.04 


7/9 (78%) 


14% 


0.4 


PEST phosphatase interacting protein homolog (234}? 


4100162' 


48/5.30 


47.6/5.35 


11/13 (85%) 


30% 


0.5 



n TrEMBL accession number. 

h Accession number and theoretical pi and M n for the unmodified protein are indicated. 
c JVCBI accession number. 

* See text for explanation. 

* Among three experiments, the ratio of P0 4 -stnthmin expression increase, following LPS exposure in the presence of SB203580 divided by that 
in the absence of SB203580, was 0.93. 

^Genpept accession number. 

* Tnis search was performed using average masses measured by linear mode MALDI-TOF MS. 

Table V 

Analysis of pH 5.5-6.7 two-dimensional PAGE gels 
Results are from pooled samples for control {n 3) and LPS-exposed (n = 3) PMNs from human donors. Expression of the reported proteins was 
altered > 1,5-fold following LPS exposure in two repeat experiments. 



Identification [spot no.] 


Swiss- Etc* 
no. 


Estimated 


Theoretical 
M a /pl 


Peptides matched/ 
submitted 


Protein 
covered 


Change 










% 


% 


-fold 


Up-regulated 














Tninsnldolase [4751 


P37837 


38/5.95 


37.5/6.36 


13/17 (76%) 


33% 


2.5 


Isocitrate dehydrogenase [431] 


075874 


46'6.25 


46.7/6.35 


7/7 (100%) 


13% 


2.3 


Moesin [2011 


P26038 


61/6.09 


67.8/6.07 


11/13(85%) 


17% 


2.1 


«-Enolase (459] 


P06733 


43/5.64 


47.2/7.01 


7/10 (70%) 


17% 


3.8 


Down-regulated 














Calponin H2 1240] 


Q99439 


34&65 


33.7/6.94 


10/11 (90%) 


27% 


0.5 



regulatory factor 3, a known regulator of interteron-Btimulated 
gene transcription, is not a direct target of p38 kinase. 2 There- 
fore, gene expression analysis of LPS-stiniulated PMNs has 
uncovered a previously uncharacterized eignal transduction 
system that is sensitive to inhibition of p38 MAPK. 

Knowledge of the genes down-regulated by LPS permits the 



development of further hypotheses addressing PMN function in 
the face of infection. Strikingly, several down-regulated genes 
and gene products are structural in nature (eg. paxillin, acti- 
nin, calponin H2) (Tables II and V). A known consequence to 
the PMN of LPS exposure is decreased motility (48). Up-regu- 
iation of genes for adhesion molecules (ICAAf-l, CD44 y AL- 
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Tabus VI 

Effect ofSB203580 on LPS-stimtdated gene expression 
Genes are reported for which the SB20358G/control expression ratio 
is ^ 0.80. 

~ a -fold change ratiu Change in absence 

Gene name <SB2035B0&mtrol) ofSB20358t) 



-fdd 



ISG15 


0.09 


22.5 


HCR 


0.38 


20.8 


Mxl 


0 


19.4 


IFI66 


0 


12.1 


PL9 


0.57 


9.5 


Et<t-2 


0.59 


7.4 


IL-6 


0.45 


6.3 


Rel 


0.50 


6.2 


UMS1 


0.58 


6.1 


C3AR1 


0.49 


6.1 


WD0 


0.35 


5.2 


KIAA0105 


0.41 


5.1 


SNAP23 


0.58 


fi.O 


SLPJ 


0.58 


4.7 


ELNAIJ2 


0.49 


4.6 


HM-74 


0.57 


3.8 


PKR 


0 


3.7 


MAD 


0.21 


3.6 


IFIT4 


0.12 


3.6 


Glycerol kinase 


0 


3.6 


IFIS4 


0 


3.5 


IFI58 


0.39 


3.5 


IPF35 


0.46 


3.0 




Fig. 2. A, the predicted sequence of the tryptic phosphopeptide in 
P0 4 -atathihin (1468.72 Da). The peptide mass measured by MALDI- 
TOF MS and the predicted mass differed by 14 ppm. As indicated, two 
alternate phosphorylation sites are possible: serine 16 and serine 25. JB, 
P0 4 -stathmin and stathmin were identified on the control and LPS- 
exposed pH 6.0-6.0 gels. Consistent with phosphorylation, the P0 4 - 
Btathmin spot was distinguished by a peptide of mass 1468.72 Da {ue. 
80 Da greater than the peptide of 1388.72 Da seen in the stathmin spot). 
Assuming that no other multiply phoaphorylated etathmin species have 
escaped detection, analysis of the integrated intensities of the P0 4 - 
stathmin and stathmin spots indicates tliat the percentage of the PO* 
form of total cellular Btathmin has increased from 1 1% to 38% with LPS 
stimulation. The decrease in integrated intensity for stathmin was 
equal in amount to the increase in P0 4 -stathmin following LPS 
exposure. 

CAM, and TSG-6}, and down-regulation of genes for structural 
proteins, indicates a genetic basis for this observation. Down- 
regulation of two genes implicated in cytoskeletal regulation, 
Pix-ct and RhoB, was also observed. The calcium-binding pro- 
tein S100A4, down-regulated in LPS-treated PMNs (Table II). 
has been implicated in cell motility and metastasis (49). De- 
creased motility may be beneficial in sustaining the inflamma- 
tory response at sites of infection. In addition, LPS treatment 
results in on inhibition of afioptosis (50). Therefore, the longer 
residence time of the PMN at sites of infection is consistent 
with the long term genetically coded changes seen in these 
gene-profiling experiments and indicates that the changes in 
gene expression are functionally relevant to host defense and 
immunity. 

By providing information on post-translational modification, 
the proteomics data may provide further insights into the cy- 



Table VII 

Effect of SB203S80 on LPS -stimulated protein expression 



Protein ruiuie 


-fold change ratio 
{SB2U3680£cntraD 


Change in 
uhsence of 
SB 203580 






-fold 


Up-regulated 






Pro teas ome (S chain 


0.8 


1.51 


Ann ex in III 


0.6 


1.37 


Actin fragment (5441 


0.8 


1.74 


Actin fragment [591} 


0.3 


1.60 


a-Enolase 


0.6 


1.65 


Rah-GDP dissociation inhibitor ft 


1.1 


1.24 


Glutathione S-tranefcrase P 


1.2 


1.54 


Pre-B-cell colony enhancing factor 


1.2 


1.29 


Down-regulated 






Adenylyl cyclase- associated protein 1 


1.3 


0.53 


Rho-GAPl 


0.8 


0.67 


Ficolin 1 


1.0 


0.74 



toskeletal remodeling effects of LPS upon the PMN. We con- 
tend that the actin fragments identified (Table III) are unlikely 
to represent technical artifacts. Rather, their specificity (iden- 
tical molecular weight/pl among different experiments), statis- 
tically significant up-regulation by LPS, as well as the use of a 
lysis buffer containing cliaotropes and multiple protease inhib- 
itors argue instead that these fragments are physiologic con- 
sequences of LPS exposure in the human PMN. More specifi- 
cally, the up-regulation of these fragments following LPS 
exposure (Table III) suggests that LPS may activate an actin- 
cleaving enzyme, which, in turn, remodels the cytoskeleton. 
Intriguing in this vein, cnlpain has recently been reported to 
play an important role in cell migration and cytoskeletal orga- 
nization of fibroblasts (51). The possibilities that LPS may 
induce calpain activation and that calpain activation may reg- 
ulate cytoskeletal reorganization and motility ore currently 
under investigation. An alternative possibility is that actin 
cleavage is a marker of neutrophil apoptosis (52). 

Other LPS-regulated proteins may play important roles in 
cytoskeletal reorganization. The up-regulation of protein-ty- 
rosine kinase 9-like (A6-related protein) may modulate LPS- 
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Table VIII 

LPS-regulated proteins for which a probe was present on the 
Asymetrix chip 

A comparison of corresponding protein and mRNA transcript changes 
following LPS exposure is shown. 



iVrArin T££ mRNA change 



Up-regitlated 

Proteasome 0 chain 1.5 1.9 f 

Leukocyte elastase inhibitor 2.4 4.6 f 

Rab-GDI p 124 NC* 

Grancalcin New NC 

Transaldolase 2.5 NC 

Moesin 2.1 NC 

Nonmuscle myosin heavy chain New NC 

Glutathione S-transfcrase P 1.54 Absent 

Pre-B cell enhancing factor 129 Absent 

Isocitrate dehydrogenase 2.3 Absent 

PCVstatlunin 2.1 Absent (stathmin) 

Protein phosphatase 1, fi catalytic subunit 2 AbEent 
AnnezinOI • 3.1 3.1 j 

Down-regulated 

Adenvlyl cyclase- associated protein 1 1.9 2.1 [ 

Rho-GAP i 15 2.7| 

Fieoliii 1 14 1.7 i 

AdenosyXhomocysteinuse 2.5 Absent 

Calponin H2 2 NC 



d NC t no measureable change. 

induced actin polymerization, because it bears a high degree of 
homology to twinfilin (A6), an actin monomer-binding protein 
that localizes to sites of rapid filament assembly in cells and is 
believed to regulate actin filament turnover (53). In turn, LPS- 
induced down-regulation of Rho-GTPase activating protein 1 
(Table III) may regulate twinfilin (and protein- tyrosine kinase 
9-lifce) activity, because twinfilin has been shown to colocalize 
with Racl and Cdc42 and to be regulated by active Racl in NIH 
3T3 cells (63). Activation of Rho proteins may be facilitated by 
LPS up-regulation of moesin (Table V), because moesin report- 
edly induces the dissociation of Rho from GDI (54). Racl may, 
in turn, promote activation of the actin filament-nucleating 
Arp2/3 complex fclirough interactions with WASP (Wiskott-Al- 
drich syndrome protein) family proteins (55) and, interestingly, 
is postulated to regulate the dynamics of both the actin and 
microtubule cytoskeletons via phosphorylation of stathmin (Ta- 
ble IV) (56). Calponin H2 is an actin-binding protein not pre- 
viously reported in PMNs that is postulated to play a role in 
cytoskeletal organization (57). Its down-regulation by LPS (Ta- 
ble V) likely modulates LPS-induced cytoskeletal reorganiza- 
tion. The up-regulation of nonmuscle myosin heavy chain and a 
putative phosphorylated form of myosin heavy chain (putative 
protein kinase C substrate by prediction rules) in the LPS- 
exposed PMN (Table IV) is of uncertain significance; myosin 
has been implicated in multiple functions in the PMN, includ- 
ing locomotion, fluid pinocytosis, and phagocytosis (58). Of 
interest, however, S100A4 (down-regulated, Table II) lias been 
reported to regulate cytoskeletal dynamics by inhibiting pro- 
tein kinase C-mediated phosphorylation of nonmuscle myosin 
heavy chain (59). 

LPS induction of stathmin phosphorylation (Table IV and 
Pig. 2) may represent another mechanism by which the cy- 
toskeleton is remodeled. Stathmin is a phosphoprotein report- 
edly involved in both signal transduction and in regulation of 
the microtubule filament network; furthermore, phosphoryla- 
tion of stathmin has been reported to modulate its tubulin- 
binding avidity (60). Inferences can be made about both the 
phosphorylation site on PO^-stathmin and the responsible ki- 
nase induced by LPS. Four phosphorylation sites in stathmin 
have been well described: Ser 16 , Ser 25 * Ser 38 , and Ser 63 (32, 33); 



Ser 16 has been reported as a substrate for Ca 2+ /calmodulin 
(CaMVdependent kinases (32), and Ser 25 as primarily a sub- 
strate for p38 and ERK (33), with p34 ud<i2 also active but bear- 
ing a 5-fold preference for Ser 36 (34). As stated above, the 
phosphopeptide identified in P0 4 -sta£hmin, extending from 
residues 15 to 27 (1468.7 Da), is consistent with phosphoryla- 
tion of either Ser 16 or Ser 25 (Fig. 2). Although both p385 and 
p38a MAPK isoforms are expressed in the human PMN, LPS 
has been shown to selectively activate the p38a isoform in 
human PMNs (9). The p38a isoform, however, has been shown 
to be relatively inactive nt Ser 25 ; in fact, p385 is — 100-fold more 
active at Ser 25 , and selective p38a inhibitors do not inhibit the 
stress-activated phosphorylation of stathmin in 293 cells (33). 
Further support for the lack of involvement of p38 signaling in 
phosphorylation of stathmin in our system is the apparent lack 
of effect of SB203580 (a selective p38a and p380 inhibitor) on 
LPS-induced expression of P0 4 -stathmin (Table IV). Because 
p34 cdc2 is relatively inactive at Ser 25 (34), we conclude that the 
phosphorylation site is likely to be Ser 16 , a reported substrate 
of CaM-dependent kinase. Although CaM kinases have previ- 
ously been implicated in gene activation in LPS-exposed my- 
elonionocytic HD11 cells (61), stathmin signaling has not, to 
our knowledge, been previously reported in either PMNs or 
lipopolysaccharide signal transduction. 

Cytoskeletol reorganization, a well-described regulator of 
granule release (62), may underlie LPS-induced priming for 
PMN granule release, but several LPS-regulated proteins may 
provide more specific clues. LPS exposure led to increased 
levels of grancalcin, a calcium-binding protein previously de- 
tected in PMNs nnd shown to translocate to granules and 
plasma membrane in the presence of physiologic concentra- 
tions of calcium (63). Similarly, annexin HI, a calcium-binding 
protein highly expressed in PMN granule membranes and im- 
plicated in calcium-mediated secretion (64) and in granule fu- 
sion (65), was also found to be up-regulated. Exocytosis of 
granule contents may also be facilitated by LPS up-regulntion 
of Rab-GDP dissociation inhibitor (Table III), which has been 
proposed to recycle Rab after vesicle fusion by extracting it 
from the membrane and loading it onto newly formed transport 
intermediates (66). 

Parallel use of DNA microarrays and proteomics affords a 
powerful strategy for comparison of corresponding mRNA tran- 
scripts and proteins, thereby affording new insight into the 
mechanisms by which the cell regulates its signaling responses 
to the external environment. Of interest, a poor correlation was 
found between corresponding transcripts and proteinH (Table 
VHI), as reported in other systems (17, 18). The finding in some 
cases of unchanged transcript abundance in the face of regu- 
lated protein levels indicates post-transcriptional modulation 
following LPS exposure. The finding of undetected transcripts 
in the face of regulated levels of the corresponding proteins 
may indicate previous transcription of these genes in an earlier 
state of the myeloid maturation of the PMN, producing stable 
protein species that have undergone post-translational alter- 
ation following LPS exposure. The use of SB203580, a p38 
inhibitor, adds further insights into the mechanisms of LPS 
regulation. At the level of mRNA expression, SB203580 inhib- 
ited 23% of LPS-stimulated genes by >40% and 11% of genes 
by ^60%; therefore, p38 plays a specific role in gene regulation 
in the PMN. Li particular, proteasome 0 chain was up-regu- 
lated at both the mRNA transcript and protein level (Table 
VTII), with no notable effect of SB203580 on expression at 
either level, consistent with a non-p38-mediated pathway of 
primary transcriptional up-regulation induced by LPS. Simi- 
larly, CAP1, Rho-GAPl, and ficolin 1 were down-regulated at 
both the mRNA transcript and protein level (Table VTII), with 
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no notable effect of SB203580, consistent with a non~p38-me- 
diatcd pathway of primary transcriptional down-regulation. 
Interestingly, anriexin III was down-regulated at the transcript 
level and up-regulated at the protein level, with an inhibitory 
effect of SB203580 seen only at the protein level (Table VII), 
consistent with a p38-mediated post-transcriptional up-regula- 
tion induced by LPS. 

Limitations of the present study should be noted. Gene ex- 
pression analysis by cDNA microarrays does not distinguish 
between transcriptional regulation and mRNA stabilization; 
similarly, two-dimensional PAGE proteomics by itself does not 
distinguish among transcriptional, translational, or post-trans- 
lational regulation of protein abundance. Transcript detection 
by microarray technology is limited to the probes included; 
protein identification by two-dimensional PAGE proteomics is 
limited to well-re3olved regions of the gel, may perform less 
well with hydrophobic and high molecular weight proteins, and 
tends to select for more abundant protein species (30). Harvest- 
ing of the LPS-incubated PMNs at 4 h may have prevented 
detection of earlier, transient changes and may have thereby 
introduced artifactual transcript-protein discordance. Further- 
more, the post-LPS incubation, pre-two-dimensional PAGE cell 
washes would be expected to remove secreted proteins from 
further analysis, with uncertain effects on detected protein 
abundance depending on such factors as the degree of de novo 
synthesis and extent of degranulation/exocytosis. Because pro- 
tein binding of Coomassie Blue has a limited dynamic range 
and is typically not linear throughout the range of detection, 
image analysis of Coomassie Blue-stained protein spots should 
be considered semi-quantitative. For some protein spots, the 
apparent magnitude of regulation by LPS may have been 
blunted by the spot approaching staining saturation in the 
control gel. By limiting our analysis to those protein spots 
common to all twelve pH 3.0-10.0 two-dimensional gels, we 
likely excluded some LPS-regulated proteins that happened to 
be either poorly resolved on a subset of the gels or unmatched 
by the image analysis software. By further limiting the analy- 
sis to those matched spots on die pH 3.0-10.0 gels for which a 
two- tailed t test demonstrated p < 0.05, the list of regulated 
proteins was likely also limited by statistical power. In addition 
to those regulated proteins listed in Table HI, three others were 
up-regulated and three down-regulated with p < 0.09 (data not 
shown). 

Limiting our reported results to those changes that met 
statistical significance among the donors carries further impor- 
tant implications. We have encountered a two order of magni- 
tude range of response in unselected donor LPS-induced PMN 
functions, such as TNF-a and superoxide anion release (data 
not shown). The sources of this physiologic heterogeneity re- 
main uncertain but may possibly include such factors as nat- 
ural mutations of the LPS receptor component, TLR4 (67). By 
selecting for LPS effects common to all donors, we may not have 
characterized the range of genomic and proteomic heterogene- 
ity present in the population and thereby may have focused on l 
only a narrow portion of a broader biological response to LPS. 
We contend that this reductionist approach is valid because it 
would be expected to enrich for biologically integral responses 
of the PMN to LPS. Nevertheless, correlation of genomic and 
proteomic profiles with functional phenotypes of the PMN may 
bear important diagnostic and therapeutic implications and 
will be pursued in future studies. 

Widespread regulation of numerous noncytokine/chemokine 
genes and proteins in the LPS-stimulated human PMN is a 
novel finding. These data indicate that, despite a narrow scope 
of gene expression in the nonstimulated state, the terminally 
differentiated, short-lived PMN likely plays a role in the innate 



immune response that is far more sophisticated and dynamic 
than the simple release of preformed inflammatory mediators. 
Although gene expression appears to be an important mecha- 
nism by which PMNs respond acutely to infection, mRNA tran- 
script/protein concordance is limited, and post-transcriptional 
(and post-translational) modifications also play an important 
role. The alteration of multiple transcriptional regulators, G- 
protein regulators, P0 4 -stathmin, and protein phosphatase 1 
indicates that one of the responses to LPS exposure ia to modify 
subsequent signaling events by bacterial components or by 
other cytokines and chemokines. Finally, the finding that p38 
MAPK mediates LPS regulation of a limited subset of tran- 
scripts and proteins underlines the continuing need to define 
signal transduction cascades in the neutrophil. 
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An increased high-mobility group A2 expression level is associated with 
madguant phototype in pancreatic exocrine tissue. 

AbeN, VVatanabe T, Suzuld Y , Matsumoto N . MasakiT . Mori T. Sugiyama M 
Chiappctta C , Fusco A . Atonii Y . ' ' 

First Department of Surgery, Kyorin University School of Medicine, 6-20-2 Shinkawa 
Mitaka, Tokyo 181-861 1, Japan, abenbtg@kyorin-u.ac.jp 

The altered form , of the high-mobility group A2 (HMGA2) gene is somehow related to 
the generation of human benign and malignant tumours of mesenchymal origin 
However only a few data on (he expression of HMGA2 in malignant tumour originating 
from ep.diel.al t.ssue are.available. In this study, we examined the HMGA2 expression 
level m pancreatic carcinoma, and investigated whether alterations in the HMGA2 
expression level are associated with a malignant phenotype in pancreatic tissue Higli- 
mob.l.ty group A2 mRNA and protein expression was determined in eight surgically 
resected specimens of non-neoplastic tissue (six specimens of normal pancreatic tissue 
and two of chronic pancreatitis tissue) and 27 pancreatic carcinomas by highly sensitive 
reverse transcriptase-polymerase chain reaction (RT-PCR) techniques and 
immunohistochemical staining, respectively. Reverse transcriptase-polymerase chain 
reaction analysis revealed the expression of the HMOA2 gene in non-neoplastic 
pancreatic tissue, although its expression level was significantly lower than that in 
carcinoma. Immunohistochemical anaJysis indicated that the presence of the HMGA2 
gene in non-neoplastic pancreatic tissue observed in RT-PCR reflects its abundant 
expression m islet cells, togetoer with its focal expression in duct epithelial cells. Intense 
and multifocal or diffuse HMGA2 immunoreactivity was noted in all the pancreatic 
carcinoma examined, A strong correlation between HMGA2 overexpression and the 
diagnosis of carcinoma was statistically verified. Based on these findings we propose 
that an increased expression level of the HMGA2 protein is closely associated .with the 
malignant phenotype m the pancreatic exocrine system, and accordingly HMGA2 could 
serve as a potential diagnostic molecular marker for distinguishing pancreatic malignant 
cells horn non neoplastic pancreatic exocrine cells. 
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extracts. If these minor cell proteins differ among cells to the same extent as the 
more abundant.proteins, as is commonly assumed, onfy a small number of pro- 
tein differences (perhaps several hundred) suffice to create very large differences 
in cell morphology and behavior. 

A Cell Can Change the Expression of Its Genes 
in Response to External Signals 3 

Most of the specialized cells in a multicellularorganism are capable of altering 
their patterns of gene expression in response to extracellular cues. If a liver cell 
. is exposed to a glucocorticoid hormone, for example, the production of several 
specific proteins is dramatically increased. Glucocorticoids are released during 
periods of starvation or intense exercise and signal the liver to increase the 
production of glucose from amino acids and other small molecules; the set of 
proteins whose production is induced includes enzymes such as tyrosine amino- 
transferase, which helps to convert tyrosine to glucose. When the hormone is no 
longer present, the production of these proteins drops to its normal level. 

Other cell types respond to glucocorticoids in different ways. In fat cells, for 
example, the production of tyrosine aminotransferase is reduced, while some 
other cell types do not respond to glucocorticoids at all. These examples illustrate 
a general feature of cell specialization — different cell types often respond in dif- 
ferent ways to the same extracellular signal. Underlying this specialization are 
features that do not change, which give each cell type its permanently distinc : 
tive character. These features reflect the persistent expression of different sets of 
genes." 

Gene Expression Can Be Regulated at Many of the Steps 
in the Pathway from DNA to FtNA to Protein 4 

If differences between the various cell types of an organism depend on the par- 
ticular genes that the cells express, at what level is (he control of gene expression 
exercised? There are many steps in the pathway leading from DNA to protein, and 
ail of them can in principle be regulated. Thus a" cell can control the proteins it 
makes by (I) controlling when and how often a given gene is transcribed (tran- 
scriptional control), (2) controlling how the primary RNA transcript is spliced or 
otherwise processed (ItNA processing control), i'S) selecting which completed 
rnftNAs in the cell nucleus are exported to the cytoplasm (UNA transport con- 
trol). H) selecting which mRNAs in the cytoplasm are translated by rihosomes 
Uranslational control), (!>) selectively destabilizing certain mKNA molecules in 
i he cytoplasm (mRNA degradation control), or (6) selectively activating, inacti- 
vating, or compartmentalizing specific protein molecules after they have been 
mode (protein activity control) (I'ignre 9 2). 

l : or most genes transcriptional controls are paramount. This makes sense 
.because, of all the possible control points illustrated in Figure 0-2, only transcrip- 
tional control ensures that no superfluous intermediates are synthesized. In the 
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Summary 

The ttumy types of cells in animals tint! plants are created largely through mecha- 
nisms that cause different genes to be transcribed in different cells. Since many spe r 
cialized animal t ells can maintain their nnhpte character when grown in culture, the 
gene regulatory mechanisms involved in creating them must he stable once est ah- 
lished.it nd heritable when the. celt divides, endowing the cell with a memory of its 
developmental history. I>rocaryotes and yeasts provide unusually accessible model 
systems in which to study gene regulatory mechanisms, some of which may be rel- 
evant to the creation of specialized cell types in higher encaryotes. One such mecha- 
nism -involves a competitive interaction between two (or more) gene regulatory pro- 
wins, each of which inhibits the synthesis of the other; this can create a flip-flop 
switch thin switches a cell between two alternative patterns of gene expression. Di- 
rect or indirect positive feedback loops, which enable gene regulatory proteins to 
perpetuate their own synthesis, provide a general mechanism for celt memory 

In encaryotes gene transcription is generally controlled by combinations of gene 
''regulatory proteins. It is thought that each type of cell in n higher encuryolic organism 
contains a specific combination of gene regulatory proteins that ensures the expres- 
sion aj only those genes appropriate to that type of cell. A given gene regulatory pro- 
tein may be expressed in a variety of circumstances and typically is involved in the 
regulation of many genes. 

In tnhtition to diffusible gene regulatory proteins, inherited states of chromatin 
condensation are also utilized by encaryofic cells to regulate gene expression, hi ver- 
tebrates DNA ittediylafion also plays a part, mainly as a device to reinforce decisions 
aboia gene expression that are aunte initially by other mechanisms. 



Although ronnols on thi- initiation ol ^:iU' t ransci iption are the predominant 
toim ol iviMil:nion lor most genes, other controls- run :u*i Inter in the pathway 
from liNA to piotern to modulate the amount of gene prodma that is made. Al- 
though ih.-M- posnnm.soipiional controls, whirh operate alter RNA polymerase 
ha.s hound to rite ;.',ene"s promoter and hegnn RNA synthesis, are less common 
than tr.insitiptianot control, lor many genes I hey aie en trial. It seems thai every 
step in gene expression that could he eontiolled in piineiple is likely to be regu- 
lated niiili'i .some riicnmsianres lot some genes! 

We .-mv.ider the varieties ol post transcriptional regulation in temporal 01- 
tin. a. r or dm- to the sequence ol »- vents that might be espri ieneed by an UNA 
inolernlr alir-i its nanseripiion has. begun I r ignre i ■ 
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FROM DNA TO RNA 

. Transcription and translation are the means by which cells read out, or express, 
the genetic instructions in their genes. Because many identical RNA copies can 
be made from the same gene, and each RNA molecule can direct the synthesis 
of many identical protein molecules, cells can synthesize a large amount of 
protein rapidly when necessary. But each gene can also be transcribed and 
translated with a different efficiency, allowing the cel.) to make vast quantities of 
some proteins and tiny quantities of others (Figure 6-3). Moreover, as we see in 
the next chapter, a cell can change (or regulate) the expression of each of its 
genes according to the needs of the moment — most obviously by controlling 
the production of its RNA. 

Portions of DNA Sequence Are Transcribed into RNA 

The first step a cell lakes in rending out a needed part of its genetic instructions 
is to copy a particular portion of its DNA nucleotide sequence— a gene— into an 
RNA nucleotide sequence. The information in RNA, although copied into another 
chemical form, is still wiiiten in essentially the same language as it is in DNA— 
the Janguage of a nucleotide sequence. Hence the name transcription. 

LiJce DNA, RNA is a linear polymer made of four different types of nucleotide 
subunits linked together by phosphodiester bonds (Figure 6-1). It differs from 
DNA chemically in two respects: (J) the nucleotides in RNA are 
ribonucleotides — that is. ihey contain the sugar ri bos e (hence the name ribonu- 
cleic acid) rather than deoxyribose; (2) although, like DNA, RNA contains the 
bases adenine (A), guanine (G), and cytosine (C). it contains .the base uracil tU) 
instead of ihe thymine IT) in DNA. Since U, likeX con base- pair by hydrogen 
bonding with A (Figure 6-5). the complementary base- pairing properties 
described for DNA in Chapters -J and 5 apply also to RNA (in RNA, G pairs with 
C, and A pairs with 1.1). It is not uncommon, however, to find other types of base 
pairs in RNA: for example, G pairing with U occasionally. 

Despite these small chemical differences, DNA and RNA differ quite dra- 
matically in overall structure. Whereas DNA always occurs in cells as a double 
stranded helix, RNA is single- stranded. RNA chains therefore fold up into a 
variety of shapes, just as a polypeptide chain folds up. to form the final shape of 
a protein (Figure 6- 6). As we see later in this chapter, the abrJrtv to fold into com- 
plex three-dimensional shapes allows some RNA molecules to have sum: t oral 
and catalytic functions. 



Figure 6-3 Genes can be expressed 
with different efficiencies. Gene A is : 
transcribed and translated much more 
efficiently than gene B.This allows the 
amount of protein A in the cetl to be 
much greater than that of protein B. 



Transcription Produces RNA Complementary to 
One Strand of DNA 

AJI of the RNA in a cell is made bv ON A n arisen prion, a process thai has cer- 
tain similarities to the process of DNA replication discussed in Chapter f>. 
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Figure 6-90 The production of a 
protein by a eucaryotic celL The final 
level of each protein in a eucaryotic ceB 
depends upon the efficiency of each step 
depicted. 
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urc 6-90) conlri be regulated by the cell for c;uh individual protein. However, ;is 
we sh:il! see in Chapter 7, the initiation of transcription is the most common 
point for a cell to regulate the expression of each of its genes'. This makes sense, 
inasmuch as the most efficient way to keep a gene bom being expressed is io 
block the very .first step— the transcription of its ONA sequence into an UNA 
molecule. 



Summary 

Vie translation of the nucleotide sequence of nn mlbVA molecule into protein takes 
place in the cytoplasm on a large ribonncteoprotein assembly called n ribostmte. J he 
amino nods used for protein synthesis are first attached to a family of tftNA 
molecules, each of which recognizes, try complementary base- pair interactions, pat- 
titular sets of three nucleotides in the mliNA (cottons). The sequence oj nucleotides in 
the mHNA is then read from one etui to the other in sets of three according to the 
geiretic code. 

To initiate translation, a small ribosomnl snbunit binds to the mflNA molecule 
at n start rodon (AUG) that is recognized by a unique initiator tlWA molecule. A 
large ribosomnl snbunit binds to complete the ribosome and begin the elongation 
phase of protein synthesis. During this phn.\e r aminoocyl ifh\ : As—each hearing a 
Specific amino acid bind sequentially to the appropriate cod on in mltNA by forming 
complementary base pairs with the tRNA anticodon. f.ach amino acid is added to the 
C terminal end of the growing polypeptide by means of a cycle of three sequential 
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Figure 7-5 Six steps at which 
cucaryotic gene expression can be 
controlled. Controls that operate at 
steps ! through 5 are discussed in this 
chapter. Step 6. the regulation of protein 
activity, includes reversible activation or 
inac ovation by protein phosphorylation 
* (discussed in Chapter 3) as well as 
irreversible inactivatton by proteolytic 
degradation (discussed in Chapter 6). 



Gene Expression Can Be Regulated at Many of the Steps 
in the Pathway from DNA to RNA to Protein 

If differences among the various cell types of an organism depend on the partic- 
ular genes that the cells express, at what level is the control of gene expression 
exercised? As we saw in the last chapter, there are many steps in the pathway 
leading from DNA to protein, and all of them can in principle be regulated. Thus 
3 cell can control the proteins it makes by (1) controlling when and .how often a 
given gene is transcribed (iranscriplionaj control), (2) controlling how (he RNA 
transcript is spliced or otherwise processed (RNA processing control), (3) 
selecting which completed mflNAs in the cell nucleus are exported to thecyiosol 
and determining where in Ihe cytosol they are localized (RNA transport and 
JocaJization control), (4) selecting which mKNAs in the cytoplasm are translated 
by ribosomes (translational control), (5) selectively destabilizing certain mRNA 
molecules in the cytoplasm (mRNA degradation control), or (6) selectively acti- 
vating, inactivating, degr ailing, or compartmentalizing specific protein 
molecules after they have been made (protein activity control) (figure 7-5). 

Vor most genes Transcriptional controls ate paramount. This makes sense 
because, of all the possible control points illustrated in Pigure 7-5, only tran- 
scriptional control ensures that die cell will not synthesize superfluous interme ; 
dimes. In the following sections 'we discuss ihe DNA and protein components 
ibat perform this function by regulating die initiation of gene transcription. We 
vhall t el urn at the end of the chapter to the additional ways of regulating gene 
impression. 



jummary 

Ihe- genome of a cell contains in its ON A serf net tee the information to ntake many 
thousands of different protein and UNA molecules. A cell typically expresses only ,i 
inaction of its genes, and the different types of cells in multicellular organisms arise 
i---.cn use different sets of genes are expressed. Moreover, cells can change the pattern 
r ? genes they express in response to changes in their environment, such as signals 
'■r/m other cells. Although all of the steps involved in expressing a gene can in pint- 
■ :pic be regulated, for most genes the initiation of UNA transcription is the most 
-.portant point of control. 



" av does a cell determine which of "us thousands of genes to iianscribe? As 
rationed briefly in Chapters -I and b. the tiansoipiion of each e.ene is con 
i. Hed by a regulatory region of DNA relatively neat the site where transcription 
'. ;"j)')S. -Some icgulatojy regions :nr simple and act as switches that ate i hi own 
*■ ■': single signal. Many otheis ate complex and a<. t a:> tiny mioopiocrssois. 
ponding to a variety of signals that ihi.-y inteipiet 'and integrate to switch ihe 
::/hboting gene on or oft. Whether c omple.x cu simple, these switching devices 
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Selective apoptosis of natural killer-cell tumours by l-asparaginase. 

Ando M , Sugimoto K , Kitoh T , Sasaki M , Mukai K , Ando J , Egashira M , Schuster 
SM, Oshiini K . 

Department of Haematology, Juntendo University School of Medicine, Tokyo, Japan. 

We examined the effectiveness of various anti -tumour agents to natural killer (NK)-cell 
tumour cell lines and samples, which are generally resistant to chemotherapy, using flow 
. cytometric terminal deoxy nucleotidyl trans ferase-mediated dUTP-biotin nick end- 
iabelling (TUNEL) assay. Although NK-YS and NK-92 were highly resistant to various 
anti-tumour agents, ^asparaginase induced apoptosis in these two NK-cell lines. NIC-cell 
leukaemia/lymphoma and acute lymphoblastic leukaemia (ALL) samples, were . 
selectively sensitive to I -asparaginase and to doxorubicin (DXR) respectively. Samples of 
chronic NK lymphocytosis, an NK-cell disorder with an indolent clinical course, were 
resistant to both drugs. Our study clearly separated two major categories of NK-cell 
. disorders and ALL according to the sensitivity to DXR and l-asparaginase. We examined 
asparagine synthetase levels by real-time quantitative polymerase chain reaction (RQ- 
PCR) and immiuiostaining in these samples. At least in nasal-type NK-cell lymphoma, 
there was a good correlation among asparagine synthetase expression, in vitro sensitivity 
and clinical response to l-asparaginase. In aggressive NK-cell leukaemia, although 
asparagine synthetase expression was high at both mRNA and protein levels, l- 
asparaginase induced considerable apoptosis. Furthermore, samples of each disease entity 
occupied a distinct area in two-dimensional plotting with asparagine synthetase mRNA 
level (RQ-PCR) and in vitrol -asparaginase sensitivity (TUNEL assay). We confirmed 
rather specific anti-tumour activity of l-asparaginase against NK-cell tumours in vitro, 
which provides an experimental background to the clinical use of l-asparaginase for NK- 
cell tumours. 
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Human thyroid carcinoma cell lines and normal thyrocytes: expression 
and regulation of matrix metalloproteinase-1 and tissue matrix 
metalloproteinase inhibitor-1 messenger-RNA and protein. 

AustG, Hofmann A. Laue S, Rost A . Kohler T , Scherbatim WA . 

Instirut of Anatomy, University of Leipzig, Germany. 

Matrix metalloproteinase- 1 (MMP-1) and tissue matrix metalloproteinase inhibitor 1 
(TIMP-1) play an important role, in remodeling the extracellular matrix in normal and 
pathological processes. The effect of phorbol-myristate acetate (PMA), interleukin-l (IL- 
.1), and tumor necrosis factor-alpha (TNF-alpha) on MMP-1 and TIMP-1 expression was 
studied on highly purified thyrocytes and undifferentiated 8505 C, C 643, HTh 74, SW 
1736 thyroid carcinoma cells compared with thyroid-derived fibroblasts. Messenger RNA 
(mRNA) levels were monitored by competitive semiquantitative reverse transcriptase 
polymerase chain reaction (RT-PCR) after 24 hours. Culture supernatants were assayed 
for free artd/or complexed MMP-1 and TIMP-1 after 48 hours using enzyme-linked 
immunosorbent assay (ELISA) systems (detection limit: <2 ng/mL). MMP-1 and TIMP-1 
mRNA were present in all cell types, although thyrocytes showed MMP-1 mRNA levels 
near the detection limit. 8505 C expressed MMP-1 mRNA levels of up to 10(6) times 
those of the other cells analyzed. PMA and IL-I increased MMP-1 mRNA in most cell . 
types. TIMP-1 mRNA increased after treatment with PMA in all cells except 8505 C, 
whereas only slight effects were shown after IL-I stimulation. MMP-1 protein was 
undetectable in normal thyrocyte cultures, but was secreted spontaneously by all cell 
lines ([ng/mL]; C 643: 15+/-7; HTh 74: 81+/- 1 ; SW 1736: 13+/-2; 8.505 C: 2097+/-320). 
There was a strong correlation between levels of MMP-1 mRNA and protein (r = 0.99, p 
< .0001). PMA and 1L-1 increased MMP-1 secretion in all cell types after 48 hours. 
Fibroblasts ([ng/mL] 517+/-55) and the cell lines (C 643: 142+/-48; HTh 74: 1 15+/-13; 
SW 1736: 202+/- 14; 8505C: 1 20+/- 19) secreted TIMP-1 in unstimulated cultures, 
whereas only a trace amount. was detected in thyrocyte cultures, even after PMA 
treatment. IL-1 upreguiated TjMP-I secretion after 48 hours in SW 1736, HTh 74, and C 
643 cells. Our data suggest that in contrast to nonnal thyrocytes, dedifferentiated thyroid 
carcinoma cell lines are potential producers ofMMP-1 as well as TIMP-1. High MMP-1 
or MMP-I/TIMP-1 expression may play a role in tissue invasion of undifferentiated 
thyroid cancer cells. 

PM1D: 9349574 (PubMed - indexed for MEDLINE] 
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ABSTRACT 

Matrix metalloproteinase- 1 (MMP-1) and tissue matrix metaJIoproteinase inhibitor 1 (TIMP-1) play an impor- 
tant role in remodeling the extracellular matrix in normal and pathological processes. The effect of phorbol-myris- 
tate acetate (PMA), interleukin-1 (EL-1), and rumor necrosis factor- a (TNF-a) on MMP-1 and T1MP-1 expres- 
sion was studied on highly purified thyrocytes and undifferentiated 8505 Q C 643, HTh 74, SW 1736 thyroid 
carcinoma cells compared with thyroid-derived fibroblasts. Messenger RNA (mRNA) levels were monitored by 
competitive semiquantitative reverse transcriptase polymerase chain reaction (RT-PCR) after 24 hours. Culture 
5upernatants were assayed for free and/or complexed MMP-1 and TTMP-1 after 48 hours using enzyme-linked 
immunosorbent assay (EL1SA) systems (detection limit: <2 ng/mL). MMP-1 and T1MP-1 mRNA were present in 
all cell types, although thyrocytes showed MMP-1 mRNA levels near the detection limit. 8505 C expressed MMP- 
1 mRNA levels of up to 10 6 times those of the other cells analyzed. PMA and.DL-1 increased MMP-1 mRNA in 
most cell types. T1MP-1 mRNA increased after treatment with PMA in all cells except 8505 C, whereas only 
slight effects. were shown after DL-1 stimulation. MMP-1 protein was undetectable in normal thyrocyte cultures, 
but was secreted spontaneously by all cell lines ([ng/mL]; C 643: 15 ± 7; HTh 74: 81 ± 1; SW 1736: 13 ±. 2; 
8505C: 2097 ± 320). There was a strong correlation between levels of MMP-1 mRNA and protein (r= 0.99, 
p < .0001). PMA and EL-1 increased MMP-1 secretion in all cell types after 48 hours. Fibroblasts {[ng/mL] 517 ±: 
55) and the cell lines (C 643: 142 ^ 48; HTh 74: 115 ± 13; SW 1736: 202 ± 14; 8505 C: 120 ± 19) secreted 
T1MP-1 in unstimulated cultures, whereas only a trace amount was detected in thyrocyte cultures, even after PMA 
treatment. EL-1 upregulated T1MP-1 secretion after 48 hours in SW 1736, HTh 74, and C 643 cells. Our data 
suggest that in contrast to normal thyrocytes, dedifferentiated thyroid carcinoma cell lines are potential produc- 
ers of MMP-1 as well as T1MP-1. High MMP-1 or MMP-l/TlMP-l expression may play a role in tissue inva- 
sion of undifferentiated thyroid cancer cells. 



INTRODUCTION 

Matrix metal lopkoteinases, (MMPs) constitute 3 
. family of structurally related proteolytic en2ymes re- 
sponsible /or rhe proteolytic degradation of extracellular 
matrix (ECM) components. They arc important partici- 
pants in normal tissue remodeling and contribute to the 
phenoiype ol several pathological conditions thai are as^ 
sociated with progressive ECM degradation. MMPs are 
highly regulated at different levels (1). At the transcrip- 
tional level, MMP expression can be directly induced or 



suppressed on external stimulation, ie, with cytokines, 
phorbo) 12-myristate 13-acetate (PMA), lipopolysaccaride 
(LPS), or retinoic acid (2,3). After secretion at post-uan- 
scriptional level, latent MMP proenzymes are regulated by 
proteolytic activation and interaction with tissue inhibitors 
of matrix metalloproteinase (TIMPs), their specific in- 
hibitors. Any imbalance berween rhe proteolytic MMPs ac- 
tivities and the TIMPs that could be influenced and caused 
by cytokines could potentially lead to pathological condi- 
tions (4). 

MMP-1, although known as an interstitial collagenase, 
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is the only enzyme active at neutral pH that can degrade 
extracellular fibers comprised of collagen types I, I], and 
HI. With this initial sre P> MMP-1 provides the cleavage 
products to other collagenase types (5). The major specific 
inhibitor of MMP-1 is TlMP-1, a 28.5-kd glycoprotein, 
which forms 1:1 stoichiometric complexes with the pro- 
tease (6). Cytokines and growth factors have been shown 
to regulate the expression of both MMP-1 "and T1MP-1 
U>7>8). 

Although the participation of MMP-1 as the initial col- 
lagenase in tissue breakdown during tumor development is 
well documented (9-11), only one srudy has described the 
expression (12) but no srudy has as yet investigated the reg- 
ulation of this enzyme in different thyroid tumors. Few stud- 
xL™ VC bCCn pub,ishcd inv «tigating the role of other 
MMPs m normal and pathological thyroid tissue by in situ 
hybridization and immunohistochemistry (13-16). Fur- 
thermore, tissue remodeling includes both the action of 
MMPs and thetr inhibitors; thus, these enzymes could be 
involved in autoimmune and other nonautoimmune thyroid 
diseases during morphological changes (17,18). It is srill un- 
known whether or not thyrocytes are able to express MMPs 
PS ' A,rhou & b W c W collagenases (MMP-2 and 
MMP-9) were deteaed in various human epithelial cells of 
different tissue origin (19,20), only one study described the 
secretion of MMP-1 by epithelial cells (21). 

Highly purified normal thyrocytes and four thyroid car- 
cinoma cell lines were included in this study to investigate 
t.he involvement of these cells in MMP-1 and TlMP-1 pro- 
duction during thyroid tissue remodeling processes and in 
malignant thyroid neoplasms. MMP-1 and TIMP-1 ex- 
pression were studied at both the mRNA and protein level 
by semiquantitative RT-PCR and EL1SA measurement, re- 
specrivdy. 

In unstimulated carcinoma cell lines both MMP-1 and 
J IMP-1 mRNA were expressed, partly at a high level, fol- 
lowed by the spontaneous secretion of the proteins The 
various conditions for the stimulation of the different cell 
hues by cytokines and PMA were defined. In contrast to 
the cell hnes, normal thyrocytes did not secrete MMP-1 
and only trace amount of TlMP-1, even after stimulation 
wnh PMA. 



MATERIALS AND METHODS 

Preparation of tissues, thyroid-derived cells and 
cell lines 

Thyrocytes were prepared from surgical thyroid speci- 
mens from 3 patients (1 Graves' disease, 2 nontoxic goi- 
ter; mean age 543 ± 5.0 years). Fibroblasts were separated 
from thyroid tissue of 5 other patients (3 Groves' disease 
I nontoxic goner; mean age 43.6 * 6.4 years). Graves' dis- 
ease and nontoxic goiter were diagnosed on the strength 
of clinical, biochemical, and immunologic features as well 
as thyroid scintiscans. 

Thyroid tissue was rrimmed of fat and connective tissue 
immediately aher surgery. Thy,o»d,der.ved cells were en- 
riched aher g.adual enzymatic digestion of tissue and cul- 
tured over a period of 3 6 hours as described. Thyrocvtes 
were obia.ned from the adherent faction bv incubaii.,,. 
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the cell monolayer with phosphate buffered saline (PBS) 
without Ca^/Mg 2 * for 45 minutes (22). Residual fibro- 
blasts were removed after subsequent incubation of the 
cells with the fibroblast-spccifk mab FibASQl {22} and 
goat-anti-mouse lgG-DYNABEAOS® M450 (DYNAL, 
Hamburg, Germany) according to the manufacturer's pro- 
tocol . 

Thyroid-derived fibroblasts were obtained after culruring 
small pieces of thyroid tissue in Dulbeccos's Modified Ea- 
gle's Medium (DMEM) with 10% fetal calf serum (FCS) 
and harvested in the 5th to 7th passage. The purity of the 
thyrocytes and fibroblasts was determined by using indirect 
immunofluorescence technique on a FACS-Scan {Becton 
Dickinson GmbH, Heidelberg, Germany) as described (22). 

The following human anaplastic thyroid carcinoma cell 
hnes were cultured in DMEM with 10% FCS- C 643 (23)- 
SW 1736 (23); and HTh 74 (24). The cell line 8505 C (25) 
was purchased from the German Collection of Microor- 
ganisms and Animal Cell Cultures (DSM ACC219). This 
cell Ii„ c was established from a primary thyroid tumor 
characterized histologically as.a undifferentiated carcinoma 
that way partially composed of poorly differentiated pap- 
illary cells (25). This is a feature of a subgroup of anaplas- 
tic carcinoma (26). The majority of these coexistent better 
differentiated carcinoma foci in anaplastic carcinoma were 
papillary (26); . 

In vitro cultures 

Using 24- well plates, 1 X 10* cells were cultured for 24 
hours. The medium was aspirated and replaced wiih 500 
fiL OPT1-MEM (C1BCO BRL, Grand Island, NY) without 
FCS to eliminate possible stimulation of MMP-1 arid 
TlMP-1 production by FCS. The medium contained the 
desired concentration of human IL-la (10 U/ml; Pepro 
Tech EC Ltd., London, UK), TNF-a (100 U/mL; Pepro 
Tech EC Ltd.), interferon-/ (IFN-7) (500 U/mL,- Pepro 
Tech EC Ltd.), or 10 ng/mL PMA (SIGMA). 

Triplicate cultures of each stimulator were analyzed af- 
ter 3, 6, and 24 hours at the mRNA and after 24 and 48 
hours at the protein level. The supernatants were removed 
and stored at - 80*C for further use. Fixst,.a.collagenolytic 
assay based on the digestion of type 1 collagen was per- 
formed. This method showed direct evidence of free pro- . 
MMP : 1 enzyme in the cell culture supernatants of un- 
stimulated and IL-la stimulated 8505 C, HTh 74, and 
C634 cells (data not shown). However, the method' does 
not allow quantitation of MMP-1 enzyme activity. Thus, 
the cell culture supernatants were assayed for MMP-1, 
TlMP-1, and MMP1/HMP-1 complex by ELISA (Arner- 
sham Life Sciences, Braunschweig, Germany). The MMP- 
1 assay (sensitivity: 1.7 ng/mL) detected only total human 
MMP-1 , ie, free MMP-1 and MMP-1 complexcd with in- 
hibitors such as T1MP-T it did not detect MMP-1 bound 
by the nonspecific piorease inhibitor a 2 -ma£r°globulin. 
The MMP-J/TIMIM assay (sensitivity: 1.5 ng/mL) de- 
lected MMP-1/T1MP-1 complex, ie, activated MMP-1 thai 
has been subsequently completed with the specific MMP- 
1 inhibitor HMP-1. h did not detect free active MMP-1, 
iree TlMP-1, or pro-MMP-3. There was no cross-reactiv- 
ity with aaive MMP-1 bound by the nonspecific protease 
inhibitor o ? -maeroglobulin. The TIMP- 1 assay (sensitivity: 
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1.Z5 ng/mL) detected total human TIMP-1, ie, free TIMP- 
1 and that complexed with MMPs. The assay did fully 
cross-react with TIMP-1 in complexes with other MM P. 
It did not cross-react with TiMP-2. 

RNA extraction and cDNA synthesis 

For gene expression studies, 5 mL RNAzol™ B (Biotex 
Laboratories Inc., Houston, TX) was added to the cell cul- 
ture wells. The content of three wells of any cell type was 
pooled and then stored frozen for further mRNA analysis 
in liquid nitrogen. Total cellular RNA (cDNA) was iso- 
lated from the probes according to the manufacturers pro- 
tocol. RNA was fractionated on a denaturing 1.0% agarose 
gel and stained with ethidium bromide to confirm that, 
spectrophotometric measurements were accurate and that 
the RNA had not been degraded. Five micrograms total 
RNA was reverse-transcribed to cDNA using the First- 
strand cDNA synthesis kit of Pharmacia (Uppsala, Swe- 
den), in a total reaction volume of 15 jiL. 

mRNA analysis by competitive RT-PCR 

To correct for variations across different cDNA prepa- 
rationsi samples were first adjusted to contain equal in- 
put glyceralderyde-3-phosphate dehydrogenase (GAPDH) 
cDNA concentrations. Semi-quantitative GAPDH RT-PCR 
was used with a heterologeous synthetic competitor frag- 
ment. The generation of the specific PCR products from 
the competitor and the cDNA with the GAPDH primers 
were published earlier (22,27). 

We then estimated the MMP-1 and TTMP-1 cDNA in 
these adjusted samples. The primers were selected using the 
DNAsis computer program (Hitachi Software Engineering 
Co, Yokohama, Japan). The primer pairs span one or more 
introns to allow unambiguous discrimination between 
cDNA and unwelcome contaminating genomic DNA. In 
quanritating MMP-1 and TIMPrl cDNA, a rapid one-step 
method was introduced to synthesize an internal honriolo- 
geous competitor (plan diagram of procedure: Fig. 1, ex- 
emplary for MMP-1 [28J--A hybrid primer was synthesized 
{MMP-lhy) that consisted of two segments (segj, scg 2 ). It 

MMPii 

5' - 3' 



s eq?* : 

MMPlhy^ MMPlr : 



5 478 bp 



MMPt internal homotogeous compelrlor 

F)G. 1. Genera) scheme lor generating homologous com- 
petitors used (or quantitative PCR. 



had a length of 40 nucleotides, in which 20 nucleotides (segj) 
at the y end corresponded to the opposite strand of the tar- 
get sequence a predetermined distance from primer MMP- 
lf, and 20 nucleotides at the 5' end (seg2 = MMP-lr) that 
corresponded to the target sequence upstream from the seg- 
ment seg|. Amplification with the primers MMP-lf and 
MMP-lhy from the cDNA resulted in a 478- base pair (bp) 
(polymerase chain reaction (PCR) product. It was freed from 
excess primers and deoxynuclepside-triphosphates (dNTPs) 
using the Qia.quick Gel Extraction Kit (Qiagen GmbH, 
Hilden, Germany) and. quantified. A known number of 
copies of the competitor was introduced in the GAPDH-ad- 
justed samples and amplified with me primers MMP-lf and 
MMP-lr. With this approach, two products were generated, 
one derived from the cDNA (560 bp) arid another, 82 bp 
smaller in size derived from the internal competitor (Fig. 1). 
PCR. products were resolved by gel electrophoresis (1.5% 
agarose gel). The relative amounts of sample cDNA and 
competitor were quantified by measuring the intensity! of 
ethidium fluorescence with a CCD -image sensor and .ana- 
lyzing the data with the EASY program (Herolab, Wiesloch, 
Germany); The initial amounts of sample cDNA and com- 
petitor were assumed to be equal in those reactions where 
the ratio of the two products was judged to be equal. This 
was expressed in arbitrary units (AU) (22,29). One AU was 
defined as the lowest concentration of competitor yielding* 
a detectable amplification product when added to PCR 
alone. For example, if equivalence between sample cDNA . 
and competitor was reached using a 100-ToId concentrated 
competitor the relative sample cDNA concentration was 100 
AU. Thyrocytes and the cell lines were analyzed for the ex- 
pression of thyroid-specific and cytokine receptor mRNAs 
in a simple RT-PCR. The sequences of the TPO and cy- 
tokine receptor primer pairs have been published by Wat- 
son et al. (30) and Tada et al. (31) and gave the following 
product sizes: TPO: 506 bp; 1L-1R type 1 (p80):300 bp; 1L- 
1R type U (p68): 392 bp; TNF-aR ( p 75): 324 bp; TNF-aR 
(p55): 587 bp and IFN-yR: $99 bp. The thyroglobulin (Tg) 
and thyroid stimulating hormone receptor (TSH-R) primer 
pairs were selected according to the published sequences us- 
ing the DNAsis program (Table 1). 

Each 25-pL amplification reaction contained 2.5 /iL 
10 X concentrated PCR buffer (15 mM MgCI 2 , Boehringer 
Mannheim, Germany), 0.3 U Taq DNA polymerase 
(Boehringer Mannheim, Germany), 100 /iM dNTPs 
(Perkin Elmer, Weiterstadt, Germany), 0.1 of each 

primer (1MB, Jena, Germany), and 1 /iL cDNA and com- 
petitor in adjusted dilution. Furthermore, restriction map- 
ping (restriction enzymes: Boehringer Mannheim GmbH, 
Germany) was carried out to confirm the originality of the 
PCR product (Fig. 1, Table 1). 

Statistics 

Proiein levels of thyrocyte or fibroblast cultures from the 
different patients and of (he thyroid carcinoma cell lines ob- 
tained from ihrec separate experiments were presented as 
mean Z SUM values. Statistical comparisons berween un- 
stimulated and stimulated cell cultures were performed by 
the alternate (Welch) t-test. The correlation berween basal 
mRNA levels and the unstimulated protein secretion in all 
cell rypes was calculated according to the Spearman method. 
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Table J. Primers, Length of Ampufieo Templates, Restriction Mapping and Assay Conditions for RT-PCR 

Primer length of Length of Annealing . Number of 

5'- -3' cDNA (bp) . competitor (bp) temperature cycles 



Tg 


forward 


GCAGATCTTACTGAGTGGCT 


416 




60 


35 




reverse 


TGTCAGCACAGTGGCAATAC 








TSH-R 


forward 


ACTTGCTGCAGCTGGTGCT 


354 




65 


35 


exons 1-4 


reverse 


TGAGGGCATCAGGGTCTATG 








TSH-R 


forward 


GAAATTCGGAATACCAGGAACTTA ACT 


896 




53 


.?5 


cxons 4-10 


reverse 


AACTCATCGGACrTGGGGGTACA 










MMP-1 


forward 


TGGGACCAAACACATCTGAC 


560 


478 . 


64 . 


33 




reverse - 


ATCACTTCTCCCCGAATCGT 










hybrid 


ATCACTTCTCCCCGAATCGT 
CCATATATGGGTTCK3ATGCC 










TTMP-I 


forward 


CTTAGGGGATGCCGCTGACA 


351 


274. 


64 


30 




reverse • 


GGCAGGCAGGCAAGGTGACG 








hybrid 


GGCAGGCAGGCAAGGTGACG 
GGATGGATAAACAGCGAAAC 











Tg indicates ihyrogjobulinc; TSH-R. thyroid stimulating : -hormone receptor. MMP-1. matrix meiaHoproteinas*-!: TIMP-1, tissue inhibitor of 
metalloproleinasc-l; bp, base pair. 



RESULTS • . 

Thyroid specific and. cytokine receptor 
mRNA expression 

Isolated thyrocytes as well as 8505 C cells expressed Tg 
arid thy roper oxidase (TPO) mRNA, whereas transcripts of 
the TSH-R (exons 3-4, 354 bp, exons 4-30, 896 bp) were 
present only in the thyrocytes. The three anaplastic thy- 
roid carcinoma cell lines 3736* C 634, and HTh 74 
were completely negative for the Tg, TPO, and TSH-R mR- 
NAs (Fig. 2). All cell lines and thyrocytes expressed 1L-3R 
(type I and type II), TNF-oR (p75 and p55) and IFN-yR 
mRNA (Fig. 2). 

Basal MMP-) and TIMP-1 mRNA and 
protein expression 

In most stimulation experiments, mRNA levels did not 
increase until 24 hours of incubation. The 24-hour mRNA 
levels arc shown in Figures 3 and 4. The 3- and 6-hour 
levels are demonstrated in those experiments where the 
mRNA levels leached their peak before 24 hours of stim- 
ulation. If nor otherwise indicated, the MMP-1 levels were 
measured using the EL3SA system, which recognizes 
free/complex ed MMP-J. 

MMP-1 and TIMP-1 mRNA were found during un- 
stimulated culture in all investigated cell types, although 
the mRNA levels varied over a great range. 8505 C showed 
a basal MMP-1 mRNA level 20 rimes as high as those 0/ 
the HTh 74 cells, 6 X JO 4 times as high as C 643, and 2 X 
10 6 times as high as SW 3736 cells. In thyrocytes, MMP- 
3 mRNA levels were found near the deiecrion limit (Figs 
3 and 4). 

Generally, when analyzing the noted cell rypes, the mea- 
sured basal MMP-1 or TiMP-1 mRNA levels correlated 
well with the basal protein expression (MMP-1: r = 0.99, 
P < ,000!;TIMM: r = 0.98, p < .002). Corresponding to 
»he high MMP-J mRNA .level, 8505 C cells secreted ex- 
tremely high levels of MMP-J. No MMP-J or TIMP-1 was 



detected in unstimulated thyrocyte cultures at any time- 
point examined. All other cell rypes showed a spontaneous 
MMP-1 and TIMP-1 secretion (Figs. 5 and 6). Thyroid- 
derived fibroblasts produced basal TIMP-1 levels of up to 
4 times higher In the four carcinoma cell lines, which se- 
creted nearly the same amounts of basal TIMP-1 protein. 
Nevertheless, TIMP-1 secretion 0/ fibroblasts was found 
at lower levels than expected after TTMP-1 mRNA mea- 
surement in 4 of 5 analyzed patients. The results of the fi- 
broblast cultures from patient five showing a higher T1MP- 
1 expression than those from the 4 other patients (basal 
24 hour: 50 ± 2; PMA 24 hour:. 90 ±- 6 ng/mL TLMP-1) 
was omitted in Figure 6. 

Comparing the basal amount of free/complexed and 
TIMP-3 complexed MMP-1 after 24 hours of stimulation, 
a significant level of MMP-1 was not complexed with 
TIMP-1 in 8505 C cultures, whereas in fibroblast cultures 
most of the MMP-1 activity was inhibited by TIMP-3. The 
anaplastic carcinoma cell line HTh 74 did not show such 
a great discrepancy between free/complexed and TTMP-1 
complexed MMP-3 level as 8505 C cells (Fig. 7). 

Effects of IL-la on MMP-1 and TIMP-) mRNA and 
protein expression 

Experiments were performed to determine whether hu- 
man thyroid epithelial cells and thyroid carcinoma cell lines 
could produce or increase basal MMP-3 and TIMP-1 se- 
cretion after exposure to various stimuli. The results from 
these stimulation experiments are summarized in Figures 
3 through 5: Generally, there was a delay in protein se- 
cretion level in comparison to the mRNA expression level. 
At the protein level, the cytokine-mediated stimulating or 
inhibiting effect is more distinct afier 48 hours compared 
with 24 hours, even when the mRNA level had already de- 
creased after 6 hours. 

IL-1 upregulated MMP-3 mRNA in SW 1736 cclls up 
to 100 times and, in thyroid-derived fibroblasts, up to 12 
times after 24 hours of incubation (Fig. 3). This increased 
mRNA le vel was accompanied by 3 significantly enhanced 
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TPO 

Tg 

TSH-R 
Exon 1- 4 

II-1-R1 
II-1-R2 
IFN-yR 

P75TNFR 
p55TNFR 



FIG. 2. Amplification of thyroid specific and interleukin-rcceptor mRNA in thyrocytes (1), SW J 736 (2), C 643 (3), HTh 
74 (4) and 8505 C (5) cells using RT-PCR; M f - 1 00-bp ladder (G1BCO). 



MMP-1 secretion after 48 hours. Furthermore, IL-1 a in- 
creased MMP-1 mRNA expression in thyrocytes up to 
seven times after 6 hours, but no MMP-1 protein could be 
detected in thyrocyie cultures. IL-1 had no stimulatory ef- 
fect on MMP-1 mRNA expression in C 643, HTh 74, and 
8505 C cells after 24 hours, although a significant increase 
of MMP-1 secretion was found in HTh-74 and 1736 
cells after 48 hours of incubation (Fig/5). This discrepancy 
may be explained by a possible increase in MMP-1 mRNA 
level after 24 hours. of stimulation. The S3ine effect could 
also be observed in the IL-1 stimulated T1MP-1 at the 
mRNA as well 3S the protein level: the only slight effect 
of IL-1 on T1MP-1 mRNA expression in carcinoma cell 
lines after 24 hours was accompanied by a significant in- 
crease of TJMP-1 secretion in 8505 C and HTh 74 cells 
after 48 hours (Figs. A and 6). 

Effects of TNF-a on both MMP-1/TJMP-] mRNA 
and protein expression 

In contrast to IL-1, TNF-a did not stimulate the MMP- 
J and T1MP-1 mRNA and protein levels in all carcinoma 
cell lines and thyrocytes. Only thyroid-derived fibroblasts 
responded with a slight upregulation of MMP- 1 and T1MP-, 
1 mRNA expiession after TNF-a stimulation, which was 
not accompanied by an increase of MMP ] and PIMP- 1 
secretion. 



Effects of PMA, and IFN-y on MMP-1 and TIMP-1 
niRNA and protein expression 

PMA was included in our study as a positive control 
because it is known to upregulate.or induce both MMP- 
1 and T1MP-J secretion in various cell types (1,32). In- 
deed, PMA was able' to induce or enhance MMP-1 
mRNA levels in all cell types investigated, although the 
detected levels varied to a large extent (Fig. 3). This re- 
sult is in good correlation with the significantly increased 
MMP-1 protein levels that were already detectable after 
24 hours of stimulation (Fig. 5). PMA upregulated T1MP- 

1 mRNA levels by up to 20 times in C 643, and up to 

2 times in SW 1736 and HTh 74 cells, fibroblasts and 
thyrocytes, but it did not change theTIMP-,1 mRNA con- 
tent in 8505 cells (Fig. 4). At the protein level, we found 
a significant stimulation of TIMP-1 secretion in C 643 
and HTh 74 cells, as well as in thyroid-derived fibrob- 
lasts (Fig. 6). 

In contrast to PMA, IFN-7 W3S without effect on stim- 
ulation 01 downregulation of MMP-1 and TIMP-1 mRNA 
or protein in 3nv of the cell rypes investigated (Figs 5 and 

6). 

I he mam inhibitor of MMP* 1 is TiMP I, which forms 
1:1 stoichiometric complexes with MMP- 1 , although some 
Other inhibitors can also bind MMP-1. On the other hand, 
TIMP I can bind oihrr MMP types. 
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FIG. 3. Representative samples of competitive amplified MMP-1 mRNA of thvrocytes, thyroid-derived fibroblasts and thy- 
roid "rcinoma cell Imes without stimulation (control) and after stimulation with 10 U/mL IL-la and 100 U/mL TNF-a and 
, n * ml Scn £ diIul,0 » s of known amounts of the competitor fragment were coamplified with identi- 

cal ahquots of cDNA The 560-bp (cDNA) and 478-b P (competitor) PCR products were visualized by agarose gel electrophoresis 
and e.h^mmb^mide stainmg. The relative concentration of the added competitor was given in arbitrary units (AU) in the 
AU . wa J dcf,ncd as thc iowest. concentration of the competitor yielding a detectable amplification for MMP-1 
mKNA._ ihe ratio of competitor to cDNA fragments was determined by measuring the intensity of ethidium fluorescence with 
3 LCD image sensor and analysis of data. Measured cDNA concentration can expressed in AU ' 



DISCUSSION 

Our findings demonstrate for the first time that thyroid 
carcinoma cell lines are able to express MMP-1 and T1MP- 
1 mRNA and proiein at significant levels in vitro. The ob- 
servation of spontaneous release of MMP-1 and T1MP-1 
corresponds well with earlier studies covering the secretion 
of these proieins by several carcinoma cell lines (33,34). 

However, in contrast to its clear physiological function 
in extracellular matrix breakdown, the role of MMP- J in 
tumor growth and metastases is still controversial 
(9-11,3.5/. Recently, Murray er al. (10) demonstrated thai 
MMP-1 is associated wirh poor prognosis in colorecral can- 
cer, and has a prognosiic value independent of the Dulces 
stage. There/ore, MMP-1 could be a larger for therapeu- 
tic intervention in such tumors. Furrhermore, the hypoth- 
esis of whether or noi cancer cells themselves arc able to 



produce MMP, or whether cancer cells stimulate the sur- 
rounding stromal cells lo secrete MMP in vivo y is disputed. 
MMP-1 mRNA and protein were detected by both in sitit 
hybridization and immunohisiochemistry in stromal as 
well as tumor cells of head, neck, gastric, colorectal, and 
mammary carcinomas (9,10,36,37). In contrast, 
Karneyama (32) demonstrated by in situ hybridization that 
the MMP-1 mRNA was not expressed in the cancer cells 
but m the surrounding fibrous capsules of strongly differ- 
entiated papillary thyroid carcinoma tissue. Highly differ- 
entiated follicular carcinomas and follicular adenomas 
were depleted for MMP-1 transcripis. Undifferentiated fol- 
licular, papillary, and aggressive anaplastic carcinomas 
that showed poor prognosis and strong tumor invasive and 
metastatic potential and that can be compared in their 
morphological, genetic and growih features with undiffer- 
entiated thyroid carcinoma cell fines were not included this 
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study. However, the missing expression of MMP-1 by nor- 
mal, thymocytes and the spontaneous secretion of this pro- 
tein by highly malignant ihyroid carcinoma cell lines, as 
demonstrated in our study, indicate the involvement of 
MMP-1 secretion of transformed thyrocytes in aggressive 
ihyroid tumors. 

Although all cell lines analyzed in our study sponta- 
neously secreted MMP-1, we observed marked differences 
in the basal secretion capacity. The highest MMP-1 levels 
were determined in cultures of .8505 C cells. Only 8505 C 
cells expressed TPO and Tg mRNA that may be put down 
to residual differentiated components in the cell line (see 
Materials), However, none of the analyzed cell lines ex- 
pressed TSH-R mRNA. The cell population doubling times 
were Jess than 40 hours. All cell lines had accumulations 
of multiple genetic evenis. These facts indicate the undif- 
ferentiated pathology of ihe studied lines. It is Well known 
that anaplastic carcinoma cell lines well retain the malig- 
nant characterises of their parental rumors (38-40). 

Furthermore, we. found a distorted proportion berwecn 
MMP-J and TIMP-J mRNA/proiein for carcinoma cell 
lines but not for normal thyroid-derived fibroblasts. The 
most disadvantageous constellation berween MMP-1 and 
T1MP-1 was found in 8505 C cells. Similar to other stud- 
ies (41), these results suggest the influence of an altered 
MMP/T1MP relation on tumor progression. However, it 



should be mentioned that most studies, including the pre- 
sent one, do not rake into consideration that a number of 
inhibitors distinct from TIMP-1 may regulate MMP-1 ac- 
tivity. Taking into account that the balance of active en- 
zyme and TIMP-1 concentration strongly influence the ex- 
tent of local matrix degradation, a number of studies 
showed unexpectedly high levels of TIMP-1 in malignant 
neoplasms (9,42,43). There is a great discussion as to 
whether the overall expression of MM P- 1 and TIMP-1 or 
the amount of noncomplexed MMP-1 could be critical in 
aggressive tumor development. This fact underlines the na- 
ture of tissue breakdown, reflecting the complicated net- 
work of selective and coordinated production of individ- 
ual proteinases . and inhibitors under normal and 
pathophysiological conditions. Thus, the invasive and 
metastatic potential of thyroid tumors depends on the lo- 
cal net level of active MMPs. 

The synthesis of MMP-1 and TIMP-1 is influenced by a 
variety of biochemical stimuli. The recent findings on 
MMP-J and TIMP-1 gene promoters are useful in under- 
standing the complex mechanisms implied in the regula- 
tion of MMP synthesis modulated by cytokines and tumor 
promoters (34,44,45). The promoter regions contain tu- 
mor promoter responsive elements (TRF) and binding mo- 
trfs for the transcription factor PEA-3, which are recog- 
nized by pioto-oncogenic transcription factors, such as the 
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FIG. 7. Comparison between (i) free/complex and (ii) T1MP- 
1 complied MMP-1 levels, and (in) TIMP-1 levels in super- 
narants of unstimulated 8505 C and HTh 74 cells, smd-ihy- 
rotd-d.er.ved fibroblasts after 24 hours using a (i) MMP-1 
x7ka» "cognizes total MMP-1 (sec Figure 5). The (ii) 
MMP-l/TIMP-1 assay recognizes MMP-l/nMP-1 complexes 
ie, activated MMP-1 that has subsequently been complied 
with the specific MMP inhibitor TIMP-1. The (iii) TIMP-1 
EUSA-recognizes total TIMPO, ie, free TIMP-1 and that com- 
plexed with MMPs. 



fos and jun family (45-47). 1L-J, TNF-a, and PMA Dp- 
regulate proto- oncogenes like fos and }un, resulting in the 
stimulation of MMP-1 and TIMP-1 (45,48). The action of 
(he cytokines is mediated by their specific receptors. In our 

S!U t >^!i L ' JR (typC 1 3 " d VP* »)> TNF-aR ( P 75 and P 55) 
and IFN-yR mRNAs were demonstrated in all investigated 
cell types. PMA and IL -1 were shown to elevate MMP-1 
and TIMP-1 in nearly all cell types investigated, thus con- 
firming the results of several studies on other epithelial cells 
(reviewed in refs. 1,7,4*). In :the majority of experiments, 
we found a concordant expression of MMP-1 and 71MP- 
1 after stimulation, possibly achieved by the coordinated 
anions of the nuclear transcription factors, although MMP- 
1 and TIMP-1 expression can also be independently or even 
reciprocally regulated (1). The effect of TNF-a was not as 
distinct as in the case of PMA and IL-1, although several 
investigators found a pronounced effect of TNF-cr particu- 
arly on TLMP-] secretion (4,34). In contrast to studies per- 
formed wuh other cell types (863,864,81*), IFN-y did noi 
influence MMP-1 and TIMP-1 expression in thyroid card- 
noma cell Imes. In summary, the involvement of ihe in- 
irathyroidal physiological and pathological cytokine mi- 
ooenvironmem in the reguJarion of MMP-1 and TIMP-1 
induction activation and inhibition ,s strongly suggested. 

Furthermore, the data demonstrate that regular human 
myrocyres did not produce MMP-1, even after powerful 
si.mulai.on with PMA. Investigating other mammalian cp- 
"hehal cells, only one study revealed the production of 



MMP-1 by rabbit corneal cells (21). It is yet not clear 
whether the MMP-1 mRNA detected in thyrocytes is due 
to a low level of constitutive transcription of the MJvlP-1 

f^x!J i,1CginmatC transcr 'P tion )> an existing pool of stable 
MMP-1 mRNA, or. in vitro induction of MMP-1 mRNA 
But it seems more likely that residual fibroblasts contained 
in the purified thyrocyte preparation (< 0.2%) art respon- 
sible for the slightly positive RT-PCR results. Another ex- 
planation could be that thyrocytes are indeed MMP-1 pro- 
ducers, but the ELISA detection system use.d was not 
sensitive enough to measure extremely low MMP-1 secre 
^tiT k D^^T'^ the discrc P ancv ^tw«„ elevated 
^^| mRNA ^Kkpf rhyiocytes and the extremely low 
HMP-1 proton secretion by these cells is difficult to ex- 
plain. Post-transcriptional regulatory events may be re- 
sponsible for this confounding result. 

Taken together, the present study suggests that the in- 
trathyroidal cytokine microenvironmenr is involved in the 
regulation of MMP-1 and its inhibitor 71MP-1 in the thy- 
roid, and that both proteins may be secreted by dediffer- 
entiated thyroid carcinoma cells and involved in aggressive 
thyroid tumors in vivo. 
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Expression of embryouic fibroncctin isoform EHIA parallels alpha-smooth 
muscle acttn in maturing and diseased kidney. 

Barnes VL . MusaJ, Mitchell RJ. Barnes JL 

Department of Medicine, Division of Nephrology, University of Texas Health Science 
Center, San Antonio, Texas, USA. 

In this study we examined if an association exists between expression of an alternatively 
spliced "embryonic" fibronectin isoform EIliA (Fn-EIUA) and alpha-smooth muscle 
actin (alpha-SMA) in the maturing and adult rat kidney and in two unrelated models of 
glomerular disease, passive accelerated anti-glomerular basement membrane (GBM) 
nephritis and Habu venom (HV)-induced proliferative glomerulonephritis, using 
iminunohistochemistry and in situ hybridization. Fn-EIIIA and alpha-SMA proteins were 
abundantly-expressed in niesangium and in periglomerular and peritubular, interstiti urn of 
20-day embryonic and 7-day (D-7) postnatal kidneys in regions of tubule and glomerular 
development Staining was markedly reduced in these structures in maturing juvenile (D- 
14) kidney and was largely lost in adult kidney. Expression of Fn-EttlA and alpha T SMA 
was reinitiated in the mesangiurh and the periglomerular and peritubular interstitium in 
both models and was also observed in glomerular crescents in anti-GBM nephritis. 
Increased expression of Fn-EIIIA mRNA by in situ hybridization corresponded to the 
localization of protein staining. Dual labeling experiments verified co-localization of Fn- 
EULA and alpha-SMA, showing a strong correlation of staining between location and 
staining intensity during kidney, development, maturation, and disease. Expression of 
ElfIA mRNA corresponded to protein expression in developing and diseased kidneys and 
was lost in adult kidney. These studies show a recapitulation of the co-expression of Fn- 
EIIIA and alpha-SMA in anti-GBM disease and suggest a functional link for these two 
proteins. 
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BMI-1 gene amplification and overexpressiou ia hematological 
maligaancics occur mainly in mantle cell lymphomas. 

Bea_S , Tort F, Pinyol M . Puig X , Hernandez L , Hernandez S . Fernandez PL . van 
Lohuizcn M , Colomer D. Cam no E . 

The Hematopathology Section, Laboratory of Anatomic Pathology, Hospital Clinic, 
Institut d'lnvestigacions Biomediques August Pi iSunyer.(IDIBAPS), University of 
Barcelona, Spain. 

The BMI-1 gene is a putative oncogene belonging to the Polycomb group family that 
cooperates with c-myc in the generation of mouse lymphomas and seems to participate in 
cell cycle regulation and senescence by acting as a transcriptional repressor of the 
INK4a/ARF locus. The BMl-1 gene has been located on chromosome I Op 13 , a region 
involved in chromosomal translocations in infant leukemias, and amplified in occasional 
non-Hodgkin's lymphomas (NHLs) and solid tumors. To determine the possible 
alterations of this gene in human malignancies, we have examined 160 
lymplioproliferative disorders, 13 myeloid leukemias, and 89 carcinomas by Southern 
blot analysis and detected BMI-1 gene amplification (3- to 7^fold) in 4 of 36 (1 1%) 
mantle cell lymphomas (MCLs) with no alterations in die INK.4a/ARF locus. BMl-l and 
p 1 6INK4a mRNA and protein expression were also studied by real-time quantitative • 
reverse transcription-PCR and Western blot, respectively, in a subset of NHLs. BMI-1 
expression was significantly higher in chronic lymphocytic leukemia and MCL than in 
follicular lymphoma and large B cell lymphoma. The four rumors with gene 
amplification showed significantly higher mRNA levels than other MCLs and NHLs with 
the BMM gene in germline configuration. Five additional MCLs also showed very high 
mRNA levels without gene amplification. A good correlation between BML 1 mRNA 
levels and protein expression was observed in all types of lymphomas. No relationship 
was detected between BMI-1 and pl61NK4a mRNA levels. These findings suggest that 
BMI-l gene alterations in human neoplasms are uncommon, but they may contribute to 
the pathogenesis in a subset of malignant lymphomas, particularly of mantle cell type. 

PMID: i 1289106 [PubMed - indexed for MEDLINE] 
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BMI-J Gene Amplification and Overexpression in Hematological Malignancies 
Occur Mainly in Mantle Cell Lymphomas 1 

Silvia Bea, Frederic Tort, Magda Pinyoi, Xavier Puig, Luis Hernandez, Silvia Hernandez, Pedro L. Fernandez, 
Maarten van Lohnizen, Dolors Colomer,* and Elias Campo 2 . 

■ 1066 CXA^ia*. Ne„*r,Spi. y. LJ " 1 ft> ""°" * IMmhr . O-rcmog^ab. Jh< Norland, Conor Insgn^r. 



Abstract 

The BMJ-J gene is a putative oncogene belonging to the Polycornb 
group family that cooperates with c-myc in the generation of mouse 
lymphomas and seems to participate in cell cycle regulation and senes- 
cence by acting as a transcriptional repressor of the JM4a/ARF Jocus. 
The BMJ-1 gene has been located on chromosome J Op] 3, a region involved, 
in chromosomal translocations in in/ant leukeroias, and, amplified in. 
occasional non-Hodgkin's lymphomas (NHLs) and solid tumors. To de- 
termine Ibe possible alterations of this gene in human malignancies, We 
have examined J 60 lymph oproJiferative disorders, 13 myeloid leukeroias, 
and 89 carcinomas by Southern blot analysis and detected BMJ-J gene 
amplification (3- to 7-fold) in 4 of 36 (11%) mantle re)l lymphomas 
(MCLs) with no alterations in the JNX4a/,1JtF locus. BMI-J and pj6 INK '" 
mRNA and protein expression were also studied by real-time quantitative 
reverse Iranscription-P.CR and Western blol, respectively, in a subset of 
NWLs. BMM expression was significantly higher in chronic lymphocytic 
leukemia and JVJCL than in follicular lymphoma and large B cell lym- 
phoma. The four tumors with gene amplification showed significantly 
higher mRNA levels than other M.CLs and NHLs >vith the BMJ-J gene in 
gcrmline configuration. Five additional MCLs also showed very high 
mRNA levels without gene amplification. A good correlation between 
BMJ-J mRNA levels and protein expression was observed in all types of 
lymphomas. No relationship was detected between BMM and pl6 ,rvK ** 
mRNA levels. These findings suggest that BMJ-J gene alterations in 
human neoplasms are uncommon, but tbey may contribute to the patho- 
genesis in a subset of malignant lymphomas, particularly of mantle cell 

introduction 



JNK4a/ARF locus by acting as an upstream negative regulator of 
pl6 WK4a and P 14/pI9 ARF gene expression (5). The human BMJ-J 
gene has been mapped to chromosome 10pl3 (6), a region inyolved.in 
chromosomal translocations in infant leukemias (7) and rearrange- 
ments in malignant T cell lymphomas (8, 9). More recently, higb-Ieve) 
DNA amplifications of this region have been found by comparative 
genomic hybridization in NHLs and solid tumors (10, 11). However,, 
the possible implication of the BMI-J gene in these alterations and its 
role in the pathogenesis of human tumors is not known. The aim of 
this study was to analyze- the possible BMI-J gene alterations, arid 
expression in a large series of human neoplasms and to determine ihe ' 
relationship with JNX4o/ARF locus aberrations. 

Materials and Methods 



The BMJ-J 3 gene is a pulative oncogene of the Polycornb group 
originally identified by retroviral insertional mutagenesis in Eji-c- 
myc transgenic mice infected with the Moloney murine leukemia 
vims (1, 2). These animals had a rapid development of pre-B cell 
lymphomas showing frequent provfral insertions near the BMJ-J gene. 
This integration resulted in BMM overexpression. suggesting a coop- 
erative effect between C-MYC and BMJ-J genes in the development of 
these tumors (3, 4). Recent shidies have indicated that the BMJ-J gene 
may also participate in cell cycle control and senescence through the 
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Case Selection. A series of 2.62 human tumors, including 173 hematolog- 
ical malignancies and 89 carcinomas (Table J), matched normal tissues from 
all carcinomas, \\ samples of normal peripheral mononuclear cells, and 5 . 
reactive lymph nodes and tonsils, were selected based on the availability of 
fro2en samples- for molecular analysis. 

DNA Extraction and Southern Blot Analysis. Genomic DNA was ob- 
tained using Proteinase K/RNase treatment. 15 ftg were digested with EcoRl 
and Hind]]) restriction enzymes (Life Technologies, Inc., Gailhersburg, MD), 
for Southern blol analysis and hybridized with a 1.5-kb Pst\ fragment of the 
partial BM1-I cDNA (6). 

RNA Extraction and Real- rime Quantitative RT-PCR. Total RNA was 
obtained from 67 lymphoid neoplasms (10 CLLs, 27 MCLs, 8 FLvand 22 
LCLs) using guanidine/isothiocyanate extraction and cesium/chloride gradient 
cenrrifugation. One M g of total RNA was transcribed into cDNA using 
MMLV-reverse transcriptase (Life Technologies, Inc.) and random hexamers, 
following manufacturer's directions. Sequences of . the BMJ-J and the pJ6 
detection probes and primers were designed using the Primer Express program 
(Applied Biosystems, Foster City) as follows: BMJ-J sense, 5'-CTGGTTGC- 
CCATTGACAGC-3'; .BMJ-J antisense, 5'-CAOAAAATGAATGCGAO- 
CCA-3': pi 6 sense, 5 '-CA A CGCACCG A A TAGTTACGG-3'; pI6 antisense. 
5 / -AACTTCGTCCTCCAGAGTCGC-3 f . The probes BMJ-J, 5'-CAGCTC- 
GCTTCAAGATGGCCGC-3', and^;<5; 5' CGGAGGCCGATCCAGGTGG- 
GTA-3', were labeled with 6-ca(boKy-nuorescein as the reporter dye. The 
TaqMan-GAPDH Control Reagents (Applied Biosystems) were used lo am- 
plify and detect the GAPOH gene., as recommended by the manufacturer. The 
ouantilarivr. 3SSay amplified 1 M ml of c DNA in two to four replicates using the 
primers and probes described above and the standard master mix (Applied 
Biosystems). All reactions were performed in an AB1 PRISM 7700 Sequence 
Detecior System (Applied Biosystems). GAPDI1, BMM, and pie™ 4 * ex- 
pression was related to a standard curv e derived from serial dilutions of Raji 
cONA. The KUs of BMM and p^™" 3 expression were defined as the 
mRNA levels of these genes normalized to the CADPI1 expression Jevel in 
each case. 

Protein Analysis. Whole-cell protein extracts were obtained from addi- 
tional fiozcn tissue available in 31 cases (7 CLLs. 12 MCLs, 8 FLs, and 4 
LCLs), loaded onto a 10% SDS r olyacryl;.tnide. gel. and elect/oMoiie-d to a 
nitroccllulo.se membrane (Amosham). Blocked membranes were incubated 
sequentially with the monoclonal :,ntibody DMI-F6 (12), aniimouse conju- 
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Tabic I Hematological malignancies and solid tumor samples analysed for BMJ- 1 
gene alterations 



Tissue samples No. of cases 



Hematological malignancies 

Hodgfein's disease 2 
B cell lymphoproliferative disorders 

B- Acute lymphoblastic leukemia M 

CLL 29 

Hairy cell leukemia 4 

FL J5 

MCL 36 

LCL 40 
T ceil rymphoproliferatrvc disorders 

T-Acule lymphoblastic leukemia 8 

targe granular cell leukemia 4 

Peripheral T-cell lymphoma g 
Myeloproliferative disorders 

Acute myeloid leukemia 7 

Chronic myeloid leukemia .5 
Solid tumors 

Colon carcinoma 26 

Breast carcinoma 29 

Laryngeal squamous cell carcinoma 34' 

Total ' 262 



gated to horseradish peroxidase (Amersbam), and detected by enhanced chemi- 
luminescencc (Amersham) according lo the manufacturer's recommendations. 
Statistical Analysis. Because of the non-normal distribution of the samples 
. and ibe small. size of some subsets of tumors, tbc statistical evaluation Was 
performed using nonparamelric tests (SPSS, version 9-0). Comparison between 
mRNA expression levels in the different groups of NHLs was performed using 
Ibe Kruskal-WaMis Test, with a P for significance set al 0.05. For differences 
between particular groups, the conservative Bonfenoni procedure was per- 
formed, and the P was set al 0.005. The remaining statistical analyses were 
carried out using the Mann- Whitney nonparamelric U test (significance, P 
<0.05). The comparison between BMI-1 and pie 1 ™-* quantitative mRNA 
levels was also performed using the Pearson's correlation coefficient. 

Results 

BMI-1 Gene Amplification. Tbc BMI-1 gene was examined by 
Southern Wot in a large series of human tumors and normal samples 
(Table 1). The cDNA probe used in the study detected three EcdKX 
fragments of 7.3, 3:8, and 2,6 kb and three HmdlTI fragments of 6.2, 
4, and 3.5 kb. BMI-1 gene amplification (3- lo 7- fold) was detected in 
4 of 36 (11%) MCLs (Fig. 1). The amplifjca lions were con finned with 
both restriction enzymes. The amplified MCLs were two Mastoid and 
two typical variants. No amplifications were observed in any of the 
solid tumors when compared with their respective matched non- 
neoplastic mucosa. No BMI-1 gene rearrangements were observed in 
any of the samples examined. 

BJVjM mRNA Expression. To determine the BMI-1 expression 
pattern in NHL we analyzed BMM mRNA levels by real- time quan- 
titative RT-PCR in 67 lymphomas (10 CLLs, 27 MCLs, 8 FLs, and 22 
LCLs), including the four tumors with gene amplification. A distinct 
BJVJJ-1 mRNA expression pattern was observed in Ihe different types 
of lymphomas (Fig. 2; Krusfcal- Wallis Test; P < 0.001). The BM1 
mRNA levels in CLLs (mean, 2.2 RU; SO, 1.3) and MCLs with no 
BMI-1 gene amplification (mean, 2.5 RU; SD, 2.3) were significantly 
higher than in FLs (mean, 0.9 RU; SD, 0.8) and LCLs (mean, 0.6.RU; 
SD, 0.4; Mann- Whitney nonparamelric V lest; P < 0.01). The 4 
MCLs with BMI-1 gene amplification showed significantly higher 
levels of expression than all other groups of tumors (mean, 5.1 RU; 
SD, 1 .6; P < 0.005). In addition, five typical MCLs with no structural 
alterations of the gene also showed very high levels of BMJ- 1 mRNA 
expression ranging from 4 to 9.8 RU, similar to cases with gene 
amplification (Fig. Z-1). 

BMM Protein Expression. BMM protein expression was exam- 2 
ined by Western blot in 31 tumors (7 CLLs: 1 7 MCLs, including two I 
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cases with BMI-1 gene amplification and 4 cases with mRNA over- 
expression and no structural alteration of the gene; 8 FLs, and 4 LCLs) 
in which additional frozen tissue was available. The monoclonal 
antibody 3gainsl BMM detected three closely migrating proteins of 
M t 45,000-48,000 (2). The two more slowly migrating bands prob- 
ably represent pbosphorylated isoforms of the protein (12). The two 
MCLs with gene amplification and three of four cases with mRNA 
overexpression without amplification of the gene showed very high 
levels of protein expression. The remaining MCLs and CLLs showed . 
intermediate levels of protein expression, whereas low- or no-expres- 
sion signals were detected in the LCLs and FLs included in the study 
(Fig. 3). These results indicate that BMM protein expression in NHL 
is concordant with the mRNA levels observed by real-time quantita- 
tive RT-PCR. 

Relationship between BMM and pl6 ,NK4 * Gene Alterations. 
The INK4a/ARF locus has been recently identified as a downstream 
target of the transcriptional repressing activity of ihe BMI-1 gene, 
suggesting thai this gene may contribute to. human neoplasias with 
wild type 1NK4/ARF (5). Most of the lyrnpboproliferalive disorders 
analyzed in the present study, including the four cases with BMI-1 
gene amplification, had been previously examined for p53 gene mu- 
tations and INK4a/ARF locus alterations, including gene deletions, 
mutations, hypermethyJation, and expression (13, 14). The four MCLs 
with BMI-1 gene amplification and mRNA overexpression and the. 
five tumors, with BMM mRNA overexpression with no structural 
alterations of the gene showed a wild- type configuration of the 
INK4o/ARF locus (13). However, one case with BMI- I gene ampli- 
fication and one case with mRNA overexpression with no alteration of 
the gene showed p53 gene mutations associated with allelic deletions. ■ 

To detennine the possible relationship between BMM and 
pJ6 IKK4a mRNA expression, p!6 JNK4a mRNA levels were evaluated 
by real-time quantitative RT-PCR in 50 tumors (10 CLLs, 27 MCLs, 
and 13 LCLs), including 6 cases with alterations in the lNK4a/ARF 
locus (2 MCLs and 1 LCL with />7<S )NK4a gene deletion, 2 LCLs with 
pi 6 promoter hypermethylalion. and 1 CLL with p7d ,NK4a gene 
mutation), and ihe 4 lymphomas with BMhl amplification. Negative 
or negligible levels of pJ^ 43 were observed in Ihe 6 rumors with 
INK4a/ARF locus alterations. These cases were not included in the - 
comparisons between BMM and pI6 INK4a "niRNA expression. The 
p , 6 )NK4, expjession j eveJs W£rc relatively similar in Ihe different 
types of tumors. Only LCLs tended to have lower levels of expression, 
but the differences did not reach statistical significance (Fig. IB). No 
differences were observed in the pJ6 INK4a mRNA levels between 
tumors with BMI-1 gene amplification and overexpression and lym- 
phomas wilh germline configuration of ihe gene. 



MCX MCL* LCL FX 




Fig. I. Somber* bloi analy:.;* of BMI- J t -cnr. Font MCI J [MCL *) :>howed BMI- 1 gene 
■rtpliiic 3iion (3- to 7-fui(J) i-fmpaii'J wiih non-ncopbsiic tissues (;V>and oihcr N'Hjt.v "No 
nptiJicuiif>ni or cenr nanwijTmoib woe iicicrird in ibe remaining NHLs 3»tl carri- 
jmns iniludii) in the Hwi\. 
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Fig. 2. A, quantitative BMl-l roRNA transcript analysts (median and range) using 
real- lime RT-PCR in a scries of NHLs. MCLs with BMJ-J gene amplification {MCL*} 
revealed significantly higher ovciall BM1-J mRNA levels than all other types of NHLs. 
including MCLs with no srnx; rural alterations of the gene (7* < 0.005). MCLs and CLLs 
expressed significantly higher levels than FLs and LCLs (P < 0.00)). Res oils are depicted 
as the ratio of absolute BMM:GADPH roRN'A ttanscripr numbers (KU). Bors r SD. B, 
quantitative pI6 ,NK4 * mRNA transcript analysis (median and range) using real-time 
RT-PCR in a series of NHLs, Expression levels were relative!)* similar in the different 
types of tumors. Results are depicted as the ratio of absolute p)6 ,NK ' l *:GADPH mRNA 
transcript numbers (RU). Bars, SD. 



Discussion 

In Ihe present study, we have examined a large series of human 
tumors for the presence of gene alterations and mRNA expression of 
the BMJ-I gene. Gene amplification was identified in four MCLs. 
These rumors showed significantly higher levels of roRNA and pro- 
tein expression compared with other lymphomas with BMI-I in germ- 
line configuration. BMI-J expression levels were also highly up- 
TCgulated in a subset of MCLs with no apparent sirucrural alterations 
of the gene. No alterations were delected in any of the different types 
of carcinomas included in the study. BMI-J is considered an oncogene 
belonging to the Poly comb, group family of genes. These proteins 
mainly act as transcriptional regulators, controlling specific target 
genes involved in development, cell differentiation, proliferation, nnd 
senescence. Different studies have shown the implication of BM1- 1 
over expression in the development of lymphomas in murine nnd 
feline animal models (3. 4). The findings of the present study indicate 
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for the first time that BMI-J gene alterations in human neoplasms are 
an uncommon phenomenon, but they seem to occur mainly in a subset 
of NHLs, particularly of mantle cell type. 

The human BMI-J gene has been mapped to chromosome J Op 13. 
High-level DNA amplifications and gains in this region have been 
identified by comparative genomic hybridization in occasional solid 
tumors and NHLs (1G\ II). Different chromosomal translocations 
involving the 10pl3 region have also been identified in infant leuke- 
mias and T cell lymphoproliferative disorders (7, 8, 15). Most acute 
leukemias with this chromosomal alteration occur in children <J2 
months of age, whereas it seems to be extremely rare in adults. J Op 
translocations in T-cell lymphoproliferative disorders have been ob- 
served mainly in adult T cell leukemia/Iymphomas and occasional' 
cutaneous T cell lymphomas. In our study, we did not observe BMI-J 
. rearrangements or amplifications in any of the acute Jeukemias or T 
cell lymphomas. However, all of the acute Jeukemias in this study 
were diagnosed in patients over J 6 years, and no adult T cell leuke- • 
mia/)ymphomas or cutaneous lymphomas could be included in the 
series. Similarly, high-level DNA amplifications at the 10p)3 region 
have been detected in head and neck carcinomas and other solid 
tumors. Although we found no. evidence for BMI-J gene rearrange- 
ments or amplifications in a substantial set of carcinomas, this does 
not exclude the possibility of increased gene expression or protein 
levels in these rumors. Additional studies are required to elucidate the 
possible involvement of BM1-I in these particular groups of human 
neoplasms. - 

In human hematopoietic cells, BMI-I is preferentially expressed in 
primitive CD34-t- bone marrow cells, whereas it is negative or very 
low in more mature CD34- cells (16). In peripheral lymphocytes, and 
particularly in follicular B cells, BMI-J protein expression has been 
detected in resting cells of the mantle zone, whereas it is down- . 
regulated in proliferating germinal center cells (17, 18). These obser- 
vations indicate that BMI-J expression in normal hematopoietic cells 
is tightly regulated in relation with cell differentiation in bone marrow' 
and antigen-specific response in peripheral lymphocytes. BJvll-1 ex- 
pression in human rumors has not been examined previously. In this 
study, we have demonstrated that BM1 : 1 mRNA and protein expres- 
sion show a distinct pattern in different types of lymphomas. Thus, 
BM1- 1 levels were low in LCLs and FLs and significantly higher in 
JVlCLs and CLLs. These findings suggest that BMM expression 
patterns in B cell lymphomas maintain in part the expression profile 
of their norma) celt counterparts; because FLs and at least a subgroup 
of LCLs are considered lymphomas derived from follicular germinal 
center cells, whereas MCLs and CLLs are tumors mainly derived from . 
naive pregerminal center cells. However, the four MCLs with BMI-I 
gene amplification expressed significantly higher mRNA levels than 
all other tumors. In addition, five MCLs with no structural alterations 
of the gene showed high mRNA levels similar to those observed in 
mmors with BMI-J gene amplification, suggesting that other mecha- 
nisms may be involved in up- regulation of the gene in these lympho- 
mas. Different studies using animal models have shown a dose- 
dependent effect of BMI-I gene expression on skeleton development 
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and lymphomagenesis (|. r 3). These observations suggest that the high 
mRNA and protein levels detected in a subset of MCLs may play a 
role in the pathogenesis of these neoplasms. . : 

Recent studies have identified the JNK4/ARF locus as a down- 
stream target of the BMJ-1 transcriptional repressor activity, suggest- 
ing that BM1-) overexpression may contribute to human neoplasias 
that retain the wild-type JNK4a/ARF locus (5). Interestingly, in our 
study, BMJ-J amplification and overexpression appeared in tumors 
with no alterations in pJ6™ A * and pJ4 AKF genes. However, we could 
not detect differences in the expression levels of pie 1 *""* in rumors 
with arid without BMJ-J gene alterations. The reasons for this apparent 
discrepancy with experimental observations are not clear. One possi- 
bility may be that genes other than JNK4a/ARFzre the main targets of 
BMI- 1 repressor activity in these tumors. Particularly, different genes 
of the HOX family are regulated by BMI- 1 and may also be involved 
in Jymphomagenesis (19, 20). 

In conclusion, the findings of this study indicate that BMJ-J gene 

. expression is differentially regulated in B cell lymphomas. Alterations 
of the gene seem to be an uncommon phenomenon in human nco- 

- plasms, but they may contribute to the pathogenesis in- a subset of 
MCLs. Although, BMJ-J gene alterations occurred in rumors with 
wild-type JNK4a/ARF locus, the possible cooperation between these 
genes and the oncogenic mechanisms of BMJ-1 in human neoplasms 

require additional analysis. 
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Hislopatbology is insufficient to predict disease progression and clinical outcome in lung adeno- 
carcinoma. Here we show that gene-expression profiles based on microarray analysis can be 
used to predict patient survival in early-stage lung Ddenocarcinomas. Genes most related to sur- 
vival were identified with univariate Cox analysis. Using either two equivalent but independent 
training and testing sets, or 'leave-one-out' cross-validation analysis with all tumors, a risk index 
based on the top 50 genes identified low-risk and high-risk stage I lung adenocarcinomas which 
differed significantly with respect to survival. This risk index was then validated using an inde- 
pendent sample of lung adenocarcinomas that predicted high- and low-risk groups. This index 
included genes not previously associated with survival. The identification of a set of genes that 
predict: survival in early-stage lung adenocarcinoma allows delineation of a high-risk group that 
may benefit from adjuvant therapy. 



Lung cancer jemains the leading cause of cancer death in indus- 
trialized countries. Most patients with non-small cell lung can- 
cer (NSCLC) present with advanced disease, and despite recent 
advances in multi-modality therapy, the overall 10-year survival 
rate remains a dismal 8-70%V However, a significant minority of 
patients {-25-30%) with NSCLC have stage I disease and receive 
. surgical intervention alone. Although 35-50% of patients with 
stage I disease will lelapse ivithin 5 years 7 ' 4 , it is not currently 
possible to identify specific high-risk patients. 

Adenocarcinoma is currently the predominant histological 
subtype of NSCLC (rels 1,5,6). Although morphological assess- 
ment of lung carcinomas can roughly stratify patients, there is a 
need to identify patients al high risk lor recurrent oi metastatic 
disease. Preoperative variables that affect survival of patients 
with N5CLC have been identified 7 ' ,0 . Tumoi size, vascular inva- 
sion, poor differentiation, high tumor-prolifeiative index and 
several genetic. altera lions, including K-ras (refs. 11,3 2) and p53 
(rels. 10,13) mutations, have prognostic significance. Multiple 
independently assessed genes oi gene products have also been 
investigated io better predict patient prognosis in lung tan- 
cer" Technologies that simultaneously analyze the expression 
of thousands ol genes" i an be used to cor relate gene- express! on 
patteihs with numerous clinical paiameters— including patient 
outcome— to better predicl tumor behavior in individual pa- 
tients™. Analyses of lung cancers using array technologies havr 
identified subgroup* of lumors that differ according to lumen 
type and histological subclasses and, to a lesser extern, sujvivaJ 
among adenocarcinoma patients 71 -". Here w-e correlated gent- 
ex pi essi on prolilrs with clink a I outcome in a cohoit oi pa tie/) is 
with lung adenocarcinoma and identified specific genes \lu\ 



predict survival among patients with stage 1 disease. For further 
validation, we also show that the risk index predicted survival in 
an independent cohort of stage I lung adenocarcinomas. 

Hierarchical profile clustering yields three tumor subsets 
Using oligonucleotide arrays, we generated gene-expression pro- 
hies lor 86 primary lung adenocarcinomas, including 67 stage 1 
and 19 stage 1)1 tumors, as well as 10 non neoplastic lung sam- 
ples. Selected sample replicates showed high correlation among 
coeflicients and reliable reproducibility. We determined tran- 
script abundance using a custom algorithm and the data set was 
trimmed of genes expressed at extremely low levels, that is, 
genes were excluded if the measure of their 75th percentile value 
was less than 100. Although potentially resulting in the loss ol 
some information, trimming in this manner decreased the possi- 
bility that the clustering algorithm would be strongly influenced 
hy genes with little or no expression in these samples. 
Hieiairhif al clustering with the resulting 4,966 genes yielded 3 
clusters of tumors (Fig. ]). All 10 non-nebplaslic samples clus- 
tered rightly together within Cluster. 1 (data not shown). We ex- 
amined (he relationships between cluster and patient and tumoi 
characteristics (Fig. 1 and Supplementary Figure A online). There 
were associations between cluster and stage [P - 0.030) and be- 
tween cluster and differentiation [P = 0.01). Cluster J contained 
ihe greaiesi percentage (-12.8%) of well differentiated tumors, 
followed by Cluster 2 (27%) and Cluster 3 (4.7%). Cluster 3 con- 
tained the highesl percentage of both poorly dilleienitated 
■ -i/.tV.f.) and stage III turnots (42.8%), yet contained 3 |M.3%) 
moderately dilleientiated and } (5%) well ditleientiaied stage I 
turner. Notably, 1 1 stage J turnois were present in Cluster 3, sug- 
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gesting 3 common gene-expression profile for 
this subset of stage I and stage JII tumors. . 

For patients with stage ! and stage 1JI tumors, 
the average ages were 68.3 and 64.5 years and 
the percentage of smokers was 88.9% and 
89-5% r respectively. Marginally significant as- 
sociations between cluster and smoking his- 
tory were observed {P = 0.06). A significant 
relationship between hist opathoJogicaJ classifi- 
cation and cluster was only discernible for 
bronchioloarveolai adenocarcinomas {BAs), 
which were only present in Clusters 1 and 2 
(P= 0.0055). and comprised 357% and 32.3% 
of tumors for Clusters 1 and. 2, respectively. 

We examined the heterogeneity in gene-ex- 
pression profiles based on the trimmed data.set among normal 
lung samples and stage I and stage HI adenocarcinomas by calcu- 
lating correlation coefficients between all pairs of samples. In 
contrast to normal lung samples that displayed highly similar 
gene-expression profiles (median correlation, 0.9), both stage I 
and 11) lung tumors demonstrated much greater heterogeneity in 
their expression profiles with lower correlation coefficients (me- 
dian values, 0.82 and 0.79, respectively). 

Northern-blot and immunohist ©chemistry analyses 
Of the 4,966 genes examined, 967 differed significantly between 
stage I and 111 adenocarcinomas, a number in excess of that ex- 
pected by chance alone (248 at alpha level (a) = 0.05). Three 
genes were arbitrarily selected to verify the microauay expression 
data. The mRNA from 20 of the normal lung and tumor samples 
was examined by northern- blot hybridization with probes /or in- 
sulin-Iike growth factor-binding protein 3 {JGFBP3), cyst at in C 
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Fig. 1 Unsupervised classification analysis of lung adenocarcinomas. 3 dasses of tumors identi- 
fied by agglomerative hierarchical clustering ol gene-expression profiles using the 4,966 expressed 
genes. Patient and histopatho(ogita) information for each lung adenocarcinoma case by cluster 
designation and methods for K-ros 12/13uYcodon mutational status and nuclear p53 protein ac-. 
cumulation are provided (Supplementary Figure A online). TN classification denotes information 
regarding patient tumor si2e and nodal involvement. Associations between cluster membership 
and patient or hiMopathotogical variables are indicated at significance level (P< 0.05). 



arid lactate dehydrogenase A (LDH-A) (Fig. 2a). Two gene probes 
not represented on the micioarrays were used as controls, includ- 
ing histone H4, a potential index of overall cell proliferation, and 
28S ribosomal RNA, a control for sample loading and transfer. 
The relative amounts of 1GFBP3, cystatin C and.JLDH-yl mRNA 
strongly correlated with microarray-based measurements (Fig. 
2b). In both assays, 1GFBP3 and WH-A mRNA levels increased 
from stage I to stage III adenocarcinomas and were higher than 
those in normal lung. Cystatin C mRNA levels were moje variable 
but relatively greater in normal lung than tumors. These results 
suggest that the oligonucleotide microarrays provided reliable 
measures of gene expression. The tumors showed slightly greater 
histone H4 expression than the normal lung, likely reflecting in- 
creased proliferation ol tumor cells. 

Immunohistochemistry was performed for IGFBP3, cystatin C 
and H5P-70 to determine whether mRNA over expression was re- 
flected by an inciease ol their corresponding proteins in tumors. 
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fig. 2 Validation analyses of gene-e*pres- 
sion profiting, n. Norther n- blot analysis of 
selected candidate genes foi verification of 
data obtained from oligonucleotide arrays.- ticnhvtn 
The same sample RNA lor the 4 uninvotved 
lung, 8 stage I and 8 stage III tumors was 

used lot the northern-blot and otigonuc leotrde on ay analyses. 
b, Correlation analysis ol quantitative data obtained horn oligonucleotide 
arrays and northern blots measuied by integrated phosphor imager based 
signals loi the IGIBP3 and IDH- A genes, "the ratio ol IGfBPJ. cystatin C 
and I Oil- A mRNA to ?85 iRNA was determined, the relative values lor 
each gene from each sample are >hown. n, non- neoplastic normal lung; 
1, stage I tumors; 3, stage til tumors, t, IrnmimohiMDc hemit ;>) analysts o( 
lCIBP-3, HiP-70 and cyslalin C in lung and lung adrnoc arc inomaj. 
Cytoplasmic ICI BP- .3 immunoi eac irvity in a neoplastic f)land (lurnr.-r I 7'j) 
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with prominent apical staining (blue reactant staining, onow, upper led). 
Ditluse cytoplasmic HSP-70 immunoieac tivily (tumor l?7), yrt stromal el- 
ements show no reac hvity (upper right). Normal lung parenchyma (lower 
felt) shows cyloplasmic <ystatin C immunoreac tivrty in alveolar pneumo- 
tytes (arrow) and intra alveolar macrophages but tumor (190) shows dif- 
fuse cytoplasmic cystatin C immune-reactivity with piominenl apical 
Mzrinmq (tower tigM). Magnili( ation, >700 
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Jmmunoreactivity for both 1GFBP-3 and HSP-70 (Fig. 2c) was de- 
tected in the cytoplasm of the adenocarcinomas, with little de- 
tectable reactivity in the stromal oi inflammatory cells. Cystatin 
C was detected in alveolar pneumocytes and rntxa-alveorai 
macrophages in non neoplastic lung parenchyma and also con- 
sistently j/i the cytoplasm of neoplastic cells. 

Cene-e*pression profiles predict survival 

As expected, Kaplan-Meier survival curves (Fig. 3a) and log-rank 
tests indicated poorer survival among stage III compared with 
stage I adenocarcinomas (P = <O.000J). Two statistical ap- 
proaches were used to determine whether gene- expression pro- 
files could p.edict survival using the data set of 4,966 genes. In 
one approach, equal numbers of randomly assigned stage I ami 
Mage III tumors constituted naming (n = 43) and testing (n = 43) 
sets. In the training set, the top JO, 20, 50 or 75 genes were used 
lo r.ente risk indices that were evaluated for their association 
with survival using the 50th, 60th or 70th percentile cutoff 
points to categorize patients into high oi low g.oups. The results 
were similar across cutoll points but the 50- gene risk index had 
the best overall association wiih survival in the training set 



F»g. 3 Cene-expression profiles and patient survival, o. Relationship be- 
tween tumor stage and patient survival (stage 1 and stage 3 differ signifi- 
cantly, P< 0.0001). i>. Relationship between the survival. in the 43 test 
samples and their risk assignments based on the 50-gene risk index esti- 
mated in the 43 training samples. The high- and low- risk groups differ sig- 
nificantly </> = 0.024). c, Relationship between patient survival and the risk 
assignments in test samples (in ^conditional for tumor stage. The high- 
and low-risk stage I groups differ significantly (P ~ 0.028), whereas stage 111 
low- and high-risk groups did not (/>= 0.634). rf, Relationship between sur- 
vival in the test cases and their risk assignments based on the.86 leave-one- 
ouf cross-validation of the 50-gene risk index. The high- and low-risk 
groups differ significantly (P= 0.0006). r, Relationship between test case's 
risk assignment and survival (in d) conditional on tumor stage. The high- 
and low-risk stage I lung adenocarcinoma groups differ significantly from 
each other (P = 0.003), whereas low- and high-risk stage III tumors do not. 
/, Relationship between tumor class identified by hierarchical clustering and 
patient survival. Survival for patients in Cluster 3 differed relative to the tu- 
mors in Cluster 2 (P= 0.037) and approached significance for Cluster 1 and 
2 combined (P= 0.06). g, Analysis of the Michigan- based risk index using 
top cross-validated survival genes identify a low- and high-risk group in an 
independent cohort of B4 Massachusetts- based lung adenocarcinomas that 
are significantly different (P= 0.003). A, Among the 62 stage I lung adeno- 
carcinomas in the Massachusetts sample, the high- and low-risk groups dif- 
fered significantly (p = O.006). 



After conservatively choosing the 60th percentile cutoff point 
from the training set, we then applied this risk index and cutoff 
point to the testing set. The risk index of the top 50 genes cor- 
rectly identified low- and high-risk individuals within the inde- 
pendent testing set (P , 0.024) (Fig. 3b and Supplementary 
Methods online). Notably, 11 stage J tumors were included in 
the high-risk subgroup. When this risk assignment was then 
conditionally examined for stage progression (Fig. 3c), low- and 
high-risk groups among stage 1 tumors were found to differ (/> = 
0.028) in their survival. 

Identification of a robust set ol survival genes 
Although predictive of patient survival, a single training-testing 
set may not provide the most robust set of genes due to random 
sampling issues. Therefore, a leave-one-oul* cross-validation ap- 
proach Was used to identify genes associated with survival from 
all 86-tnmox samples. We first developed a 50-gene risk index in 
each training set, and then applied the risk index to the test case 
held out from the hill set of tumors and assigned the held out 
tumor to the high- or low-iisk groups (Fig. 3d). The high and 
low-risk subgroups determined in the test cases differed signifi- 
cantly in their overall survival (P = 0.0006). Among the larger 
group of stage I lung adenocarcinomas, the Jow-jisk (n = 46) and 
high risk [n = 2J) groups had markedly different survival (p - 
0.003) (Fig. 3e). Table ] lists selected examples of the cumulative 
top )00 genes derived from this cross-validation procedure 
(complete list in Supplementary Table A online). 

It was also noted that many of the stage f palienls in the high- 
nsk subgroup {Tig. 3,<) were present in Cluster 3 (Fig. 1). 
Kaplan-Meier analysis |Fig. 3/) dembnstrated a significantly 
worse survival </> = 0.037) lor patients in Cluster 3 relative to pa- 
tients in Cluster 2 and approaching significance for Cluster 1 
and 2 combined iP = 0.06). This further indicates the important 
relationship between gene .expression profiles and patient sur- 
viva), independent of disease stage. 

Consistent with previous analyses of lung adenocarcinomas^ 
40% o) stage I and 57. S% of stage 111 tumors had 72lh or *3lh 
rodon K-nis gene mutations. Those patients will, tumors con- 
taining K - /r.s mutations showed a trend of poorer survival, but 
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Table 7 Selected examples of the top 1 00 genes from cross-validation 


Cene name 


P 


% 


P 


% Change in 


Coefficient 


Unigene comment 




(normal versus . 


Change in tumor 


(stage 1 versus 


stage HI 


^ 






tumor t-test) 




stage 1)1 Mest) 




















Apopto sis- related ' 


CASP4 . 


0_56 


-6% 


0.02 


57% 


0.0022 


Caspase A, apoptosts- 














related cysteine protease 


P63 


9 73E-04 


37% 


0.03 


43% 


0.0010 


Transmembrane protein (63 IcDJ^ 














endoplasmic reticulum/ 














Colgi intermediate compartment 














Cell adhesion and structure 


KR77 


8.O2E-08 


126% 


o.n 


55% 


0.0003 


Keratin 7 


IAMBI 


0.14 


-20% . 


0.01 


60% 


0.0027 


Laminin, pi 














Cell cycle and growth regulators 


BMP2 


0.54 


-21% 


0.27 


47% 


0.0044 


Bone morphogenetic protein 2 


CDC6 


1.3U-05 


1070% 


0.05 


148% 


0.0124 


CDC6 (cell division cycle 6, 














Socchpromyces cerevhioe homolog) 


SIOOP 


2.10F.-08 


1572% 


0.19 


77% 


0.0001 


SI 00 c ale rum- binding pr ot ern P 


SERPINE1 


2.89E-03 


72% 


0.25 


30% 


0.0008 


Serine (or cysteine) proteinase inhibitor. 














clade E{nexin). 


5TX1A 


8.65E-08 


54% 


0.07 


26% 


0.0031 


Syntax in 1A (brain) 














Cell signaling 


ADM 


0.05 


39% 


0.04 


117% 


0.001 6 


adrenomedullin 


AKAP12 


8.53E-03 


-47% 


0.05 


214%. 


0.0010 


A kinase (PRKA) anchor protein (gravin) 1 2 


ARHE 


0.06 


-39% 


0.05 


87% 


0.0092 


ras homolog gene family, member E 


CRB7 


2.02E-03 


38% 


0.63 


15% 


0.0030 


Growth factor receptor-bound protein 7 


VECf 


6.50E-08 


. 174% 


0.02 


85% 


0.0013 


Vascular endothelial growth factor 


WNT10B 


0.05 


31%' 


0.48 


20% 


0.0022 


Wingless- type MMTV integration site family. 














member 1 0B 














Chaperones 


H5PA8 


0.36 


8% 


9.01 E-04 


51% 


0.0008 


Heat-shock 70 kD protein 8 












■ 


Receptors 


ERBB2 


0.04 


92% 


0.37 


1 20% 


0.001 3 


v-eib-b2 avian erythroblastic leukemia viral 














oncogene homotog 2 


FXYD3 


0.10 


111% 


0.31 


73% 


0.0046 


FXYD domain-containing ion transport 














regulator 3 


SLC20A1 


1.34E-03 


58% 


0.02 


66% 


0.0021 


Solute carrier family 20 (phosphate 














transporter), member 1 














En2ymes y cellular metabolism 


C5T B 


1 57E 04 


50% 


0 15 


34% 


0 0001 


C vnt-illn R fttotin R\ 
V.Y3lt/UU D Cl >» ' Dj 


CTSL 


0.48 


-10% 


0.03 


67% 


0.0007 


Cathepsin L 


CYP24 


3.16E-06 


N/A 


0.97 


2% 


0.0008 


Cytochrome P450, subfamily XXtV 














(vitamin D 24-bydroxylase) 


EU73 


1.07E-07 


114% 


0.97 


-1% 


0.0033 


Fucosyltransferase 3 (galactoside 3(4)-l- 














fucosyltransf erase, lewis blood gioup included) 


MLN64 : 


0.20 


32% 


0.42 


80% 


0.0007 


Steroidogenic acute legulalory piotein related 


PD17A 


0.1? 


33% 


0.01 


-35% 


-0.0187 


Phosphodiesterase 7A 


PLCl 


0.04 


-68% 


0.35 . 


-170% 


-0.0011 


Plasminogen- like 


5LC1A6 


0.07 


-32% 


0.12 


86% 


0.0069 


Solute carrier family 1 (high- affinity aspartate/ 














glutamate transporter), member 6 














Transcription and translation 


COPEB 


0.10 


-33% 


0.26 


25% 


0.0016 . 


Core promoter element binding protein 


CRK 


0.10 


32% 


0.03 


48% 


0.0098 


v-crk avian sarcoma virus O 10 oncogene 














■ homolog 


RE LA 


0.26 


■ -7% 


0.01 


70% 


0.0034 


v-rel avion retkuloendotheliosis viral 














oncogene homolog A 














Unknown lurwtion 


K1AA0005 


2.71 1-04 


40% 


0.02 


45% 


0.0010 


KIAA0005 gene product 


MCB1 


0.27 


125% 


0.33 


459% 


0.0018 


Mammagfobin 1 



BotuVd gene* v.rtr also iig»Hlri ant loi suivtva) in 43 tumor toini'nn ie-1 (F in. 3b). 

Table t 5t let led e>amples of the cumulative top 100 genes idrntilird using 
training-testing, < toss validation of all 86 lung tumor samples. The percent 
< hannp, as well a> ihe dlic-c tion, for the average values of the 1 0 norv neoplastic 
lung to all tumors, and lor the 67 stage I to the 1 9 stage 111 tumors are shown. A 
positive < opiritfcni $ value is indicative of a relationship ol gene e>pr ession to a 



poorer patient outcome. The genes ate listed in potential functional categories. 
Genes that were also present in the top 50 survival genes using the 43-tumor 
training sel (Fig. 3b) are indie nled in bold type. Complete listing of the gene 
probe sets and annotated gene and unigene identifiers can be found in the 
Supplementary Methods. 
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f ig. 4 Gene e>presvon patterns of lop survival gtnr\ Gene- e>piession patterns Ue- 
teimtncd using agglomeialive hieianhkat {lujleitny ol the 86 lung adenotan inomas 
against the 100 survival- 1 elated genes ("l^ble identified by ihe tiaining- testing, cro>s- 
validation analysis. Substantially elevated (fed) oi ik-t leased (g'ec,n) e>pjejiion ol tbe 
qerves is observed in individual tumors. Jome turners (blaiV snow and expanded area) 
ibow exiffiTH'l)' elevated opiesMun ol iprt ilit genes. i>. An outlier gene e>piess»on pal- 
tern {>5 times (he inleiquarlile range among samples) i.\ observed tor (he eibft? and 
ffeijIA genes (lop It-It and right, ie.\pe< lively). "I he SI 001' :md frk genes (bollorn lelt and 
right, »espe< tively) show a graded pattern ol e>pre.v.ion related to patient survival. O, 
alive; dead (also in ()• *. ^ ne dumber ol outliers prr person identified in the lop 100 
genes plotted by survival ( Ji%ti rl.mtiun. 
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this difference did not reach statistical significance among all 
patients (?= 0.25), between patients within tumor clusters (/> = 
0.41) or when analyzed separately among stage I {i>= 0.22) and 
stage III {P- 0.53) patients. Nuclear accumulation of p53 was de- 
tected in 17.9% stage 1 and in 22.2% stage 111 tumors. No signifi- 
cant relationship was observed for p53 staining and patient 
survival, cluster or tumor stage. 

Confirmation using an independent set of adenocarcinomas 
The robustness of our 50-gene risk index in predicting survival in 
lung adenocarcinomas was tested using oligonucleotide gene-ex- 
pression data Obtained from a completely independent 
(Massachusetts-based) sample of 84 lung adenocarcinomas (62 
stage I, 34 stage 11 and 8 stage HI; ref. 21, and dataseJ A at 
www.genome.wj.mit.edu/MPR/iung). To ensure equivalent 
power for testing and comparability of samples, the criteria for 
including tumors in the analysis were 40% or greater rumor celiu- 
larity, no mixed histology (that is, adenosquamous) and patient 
survival information. To obtain comparative gene- expression 
measures between the two data sets, gene sequences present on 
the U95A and HuGeneFL array were examined, and expression 
data for our top 50 cross-validation genes for all 84 Massachusetts 
samples were obtained and processed 24 (see. also Supplementary 
Methods online ). When we examined the risk assignment of 
these 84 samples, employing the identical cutolf point used for 
the 86 Michigan-based lung samples, we observed Jow- and high- 
risk groups (Fig. 3g; i>= 0.003). Notably, among the 62 stage 1 tu- 
mors, high- and low-risk groups were observed that differed 
significantly (P - 0.006) in their survival (Fig^ 3A). 

Survival genes had graded and outlier expression patterns 

A statistical and graphical analysis of the 100 survival-related 



genes (table 1) clustered against all 86 tumors revealed individ- 
ual rumors with substantially elevated expression in both a lim- 
ited and larger number of genes (Fig. 4a). Among these genes, we 
observed two distinct patterns of expression related to patient 
survival. One pattern, designated 'outlier', included genes show- 
ing substantially elevated expression (greater than five times the 
interquartile range among all samples), whereas the. othef pat- . 
tern, designated 'graded', was characterized by continuously dis- 
tributed expression with patient survival (Fig. 4b). The abB2 and 
A genes are examples of outlier expression patterns and 
S100P and ak genes of graded patterns. The number of outliers 
per person in the top 100 genes was identified and plotted ac- 
cording to survival times and events (Fig. 4c). Both stage I and . 
stage 111 lung adenocarcinomas .showed outlier gene patterns 
and 10 tumors contained 3 or more outlier genes. 

Because gene amplification may result in increased gene ex- . 
pression, the nine genes with outlier expression patterns (erbBZ, 
S1,C}A$, Wrtt 1, ReglA, AtLAPM, PACE, CYP24, KYNU) 

and one gene with a graded expression pattern (Mm 8) were ex- 
amined using quantitative genomic PCR to evaluate genomic 
copy number (Fig. 5a). Gene amplification of erbBZ (17ql2) was 
detected in tumor L94, which had the highest erbB2 mRNA ex- 
pression (Fig. 4a). Gene amplification was not detected for any 
of the other seven tested genes in tumor L94, as well as in other 
tumors. The two genes most frequently demonstrating the out- 
lier pattern in these lung adenocarcinomas were KYNU and 
CYP24, and were present in. 10 and 9 tumors, respectively. 
CYP24 has been described as a gene amplified and over expressed 
in breast cancer 14 , and these results indicate elevated expression 
in lung adenocarcinoma. 

To determine whether the graded or outlier gene-expression 
patterns also occur at the protein-expression level, 10 of the 100 
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I'tcj.S Cent? amplication and piotein expression 0 ( surv'rvat-related genes. 
a, Analysis ol potential gene amplification tor 9 genes showing outlier expres- 
sion patterns in the lung tumors <e/i>B2, 3/C/A6, Wnt t, MGB1 RcgtA 
AKAPU. PACt, CW74 and KYNU) and examined using quantitative genomic 
PCH. A gene showing graded expression pattern (KRTiS), and one gene 
(WCM) with a similar (hromojome location as PACt. were used as controls. 
Only erbB? and PcglA are shown. An esophageal adenocarcinoma with 
known high-level genomic amplification ol e/oB? was used as a positive con- 
liol and normal esophagus DMA was used as a negative control (Ctl). PC ft 
fragmrr.b sires wie 343 hp tor CAPOH, 166 bp lor erbR? and 1 ?6 bp lor 




fieqlA. DNA ij horn normal lung (N) and lumorfQ horn each patient (lor e* 
ample 137). fc, Immunohistoc hemic al analysis of survival related genes with 
lung adenocarcinoma mic roan ays using The tumors Irom Ihij study. I he 
transmembrane erbB? protein (top lelt) e>pression is substantially increased 
in tumor I 94 containing the amplified etbB2 gene (lig. 4a and 6). Impression 
ol VI CI (top right) and MOOP {bottom left) was located within the neoplas- 
tic cells and the pattern ot immijnoieactivity was c onsis lent with the graded 
e> pression pattern demonstrated by their mRNA profiles. I .y pression" ol ihr 
omogene Cfk (bottom right) was abundantly expressed in neophilM lumi 
tells. Magnification >400 (erbB?); >?00(VfXf, 51 OOP and rrfr). . 
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top survival genes (Table 1) for which specific antibodies were 
available were chosen for immunohistochemical analysis using 
lung-tumor arrays from this study (Fig. Sb). Expression of mem- 
.- brane erbB2 protein was substantially increased in the eri?B2-am- 
plified tumor L94 and very Jow levels of expression were present 
in other tumors; consistent with mRNA- expression measure- 
ments (Fig. 4a and b). CDC6 protein expression was also sub- 
stantially higher in tumor L94; consistent with mRNA levels 
(data not shown). Expression of vascular endothelial growth fac- 
tor (VEGF) and SI OOP (fig. Sb), as well as cytokeratin 1 8 (KRT1 8), 
cytokeratin 7 (KRT7) and fas-associated death domain (FADD) 
protein (data not shown), was located within the lung tumor 
cells and consistent with the graded expression pattern of the 
mRNA profiles. The oncogene: crk showed both graded mRNA as 
well as a graded protein-expression pattern with survival, and 
was abundantly expressed in the tumor cells (Fig. Sb). These re- 
sults indicate that many survival-associated genes are expressed 
at the protein level and demonstrate similar mRNA and protein- 
expression patterns. 

Discussion • 

We used several appioaches for the analysis of gene-expression 
data related to clinicopalhological variables and patient sur- 
vival. One approach, hierarchical clustering, was used to exam- 
ine similarities among lung adenocarcinomas in their patterns 
of gene expression. Previous studies of lung tumors"-" have also 
used this method to describe subclasses. of lung tumors. Here, 
we found three clusters that showed significant differences with 
respect to tumor stage and tumor differentiation. This suggests, 
as expected, that tumors with similar histological features of 
differentiation demonstrate similarities in gene expression. 
This feature also partly underlies the observed statistical associ- 
ation of tumor stage and cluster, as many of the higher-stage tu- 
mors, olten poorly differentiated and previously associated 
with a reduced survival*' 0 , were located in Cluster 3. Although 
this cluster contained the highest percentage of stage 11) tu- 
mors, it also contained a nearly equal mixture of stage 1 and 
stage 111 tumors and not all tumors were poorly differentiated. 
This indicates that a subset of stage I lung adenocarcinomas 
share gene- expression profiles with higher-stage tumors. 
Notably, 10 of the J 1 stage I tumors found in Cluster 3 were the 
high-risk stage I tumors identilied using the risk index in the 
'leave-one-out' cross-validation. 

In contrast to previous analyses oi lung adenocarcinomas 71 ^ 7 , 
we validated the expression data horn the arrays. The strong cor- 
relation ol northern -blot analysis and oligonucleotide-array data 
lor gene expression in the same samples (Fig. 2b) indicates that 
these studies provide robust gene- expression estimates. 
Immunohistochemislry using the same tumor samples in tissue 
arrays demonstrates piotejn expression within the lung tumor 
ceils. Together, these studies indicate that many of the genes 
identilied using gene-espression proliles are likely relevant to 
lung adenocarcinoma. For example, 1GFBP3 gene expression is 
increased in lung adenocarcinomas (Fig. 2c). IGIBP3 protein 
modulates the autocrine or paracrine ellects ol insulin-like 
growth (actors, elevated IGFBP3 expression is observed in colon 
cancer*, and increased serum IGFBP3 is associated wiih progres- 
sion in bieast cancer 7 . Hcai-shock protein 70 (MSP- 70) is in- 
creased in lung adenocarcinomas ol smokers'* and is associated 
with imi eased metastatic potential in breast cancer". Increased" 
serum lactate dehydrogenase is correlated wiih tumor stage and 
tumor burden-'", and cy.Matin C, a cysteine pjotease inhibitor ex- 
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pressed in human lung cancers 31 , is prognostic in some cancers* 2 . 
The decreased expression of this protease inhibitor may affect 
the invasive properties of the tumor cell. 

. The cross-validation analytical strategy we used is particularly 
informative for these types of gene- expression analyses for dis- 
ease outcome 33,34 , and identification of cross- validated genes with 
a larger tumor cohort may help refine this risk index for use in a 
clinical setting. The gene- expression data also provide opportune 
ties to observe overarching patterns that advance our under- 
standing of associations between genes and disease. For example, 
the top 100 survival genes include those involved in signaling, 
cell, cycle and growth, transcription, translation and metabolism. 
Expression of many of these genes is likely a function of increased 
proliferation and metabolism in the more aggressive tumors. 
Some genes, such as erbB2 and ReglA (Fig. 4o and b), were highly 
over expressed in a few patients having poor survival. In one 
tumor, the erbB2 gene was amplified (fig. So), demonstrating that 
genomic changes may underlie the over expression of a subset of 
these outlier genes. Immun ©histochemistry confirmed protein 
oversxpression in this patient's tumor (Fig. Sb). Notably, seven of 
the eight outlier genes were not amplified, indicating that other 
mechanisms underlie the increased mRNA expression of these 
survival-related genes. 

Most genes showed a graded relationship between expression 
and patient survival. Genes such as that encoding VEGF, known 
to be strongly associated with survival in lung cancer 3 *- 3 * were 
identified as related to patient survival in our study. VEGF 
demonstrated a graded expression pattern, as did the S100P and 
crk oncogene (Fig. Sb). S3 OOP is a calcium-regulated protein not 
previously reported in lung cancer. The ak gene, the cellular ho- 
molog of the v-crk oncogene, is a member of a family of adaptor 
proteins involved in signal transduction and interacts directly 
with c-jun N-terminal kinase 1 0NK1) 37 . Although ak has not 
been shown to have a role lung cancer, its role in the MAP-ki- 
nase pathway, which leads to activation ol matrix metallopro- 
leinase secretion and cell invasion 38 , indicates potential 
involvement in the the tumor cell invasion or metastasis of 
some lung adenocarcinomas. Among the many genes identilied 
in this study, like crk, that may be causally involved in lung can- 
cer progression (Table 1), some were related to survival in many 
patients, and others in only smaller subsets of patients. This re- 
sult is consistent with the complex molecular architecture of tu- 
mors in general, the heterogeneity of lung adenocarcinomas in 
particular and the multiple mechanisms underlying tumor-cell 
survival, invasion and metastasis" 

Our results demonstrate that a gene-expression risk profile- 
based on the genes most associated with patient survival— can 
distinguish stage I lung adenocarcinomas and differentiate prog- 
noses. The particular genes that deline the clusters, or are associ- 
ated with survival, likely reflect the characteristics ol the 
particular tumors included in the analysis. Current therapy lor 
patients with stage 1 disease usually consists ol surgical resection 
without adjuvant treatment 21 . Clearly, the identification ol a 
high-risk group among patients with stage I disease would lead 
to consideration of additional therapeutic intervention lor this 
group, possibly leading to improved smvival ol these patients. 

Methods 

PDlKenl population. Sequential paiienti seen at ihe Univfijity ol Mkhigan 
Ho>pit;.l between May 1994 and July 2000 tor stage I up jtnge Mt lung ade- 
notaidnoma weie evDluated.lor thij jludy. Conjeni wai ic<dv«>d and the 
projetl woj approved by the lot^l Inititulional Review Board. Primary tu- 
moi* and adjacent nonneoptajtie lung tiWe were obtained at the time of 
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surgety. Peripheral portiorw of resected long carcinomas were sectioned, 
evaluated by a study pathologist and compared with routine H&E sections 
of the same tumors, and utilized for mRNA isolation. Regions chosen for 
analysts contained a tumor. celhilarity greater than 70%, no mixed histol- 
ogy, potential metastatic origin, extensive lymphocytic inlBtration or fibro- 
sis. Tumors were histopalhologkally divided into. two categories based on 
their growth pattern: bronchiaf-derWed, if they exhibited invasive features 
with architectural destruction, and bronchiqloarveplar, if they exhibited 
preservation of the lung architecture. All stage I patients received only sur- 
gical resection with intra-lhoracic nodal sampling and no other treatments. 
Slagettl patients received surgical resection plus chemotherapy and radio- 
therapy. 

Cene- expression profiling and K-ros mutation anarysis: RNA isolation, 
cRNA synthesis and gene- expression profiling were performed as de- . 
scribed^. Details of gene annotation and K-ros mutation analysis are pro- 
vided in supplementary information. 

Northern-blot analysis. Total cellular RNA (10 ug) was separated in 1 .2% 
agarose-formaldehyde gels and vacuum- transferred to Cene Screen Plus 
(NEN Life Science Products, Boston, Massachusetts); Hybridization condi- 
tions and probe labeling were as described* 0 . Individual sequence-validated 
cDNA image clones for human /G/BP3 (clone 1407750), tDH-A (clone 
2420241), cystatin C (CTS3; clone 949938) were from Research Genetics 
(Huntsville, Alabama). The human hist one HA cDNA and the 28$ ribosomal 
F.NA 26-mer oligonucleotide probe were prepared and labeled as de- 
scribed**. 

Gene-amplification analysis. 11 genes were selected for the analysis of ge- 
■ nomic alterations. Primers were designed vsing PrimerSelect -4.05 Windows 
32 software (DNASTAR, Madison, Wisconsin), avoiding pseudogenes or po- 
tential homologous regions. Forward and reverse primers for the genes are 
provided (Supplementary Methods online). Quantitative genomic-PCR was 
then applied and analyzed as described 0 . 

Immunohistocbemical staining. The H&t-stained slides of all primary 
lung tumors were used to identify the most representative regions ol each 
tumor and a lissue microarray (TMA) block was constructed as described*'. 
Jmmunohislochemistry (IHC) was performed using both routine and sec- 
tions horn the TMA block as described 1 *. Detailed methods and the con- 
centrations used for all antibodies are piovided in the Supplementary 
Methods. 

Statistical methods, f-tests were used lo identity differences in mean gene- 
expression levels between comparison groups. Agglomerative hierarchical 
clustering*' was applied using the average linkage method to investigate 
whether there was evidence for natural groupings of tumor samples based, 
on correlations between gene- expression profiles. To investigate the ro- 
bustness of the clustering inference, gene- expression values weie per- 
turbed by adding random Gaussian error ol magnitude obtained Irom a 
duplicate sample to each' data point and then ret luster ed lo determine con- 
cordance in the tumor's class membership. Pearson, y 1 and fisher's exact 
tests were used to assess whether duster membership was associated with 
physical and genetic characteristics ol the tumois. 

To determine whether gene-expression profiles weie associated with 
variability in survival times, 2 separate but complementary approaches 
were used. In the (iist approach, the 86 tumors were randomly assigned lo 
equivalent tiaining and testing sets consisting ol equal numbers of stage 1 
and IM tumors in order to validate a novel risk- index luiu lion that ( aptuted 
the effect of many genes at once. In the second approac h, cross- validation" 
was used to more robustly identify the genes associated with survival. 
Briefly, a 'leave- one- out' cross-validation procedure in which 85 ol the 86 
tumors (the tiaining set) was used to identify genes that were univariDlely 
associated with survival. The risk index was defined as a linear combination 
ol the gene-e>pression values lor the top genes identified by univariate Cox 
proportional, hazard regression modeling**, weighted by I heir estimated re- 
gression coefficients. Kaplan- Meier survival plots and Ion- rank tests were 
then used to assess whethYi the risk- index assignment io high/tow cate- 
gories was validated in the test set. A more detailed description is provided 
(Supplementary Methods online). 



Note: Supplementary information is ovoiloble on the Nature Medicine website. 
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Rapid quantitation of proinflammatory and chemoattractant cytokine 
expression in small tissue samples and monocyte-derived dendritic cells: 
validation of a new real-time RT-PCR technology. 

Blaschke V, Reich K, Blaschke S. Zippnch S , Neumann C 

Department of Dermatology, von-Siebold-Str. 3, D-37075, Goettingen, Germany. 
vblasch@gwdg!de 

The analysis of cytokine profiles plays a central part in the characterization of disease- 
related inflammatory pathways and the identification of functional properties of immune 
cell subpopulations. Because tissue biopsy samples are too small to allow the detection of 
' cytokine protein, the detection of rnRNA by RT-PCR analysis is often used to investigate 
the cytokine milieu in inflammatory lesions. RT-PCR itself-is a qualitative method, 
indicating the presence or absence of specific transcripts. With the use of internal or 
external standards it may also serve as a quantitative method. The most widely accepted 
method is quantitative competitive RT-PCR, based on internal shortened standards. 
Recently, online real-time PCR has been introduced (LightCycler), which allows 
quantitation in less than 30 min. Here, we have tested its use for the analysis of cytokine . 
gene expression in different experimental in vitro and ex vivo settings. First, we 
compared quantitative competitive RT-PCR with real-time RT-PCR in the quantitation of 
transcription levels of the CD4(+) cell-specific chemoattractant Interleukin-16 during the 
maturation of monocyte-derived dendritic cells, and found a good correlation between 
both methods. Second, differences in the amounts of IL-I6 mRNA in synovial tissue 
from patients with rheumatoid arthritis and osteoarthritis as assessed by real-time RT- . 
PCR paralleled differences in the level of IL-16 protein in the synovial fluid. Finally, we 
employed real-time RT-PCR to study the cutaneous expression of several cytokines 
during experimental immunomodulatory therapy of psoriasis by Interleukin-10, and 
demonstrate that the technique is suitable for pharmacogenomic monitoring. In summary, 
real-time RT-PCR is a sensitive and rapid tool for quantifying mRNA expression even 
with small quantities of tissue. The results obtained do not differ from those generated hy 
quantitative competitive RT-PCR. 

Publication Types: 

• Evaluation Studies 



PMID: 1 1 121 549 [PubMed - indexed for MEDLINE] 



74: Apoptosis. 1997;2(6):5 18-28. Related Articles. Links 

^ Spring erLink 

ButyrateLaduced reversal of dexametkasoue resistance in autonomous rat 
Nb2 lymphoma cells. 

Buckley AR, IOu men acker JS Buckley DJ , Lcff MA , Magnuson NS . Rccd JC . 
Ivttyashita T , de Jong G . Gout PW . 

Department of Pharmacology and Toxicology, University of North Dakota School of 
Medicine and Health Sciences, Grand Forks 58202-9037, USA. 
abuckjey@mail.rned.undjiodak.edu 

The parental rat Nb2 lymphoma is a prolactin (PRL) -dependent T cell line. Exposure of a 
PRL-independent subline, Nb2-SFJCD 1 , to sodium butyrate (NaBT) causes transient 
reversal of their growth factor-independent proliferation in association with constitutive 
expression of protooncogenes pim-1 and c-myc. In the present study, we investigated' the 
effect of NaBT treatment on the sensitivity of Nb2-SFJCD 1 cells to dexamethasone 
(DEX)-induced apoptosis. Pretreatment with NaBT (2 mM, 72 h) partially reversed 
resistance to apoptosis in Nb2-SFJCD1 cells exposed to DEX (1 00 nM) for 12 h, 
assessed by flow, cytometric analyses of DNA fragmentation. However, tlie cytolytic 
effect b f DEX was abrogated by PRL in a time- and concentration-dependent manner. 
Evaluation of apoptosis-associated gene expression in NaBT-pre-treated cultures 
incubated with DEX or DEX+PRL indicated that the apoptosis resistance did not stem 
from altered bcl-2 or bax expression. However, there was a strong correlation between 
: the resistance to DEX-activated apoptosis and their enhanced expression of pim-1 mRNA 
and protein. The results show that it is possible to reverse DEX-induced apoptosis of Nb2 
pre-T cells and suggest the pirn- 1 gene product has an important role as a suppressor of 
this process, perhaps functioning as a mediator of PRL action. 
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Alterations in neuropeptide Y levels and Yl binding sites in the Flinders 
Sensitive Liue rats, a genetic animal model of depression. 

Caberlotto L, Jimenez P. Overstreet DEL Qurd YL . Mathe A A . Fuxe K . 

Department of Neuroscience, Karoiinska Institute, Stockholm, Sweden. 

Previously, we observed specific alterations of neuropeptide Y (NPY) and Yl receptor 
mRNA expression in discrete regions of the Flinders Sensitive Line rats (FSL), an animal 
model of depression. In order to clarify the correlation between mRNA expression and 
protein content, radioimmunoassay and receptor autoradiography were currently 
■performed. In the FSL rats, NPY-like immunoreactivity (NPY-LI) was decreased in the 
. hippocampa! CA region, while Yl binding sites were increased; NPY-LI was increased 
in the arcuate nucleus. Fluoxetine treatment elevated NPY-LI in the arcuate and anterior 
cingulate cortex and increased Yl binding sites in the medial amygdala and occipital 
cortex in both strains. No differences were found regarding the Y2 binding sites. The 
results demonstrate a good correlation between NPY peptide and mRNA expression, and 
sustain the possible involvement of NPY and Yl receptors in depression. 
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Neurokinin 1 receptor and relative abundance of the short and long 
isoforrns in the human brain. 

Cabcrlorio L , . Hurd VL T Murdock P , Wahlin JP t Mclotto S , Corsi Carlctti R . 

Department of Biology, Psychiatry CEDD, GlaxoSmitliKline Medicine Research Centre, 
Verona, ftaly. Laura.L.CaberIocto@gsk.com 

Substance P exerts its various biochemical effects mainly via interactions through 
neurokinin- 1 receptors (NK1). Recently, the NK1 receptor has attracted considerable 
interest for its possible role in a variety of psychiatric disorders including depression and 
anxiety. However, little is known regarding the anatomical distribution of NK1 in the 
human centra! nervous system (CNS). Riboprobe in situ hybridization, quantitative PCR 
and in vitro autoradiography were performed. Highest NK1 mRNA levels were localized 
in the locus coeruleus and ventral striatum, while moderate hybridization signals were 
observed in the cerebral cortex (most abundant in the visual cortex), hippocampus and 
different amygdaloid nuclei. Very low levels of the NIC1 mRNA were detected in the 
cerebellum and thalamus. In view of the existence of a long and short isoform of the NK1 
receptor, it was of interest to assess whether there was a differential distribution of the 
two splice variants in the human CNS and peripheral tissues. A quantitative TaqMan 
PCR analysis showed that the long NK1 isoform was the most prevalent throughout the 
human brain, while in peripheral tissues the truncated form was the most represented. 
3H-Substance P autoradiography revealed a good correlation between receptor binding 
sites and NK1 mRNA expression throughout the brain, with the highest levels of binding 
in the Locus coeruleus. These results provide the anatomical evidence that the NK1 
receptors have a strong association with neuronal systems relevant to mood regulation 
and stress in the human brain, but do not suggest a region-specific role of the two . 
isoforrns in the CNS. 
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Characterization of cyclin D2 expression in human endometrium. 

OkuD, Yoon S , Lee E , Hwang S . Song S , Kim J , Yoon BtC Lec JB 

. Department of Obstetrics and Gynecology, Samsung Medical Center, Sungkyurtkwan 
University School of Medicine, Seoul, South. Korea. dschoi@smc.samsung.co.lcr 

OBJECTIVE: This study. was undertaken to investigate cyclin D2 mRNA and protein 
expression in human endometrium during the menstrual cycle. METHODS: Endometrial 
samples were obtained from 15 premenopausal nonpregnant women who had 
hysterectomies for benign gynecologic reasons. They were divided into the following five 
groups according to histologic dating; early proliferative (n = 3), mid to late proliferative 
(n = 3), early secretory (n = 3), mid secretory (n - 3), and late secretory (n = 3). Cyclin 
D2 mRNA and protein expression were analyzed using reverse transcriptase-polymerase 
chain reaction, Western blotting, and urimunohistochemistry. RESULTS: Cyclin D2 
mRNA and protein were expressed in human endometrial tissue throughout the menstrual 
cycle. Cyclin D2 mRNA and protein expression of proliferative phase endometrium were 
significantly higher than those of secretory phase endometrium (P <.05). The staining 
intensity of cyclin D2 in proliferative phase endometrium was higher than that in 
secretory phase (P <05). Cyclin D2 mRNA level showed good correlation with cyclin 
D2 protein level (R = 0.579, P <03), and cyclin D2 protein also showed good correlation 
with immunohistochemical staining intensity (R = 0.562, P < 03). CONCLUSION: 
Cyclin D2 was expressed in human endometrium tliroughout the menstrual cycle. Cyclin 
D2 mRNA and protein were expressed at high levels in proliferative phase endometrium, 
especially in the early proliferative phase, and then decreased in the secretory phase. 
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Human chorionic gonadotrophs beta expression in malignant Barrett's 
oesophagus. 

Couvclard A, Paraf F , Vidaud O . Dubois S , Vidaud ML Flejou JF , Degott C . 

Service d' Anatomic Pathologique, Hopital Beaujon, 921 18 Clichy cedex, France. 
anne.couvelard@bjn.ap-hop-paris.fr 

BACKGROUND: Human chorionic gonadotrophin beta (hCGbeta) is expressed in 
several non-trophoblastic tumours, and this is usually associated with aggressive 
behaviour. Little is known about hCGbeta expression in Barrett's adenocarcinoma. 
MATERIALS AND METHODS: We determined the hCGbeta profile in a large series of 
surgically resected Barrett's adenocarcinoma (a) at mRNA level using real-time 
quantitative reverse-transcription polymerase chain reaction analysis and (b) at protein 
Jevel using immunohistochemistry with a polyclonal antibody and with a monoclonal . 
antibody specific for free hCGbeta. We then sought links between the hCGbeta protein 
expression pattern and clinical and pathological parameters, including patient outcome as 
well as vascular endothelial growth factor (VEGF) expression. RESULTS: hCGbeta 
protein expression was observed in 43 of 76 (57%) Barrett's adenocarcinomas. We 
showed a strong correlation between hCGbeta protein abundance and CGB mRNA level. 
We observed a statistical link between hCGbeta protein expression and infiltrative 
tumour type ( P-0.023), perineural neoplastic invasion ( P-0.007) and VEGF protein 
expression ( P=0.0I6). hCGbeta expression tended to be associated with a poor outcome 
(16% versus 36% survival 8 years after resection). CONCLUSION: Expression of 
hCGbeta correlates with specific infiltrative characteristics and is associated with higher 
VEGF expression. Both molecules may play a co-ordinated role in the development of 
Barrett's adenocarcinomas. 
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Downregulation of ENsiC activity and expression by TNF-alpha iu alveolar 
epithelial cells. 

Dagcnais A , Frechette R, Yamagata Y . Yamagata T . Carmel JF . Clermont ME 
Brochiero E , Masse C. Berthiaumc Y. 

Cenlre de recherche, CHUM-Hotel-Dieu, 3850 St-Urbain, Montreal, Quebec, Canada 
H2W 1T7. andre.dagenais.chum@ssss.gouv.qc.ca 

Sodium absorption by an amiloride-sensitive channel is the main driving force of lung 
liquid clearance at birth and lung edema clearance in adulthood. In this study we tested 
whether tumor necrosis factor-alpha (TNF-alpha), a proinflammatory cytokine involved 
in several lung pathologies, could modulate sodium absorption in cultured alveolar 
epithelial cells. We found that TNF-alpha decreased the expression of the alpha-, beta-, 
and gamma-subunits of epithelial sodium channel (ENaC) mRNA to 36, 43, and'l6% of 
the controls after 24-h treatment and reduced to 50% the amount of alpha-ENaC protein 
in these cells. There was no impact, however, on alpha(l) and beta(l) Na(+)-K(+)- 
ATPase mRNA expression. Amiloride-sensitive current and ouabain-sensitive Rb(+) 
uptake were reduced, respectively, to 28 and 39% of the controls. A strong correlation 
was found at different TNF-alpha concentrations between the decrease of amiloride- 
sensitive current and alpha-ENaC mRNA expression. All these data show that TNF- 
alpha, a proinflammatory cytokine present during lung infection, has a profound 
influence on the capacity of alveolar epithelial cells to transport sodium. 
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Involvement of the CCNDl gene iu hairy cell leukemia. 

dc Boer CJ , Kluui-Nelemans JC . Dreef E , Kester MG , Kluin PM, Schuuring E , van 
Krieken JIL . 

Department of Pathology, University of Leiden, The Netherlands. 

BACKGROUND: Previous results suggested increased rnRNA expression of CCNDl in 
hairy cell leukemia (HCL). The CCND 1 gene is involved in the t( 11 ; I4)(q 1 3 ;q32) 
chromosomal rearrangement, a characteristic abnormality in mantle cell lymphoma 
(IvfCL). We and others reported that, in contrast to other B-cell lymphomas, almost all 
MCL have over-expression of the CCND 1 gene with a good correlation between RN A 
and protein analysis. Recent studies showed that overexpression of the cyclin Dl protein 
can be easily detected by immunohistochernistry (IHC) on formalin-fixed, paraffin 
embedded tissues. PATIENTS AND METHODS: To investigate whether the CCNDl 
gene is involved in HCL, we performed IHC on a series of 22 cases using formalin-fixed 
paraffin embedded splenectomy specimens. For IHC the sections were boiled in citrate 
buffer. The presence of rearrangements within the BCL-l locus and the CCNDl gene 
was analyzed in 13 of 22 cases by Southern blot analysis using all available break-point 
probes. Expression of CCNDl was analyzed at the mRNA level (Northern blot) and 
protein level (IHC). RESULTS: Overexpression of the cyclin Dl protein using IHC was 
observed in all cases, with strong expression in 5. cases. Pre-existing B- and T-cell areas 
of the spleen did not express significant levels of the cyclin Dl protein. Seven of 9 cases 
analyzed by both IHC and Northern blotting showed overexpression of the CCNDl gene 
with both methods. No genomic abnormalities were observed in any of die 13 cases 
studied by Soutiiern blot analysis. Additionally, no I lql3 abnormalities were detected by 
banding analysis of 19 of 22 cases. CONCLUSIONS: The elevated levels of CCNDl 
mRNA and protein in conjunction with the absence of overt rearrangements within the 
BCL-i locus distinguish HCL from MCL and other B-cell malignancies. This suggests 
that activation of the CCNDl gene in HCL is due to mechanisms other than chromosomal 
rearrangement. 
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luhil>iu aud activin production and subunit expression in human placental 
cells cultured in vitro. 

Debicvc F ; Pampfcr S . Thomas EC 

Department of Obstetrics and Gynecological Endocrinology, Universite Catholiquede 
Louvain, 1200 Brussels, Belgium. 

Inhibins and activins are dimeric proteins, with each subunit being one of three related 
protein subunits (alpha, betaA or betaB); The mRNA levels of these subunits were 
studied quantitatively during in-vitro differentiation of human cytotrophoblast cells into 
syncytium, using Northern blot analysis and semi-quantitative reverse transcription- 
polymerase chain reaction (RT-PCR) analysis. The corresponding protein concentrations 
were determined by specific enzyme-linked immunosorbent assays for inhibin A, B, pro 
alphaC and activin A in cellular protein extracts and culture medium (n = 5). 
Immunofluorescence studies showed syncytium formation after 48 h. The alpha subunit 
was present before plating and increased at 48 h (PO.001) while the betaA subunit was 
weak before plating and increased at 24 h. The betaB subunit was not detected. With 
respect to corresponding protein synthesis, inhibin A. (alpha + betaA) had risen after 48 h 
in cellular protein extract and after 72 h in culture medium, while activin A (betaA + 
betaB) was detected after 24 h, with no significant variations in culture medium. There 
was a good .correlation between inhibin. A and alpha subunit expression (r = 0.736, 
P<0.00 1 ), as well as between activin A and betaA subunit expression (r = 0.755, 
P<0.001). This study showed that mRNA expression parallels protein synthesis of inhibin 
and activin in trophoblast cells; Inhibin A synthesis appears to be dependent on alpha 
subunitmRNA expression, rather than on the betaA subunit which controls activin A 
synthesis. This study has also shown that isolated cytotrophoblast cells do not produce 
dimeric inhibin. However, during the transformation of cytotrophoblast cells into 
syncytium, betaA subunit mRNA expression may be an indicator o f cell aggregation, 
while alpha subunit mRNA expression may be an indicator of cell fusion. 
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Expression of membrane-type matrix metalloproteiriases 4, 5, and 6 in 
mouse corneas infected with P. aeruginosa. 

Pong Z , Katar M AfousTS, Berk RS . 

Department of Immunology and Microbiology, Wayne State University School of . 
Medicine, 540 E. Canfieid, Detroit, MI 48201, USA. 

PURPOSE: To investigate the expression and regulation of membrane-type matrix 
metalloproteinases (MT-MMPs) 4, 5, and 6 in the mouse corneas infected with 
Pseudomonas aeruginosa. METHODS: C57BL/6J mice were intracorneally infected with 
P. aeruginosa. The expression of MT4-, MT5-, and MT6-MMP was detected at both the 
mRNA and protein levels by RT-PCR and immunoblot analysis. Immunohistochemical 
staining was performed tojocalize the expression of MT4- and MT5-MMP in the mouse 
corneas, RESULT&. Expressipn.of MT4- and MT5-MMP was detected in the normal 
(uninfected) cornea by RT-PCR and immunoblot analysis. When infected with P. 
aeruginosa, the corneas snowed significant induction of each MT-MMP. Localization of 
MT4- arid MT5-MMP revealed that the expression of MT5-MMP was restricted to the 
epithelial tissue in the normal cornea, whereas the induced expression of MT4- and MT5- 
MMP was predominantly in the substantia propria, which contained most of the 
infiltrating cells. MT6-MMP expression was not detected in the uninfected cornea but 
was upregulated in the~infected corneas. CONCLUSIONS: Expression of MT4-, MT5-, 
and MT6-MMP was induced in corneas infected with P. aeruginosa. 
Immuiiohistochemistry showed predominant immunoreactivity of MT4- and MT5-MMP 
in the substantia propria. Previous histologic studies have revealed different patterns of 
inflammatory cell infiltration with an increased number of polymorphonuclear 
neutrophils (PMNs) during the early stage of inflammation and increased macrophages 
during the late stage. These results indicate a good correlation between the 
overexpression of the MT-MMPs in the infected corneas and the inflammatory response- 
that is, leukocyte infiltration-indicating that inflammatory cells such as macrophages and 
PMNs may play a role in the upregulation of MT-MMPs during corneal infection, which 
in turn can cause the destruction of corneal tissue. 
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Assessment of proliferative activity in colorectal car cinomas by 
quantitative reverse transcriptase-polymerase chain reaction (RT-PCR), 

Duchrow M Hasemcycr S . Broil R , Bruch VVtndhovcl U . 

Surgical Research Laboratory, Surgical Clinic, Medical University of Lubeck, 
Ratzeburger Alice 160, D-23538 Lubeck, Germany. 

The monoclonal antibody Ki-67 and the isospecific monoclonal antibody MIB-1 are 
routinely used in oncology to assess the proliferation index of tumor cells. A more 
objective and sensitive method is the determination of the of Ki-67 protein-specific 
mRNA by quantitative reverse transcriptase-polymerase chain reaction (RT-PCR). In 25 
resected colorectal adenocarcinomas of different stages and grades we determined 
between 0.2 and 4.4 amol (I0(-I8) moi) Ki-67 protein-specific rnRNA per microgram 
total RNA (median = 0.88 amol). The corresponding Ki-67 indices (expressing the 
percentage of Ki-67/MIB-I positive tumor cells) ranged from 41 to 81% (median = 61%). 
We found a good correlation between Ki-67 index and mRNA expression (r = 0.75), a 
significant correlation between both data and tumor stage (primary rumor, regional nodes, 
metastasis fpTNM] staging classification) (p < 0.001), but not between both data and 
tumor grade. Both Ki-67 indices (p = 0.05) and mRNA levels (p - 0.014) correlated 
significantly to the patients* survival. These results demonstrate that the Ki-,67 protein- 
specific quantitative RT-PCR is a useful method for the characterization of tumor cell 
proliferation. 
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Molecular characterisation of carbohydrate digestion and absorption in 
equiue small intestine. 

E>Y cr J, Fcrnandcz-Castano Mercdiz Salmon KS, Proudman CJ , Edwards GB , 
Slurazi-Bccchcy SP . 

Department of Veterinary Preclinical Sciences, University of Liverpool, UK.. 

Dietary carbohydrates, when digested and absorbed in the small intestine of the horse, 
provide a substantial fraction of metabolisable energy. However, if levels in diets exceed 
the capacity of the equine small intestine to digest and absorb them, they reach the 
hindgut, cause alterations in microbial populations and the metabolite products and 
predispose the horse to gastrointestinal diseases.. We set out to determine, at the 
molecular level, the mechanisms, properties and the site of expression of carbohydrate 
digestive and absorptive functions of the equine small intestinal brush-border membrane. 
We have demonstrated that the disaccharidases sucrase, lactase and maltase are expressed 
diversely along the length of the intestine and D-glucose is transported across the equine 
intestinal brush-border membrane by a high affinity, low capacity, Na-H/glucose 
cotransporter type I isoform (SGLT1). The highest rate of transport is in duodenum > 
jejunum > ileum. We have cloned and sequenced the cDNA encoding equine SGLT1 and 
alignment with SGLT1 of other species indicates 85-89% homology at the nucleotide and 
84-87% identity at the amino acid levels. We have shown that there is a good correlation 
between levels of functional SGLT1 protein and SGLT1 mRNA abundance along the 
length of the small intestine. This indicates that the major site of glucose absorption in 
horses maintained on conventional grass-based diets is in the proximal intestine, and the 
expression of equine intestinal SGLT1 along the proximal to distal axis of die intestine is 
regulated at the level of mRNA abundance. The data presented in this paper are the first 
to provide information on the capacity of the equine intestine to "digest and absorb soluble 
carbohydrates and has implications for a better feed management, pharmaceutical 
intervention and for dietary supplementation in horses following intestinal resection. 
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Suppressors of cytokine signaling proteins are differentially expressed in 
Till and Th2 cells: implications for Th cell lineage commitment and 
maintenance. 

Egwuagu CE, YuCR, Zhang M/ Mahdi RM , Kim_gJ. CcryJ 

Laboratory of Immunology, National Eye Institute, National Institutes of Health 
Bethesda, MD 20892, USA. emeka@helix.nih.gov . ■ ' 

Positive regulatory factors induced by tL-12/STAT4 and IL-4/STAT6 signaling during T 
cell development contribute to polarized patterns of cytokine expression manifested by 
differentiated Th cells. These two critical and antagonistic signaling pathways are under 
negative feedback regulation by a multimember family of intracellular proteins called, 
suppressor of cytokine signaling (SOCS). However, it is not known whether these 
negative regulatory factors also modulate Thl/Th2 lineage commitment and maintenance. 
We show here that CD4{» naive T cells constitutively express low levels of SQGSl, 
SOCS2, and SOCS3 mRNAs. These mRNAs and their proteins increase significantly in 
nonpolarized Th cells after activation by TCR signaling.. We further show that 
differentiation into Thl or Th2 phenotype is accompanied by preferential expression of 
distinct SOCS mRNA transcripts and proteins. SOCS1 expression is 5-fold higher in Thl 
than in Th2 cells, whereas Th2 cells contain 23-fold higher levels of SOCS3. We also 
demonstrate that IL-12-induced STAT4 activation is inhibited in Th2 cells that express 
high levels of SOCS3 whereas IL-4/STAT6 signaling is constitutively activated in Th2 
cells, but not Till cells, with high SOCS1 expression. These results suggest that mutually 
exclusive use of STAT4 and STAT6 signaling pathways by differentiated Th cells may 
derive in part, from SOCS3- or SOCS I -mediated repression of IL-12/STAT4- or IL- 
4/STAT6 signaling in Th2 and Th I cells, respectively. Given the strong correlation 
between distinct patterns of SOCS expression and differentiation into the Thl orTh2 
phenotype, SOCS 1 and SOCS3 proteins are therefore Th lineage markers that can serve 
as therapeutic targets for immune modulation therapy! 
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Intravitreal invading cells contribute to vitreal cytokine milieu in 
proliferative vitreoretinopa thy. 

El-Ghrablv IA , Dua HS Orr GM. Fischer D , Tighe PJ. 

Larry A Donoso Laboratory for Eye Research, Department of Ophthalmology, University 
of Nottingham, UK. 

AIM: To examine the contribution of infiltrating cells in the local production of cytokines 
within the vitreous of patients with proliferative vitreoretinopathy (PVR). METHODS: 
The presence of mRNA coding for IL-6, IL-8, IL-lbeta, IL-1 alpha, TNFalpha, 
IFNgamma, IL-12, and HPRT was investigated in 25 vitreous samples from patients with 
PVR, 1 1 vitreous samples from patients with retinal detachment (RD) not complicated by 
PVR, and 1 0 vitreous samples from patients with macular hole (MH). A quantitative 
reverse transcriptase polymerase chain reaction (RT-PCR) using an internal competitor 
was used to investigate these sajnples. From these samples, 15 PVR, &RD, and 8 MH 
were analysed for the' protein levels of the same cytokines using eniyme linked 
immunosorbent assay (ELISA); Spearman correlation was used to test any association 
between mRNA and cytokine protein levelspas an indicator of the contribution these cells 
make to the intravitreal. cytokine milieu. RESULTS: A strong correlation was found 
between mRNA and their respective cytokine levels ;(protein products) for IL-6, IL-8, IL- 
lbeta, IL-1 alpha, TNFalpha, IFNgamma (Spearman r = 0:8.3, 0.73, 0.67, 0.91, 0.73, and 
0.73 respectively), but not for IL-12. The median levels of IL-6, 11^8, IL-lbeta, and 
IFNgamma mRNA and their respective cytokines were significantly higher (p <0.05) in 
patients with PVR than in those with macular hole. There was no statistically significant 
difference in the median levels of IL-1 alpha mRNA between PVR and MH but the 
cytokine IL-1 alpha was detected at a significantly higher level in PVR compared with 
MH patients. Between PVR and RD patients, there was no statistically significant 
difference in mRNA levels for all the investigated cytokines (p >0.05) except for IL-6 
where there was a statistical significance (p= 0.038). In contrast, the median levels of IL- 
6, IL-8, and IL-lbeta cytokines were significantly higher (p <0.05) in patients with PVR 
than in those with RD, whereas for IL- 1 alpha and IFNgamma no significant statistical 
difference was detected between PVR and RD patients (p >0.05)_ When results of RD 
and MH patients were compared, a statistical difference was only detected in mRNA 
levels of INFgamma (p = 0.008). However, no difference was detected for INFgamma 
(protein product) or for any of the other cytokines between RD and MH patients. 
CONCLUSION: Levels of both protein and mRNA encoding IL-6, IL-8, IL-lbeta, and 
IFNgamma is significantly increased in vitreous samples from patients with PVR. The 
strong correlation between ELI S A detectable cytokines (protein products) and their 
respective mRNA levels suggest that intravitreal, invasive cells are the major source of 
these cytokines, with the exception oflL-1 2. Cells invading the vitreous do not appear to 
locally produce IL-12 mRNA. This would appear to implicate cells peripheral to the 



vitreal mass as the major source of this cytokine. 
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Modulation of the glutamatergic receptors (AMP A and NMD A) and of glutamate 
vesicular transporter 2 in the rat facial nucleus after axotomy. 

ELeore_L, Vassias I , Vidal PP T de Wacle C . '.'.•> 

LNRS (CNRS-Paris V), ESA 7060, Centre Universitaire des Saints-Peres, 45 rue des Saints- 
Peres, 75270 Paris Cedex 06, France. 

Facial nerve axotomy is a good model for studying neuronal plasticity and regeneration in the 
peripheral nervous system. We investigated in the rat the effect of axotomy on the different 
subumts of excitatory glutamatergic AMPA (GLuRl-4), NMDA (NR1, NR2A-D) receptors post- 
synaptic density 95, vesicular glutamate transporter 2, beta catenin and cadherin mRNA levels 
and/or protein production were analyzed 1, 3, 8, 30 and 60 days after facial nerve axotomy by in 
situ hybridization and irnmunohistofluorescence. mRNAs coding for the GLuR2-4 NR1 NR2A 
B, D subunits of glutamatergic receptors and for post-synaptic density 95, were less abundant after 
axotomy. The decrease began as early as 1 or 3 days after axotomy; the mRNAs levels were 
lowest 8 days post-lesion, and returned to normal or near normal 60 days after the lesion The 
NR2C subunit mRNAs were not detected in either lesioned or intact facial nuclei. 
Immunohistochemistry using specific antibodies against GLuR2-3 subunits and against NR1 
confirmed this down-regulation. There was also a large decrease in vesicular glutamate transporter 
2 immunostaimng in the ; axotornized facial nuclei at early stages following facial nerve section In 
contrast, no decrease of NR2 A subunit and of post-synaptic density 95 could be detected at any 
time following the lesion, beta Catenin and cadherin immunoreactivity pattern changed around the 
cell body of facial motoneuron by day 3 after.axotomy, and then, tends to recover at day post- 
lesion 60 days. Therefore, our results suggest a high correlation between restoration of 
nerve/muscle synaptic contact, synaptic structure and function in facia! nuclei. To investigate the 
mechanisms involved in the change of expression of these proteins following axotomy the facial 
nerve was perfused with tetrodotoxin for 8 days. The blockade of action potential significantly 
decreased GLuR2-3, NR1 and NR2A mRNAs in the ipsilateral facial nuclei. Thus axotomy- 
mduced changes in mRNA abundance seemed to depend partly on disruption of activity; 
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Altered levels of scavenging enzymes in embryos subjected .to a diabetic 
environment. 

Department of Medical Cell Biology, University of Uppsala, Sweden. 

Maternal diabetes during pregnancy is associated with an increased rate of congenital 
emfr; m th t° ffs ^ ™ e exact -'-lar etiology of the distil 
embryogenes.s ,s unknown, but an involvement of radical oxygen speciesinthe 
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expressed on gestational day 1 1 and both CuZnSOD, MnSOD and catalase were 
expressed on day 12 with increased concentrations of MnSOD transcripts when 
challenged by a d.abet.c milieu. There was a good correlation between inRNA protein 
and act.v, ty levels, suggesting tl ia t the regulation of these enzymes occurs primary T'the 
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Iuducrton of the estrogen receptor by growth hormone and glucocorticoid 
substitution in primary cultures of rat hepatocytes. 

Frevschuss B , Stavrcus-Evcrs A , Sahlru L , Eriksson H . 

Department of Reproductive Endocrinology, Karoiinska Hospital, Stockholm, Sweden. 

Hepatic estrogen receptors (ER) mediate estrogenic effects on mammalian liver 
metabolism and are thereby involved in the regulation of important 
physiological/pathological processes, such as coagulation, atherosclerosis, and 
hypertension. The regulation of the formation of the ER in primary cultures of rat 
hepatocytes was studied by assaying ER and ER mRNA under different endocrine 
conditions. The ER concentration was measured using two different methods, a ligand- 
binding technique and an ER enzyme immunoassay. The results obtained by the two 
methods showed good correlation, and linear regression analysis gave a correlation 
coefficient of 0.95. ER concentrations fell to low steady state levels within 16 h after 
establishing the cell culture and remained low in the absence of hormonal substitution. 
Upon medium supplementation with pituitary GH and the glucocorticoid dexamethasone 
(DEX) in combination, the ER concentration increased 6-fold from 4.2 +/- 1 .0 to 25.8 +/-■ 
7.0 fmol/mgcytosolic protein. ER mRNA was measured by solution hybridization. 
Substitution with GH and DEX in combination increased ER mRNA to 210 +/- 14% of 
control levels. No effect on ER mRNA stability was seen after hormone treatment. It is 
concluded that the regulatory effects of GH and DEX on the hepatic ER in this in vitro 
system are very similar to the effects of these hormones under in vivo conditions. The 
inducible expression of the ERhas never before, to our knowledge, been demonstrated in 
any mammalian liver cell culture system. 
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Cyclin Dl-negative mantle cejl lymphoma: a clinicopathologic study based on 
gene expression profiling. 

A £esJ<S , SieMrLR, DeJong D, JafleES, Wilson WH , Debbie J , Qtt G r Pave BJ . Sanger 
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MaujlLLM, Chan WC ; LYmnhorna/I.enkemm lyTni cc „i ar Pmfiiing Pmj«w 

Department of Pathology and Microbiology, University of Nebraska Medical Center 983135 
Nebraska Medical Center, Omaha, NE 68198-3135, USA. kfu@unmc.edu 

Cyclin Dl overexpression is believed to be essential in the pathogenesis of mantle cell lymphoma 
(MCL). Hence, the existence of cyclin Dl -negative MCL has been controversial and difficult to 
substantiate. Our previous gene expression profiling study identified several cases that lacked 
cyclin D l expression, but had a gene expression signature typical of MCL. Herein, we report the 
clinical pathologic, and genetic features of 6 cases of cyclin Dl-negative MCL. All 6 cases 
exhibited the characteristic morphologic features and the unique gene expression signature of 
MCL but lacked the t(I I ;14)(ql3; q32) by fluorescence in situ hybridization (FISH) analysis The 
tumor cells also failed to express cyclin Dl protein, but instead expressed either cyclin D2 (2 
cases) or cyclin D3 (4 cases). There was good correlation between cyclin D protein expression and 
the corresponding mRNA expression levels by gene expression analysis. Using interphase FISH 
we did not deteel chromosomal translocations or amplifications involving CCND2 and CCND3 ' 
loci in these cases. Patients with cyclin Dl-negative MCL were similar clinically to those with 
cyclin Dl -positive MCL. In conclusion, cases of cyclin Dl-negative MCL do exist and are part of 
the spectrum of MCL. Up-regulation of cyclin D2 or D3 may substitute for cyclin Dl in the 
pathogenesis of MCL. 
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Oxytocin receptors ia boviue cervix: distribution, and gene expression 
during the estrous cycle. 

Fuclis AR, jvell R, Fields PA, Chang SM Fields M.f . 

Department of Obstebics and Gynecology, Cornell University Medical College New 
York, New York 10021, US A. b.^w 

Oxytocin (OT) receptor (OTR) concentrations were determined in the cervix of 
nonpregnant cows on cycle Days 0, 3, 7-8, 17, and 19 (n = 3-4 cows each day)- HHIOT 
was used as the labeled ligand. Mucosal and muscle layers of the cervix were a'lso 
analyzed separately for both ligand binding and expression of the OTR gene using a 
newly developed RNase protection assay (RAP). Cellular localization of OTR protein 
wasdetemuned by immunohistochemistry. All regions of cervix from cows at estrus had 
high concentrations of OTR; in the luteal phase, all were sharply down-regulated At 
^rf t ^ mucosal ,a y er had about 30-fold higher concentrations than the muscle layer. 
OTR mRNA was readily detected by RAP in the mucosa from estrous cows, while much 
weaker s.gnals were found in the muscle. On Days 7-17, the OTR mRNA signals in both 
mucosa and muscle were very faint or nondelegable. Thus, there was a good correlation 
between hgand bind.ng and mRNA expression, which suggests that OTR concentrations 
are ma.nly regulated at the transcriptional level. The epithelial cells at the luminal surface 
of the mucosa were the principal site of immunoreactive OTR; muscle cells showed 
significantly weaker signals. Previously, OT was found to stimulate prostaglandin (PG) 
E2 output in v.u-0 in bovine cervical tissues. Since PGE2 is capable of softening the 
cervix, our findings suggest that OT may have a novel physiological function to cause 
softening ot the bovine cervix mediated by the release of PGE2. 
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Silencing of the thrombomodulin gene in human malignant melanoma. 

Carcinogenesis Division, National Cancer Center Research Institute, Tokyo, Japan. 

wm?^ 0 ^^ 0m k° mo dulin (TM) expression is associated with tumour growth 
TIZ aT md \y™P h ™ d ™«^" human tumours, fn melanoma cell linek TM is 
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hnes suppresses the.r growth. In this study, we analysed TM expression in surgical 
melanoma specnens arid the role of its promoter methylation in the loss of its 
expression. In 15 (75%) of the 20 specimens (five from a pnmary site and 15 from 
metastat.c S1 tes), melanoma cells lacked TM immuaoreactivity. Methylation of the TM 
promoter reg,on was detected in 10 (67%) of the 1 5 TM-negative specimens by 
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meZS- P ^ ^ dCteCted " SiX (46%) ° f 1 3 mdanoma ce » ,ines - whereas no 
methylatxon was detected ,„ two cultured normal melanocytes. There was a good 

correlation between the methylated status of the C P G island and the loss of TM 
messenger RNA (mRNA) expression. Treatment of melanoma cell lines with a 
demethylatmg agent, S-aza^-deoxycytidine, induced demethylation of the promoter 
Sfnln r t0ra,i ° n ° f mRNA md pr0teih session. These findings suggest 

C„cZ» H Uman me ir ^ ^ ™ "P"**^ ^ ** -ethylation of the promoter 
CpG island is one of the mechanisms responsible. 
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In this study, we examined yeast proteins by two-dimensional (2D) gel electrophoresis 
and gathered qualitative information from about 1,400 spots. We found that there is an 
enormous range of protein abundance and, for identified spots, a good correlation 
between protein abundance, mRNA abundance, and codon bias. For each molecule of 
well-translated mRNA, there were about 4,000 molecules of protein. Hie relative 
abundance of proteins was measured in glucose and ethanol media. Protein turnover was 
examined and found to be insignificant for abundant proteins. Some phosphoproteins 
were .dent.fied The behavior of proteins in differential centrifugatibn experiments was 
examined. Such experiments with 2D gels can give a global view of the yeast proteome. 
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Tn this study, >ve examined yeast proteins by hvo-dimensioDa! (20) gel electrophoresis and gathered nuan- 
DtaDve information from about 3,400 spots. We found that there is an enormous range of protein abundance 
and, for idenUSed spots, a good correlation between protein abundance, mRNA abundance, and codon bias 
For each molecule of well- translated mRNA, there were about 4,000 molecules of protein. The relative 
abundance of proteins was measured in glucose and elbanol media. Protein turnover was examined and found 
to be insignificant for abundant proteins. Some phosphoproteins were identified. The behavior of proteins in 
differentia) centrifugation experiments was examined. Such experiments with 2D gels can give a global view of 
the yeast proleome. 



The sequence of the yeast genome has been determined (9). 
More recently, the number of mRNA molecules for each ex- 
pressed gene has been measured (27, 30). The next logical level 
of analysis is that of the expressed set of proteins. V/e have 
begun to analyze the yeast proteome bv using two-dimensional 
(2D) gels. 

2D gel electrophoresis separates proteins according to iso- 
electric point in one dimension and molecular weight in the 
other dimension (21), allowing resolution of thousands of pro- 
teins on a single gel. Although modern imaging and computing 
techniques can extract quantitative data for each of the spots in 
a 2D gel, there are only a few cases in which quantitative data 
have been gathered from 2D gels. 2D gel electrophoresis is 
almost unique in its ability to examine biological responses 
over thousands of proteins simultaneously and should there- 
fore allow us a relatively comprehensive view of cellular me- 
tabolism. 

We and others have worked toward assembling a yeast pro- 
tein database consisting of a collection of identified spots in 2D 
gels and of data on each of these spots under various condi- 
tions (2, 7, 8, 10, 23, 25). These data could then be used in 
analyzing a protein or a metabolic process. Saccharomyces 
cerevisiae is a good organism for this approach since it has a 
well-understood physiology as well as a large number of mu- 
tants, and its genome has been sequenced. Given the sequence 
and the relative lack of introns in S. cerevisiae, it is easy to 
predict the sequence of the primary protein product of most 
genes. This aids tremendously in identifying these proteins on 
2D gels. 

There are. three pillars on which such a database rests: (i) 
visualization of many protein spots simultaneously, (ii) quan- 
tification of the protein in each spot, and (in) identification of 
the gene product for each spot. Our first efforts at visualization 
and identification for S. cerevisiae have been described else- 
where (7, 8). Here we describe quantitative data for these 
proteins under a variety of experimental conditions. 

MATERIALS AND METHODS 
S» rains and media. S. twitiae W303 {AMTa ade2-l his3 }/,]5 tcu2-5, ]J2 
trphl ura3i can I- J 00) was used (26). - Mcl YND (vcast nitrogen base) medium 
was 1.7 g of YNB (Difco) per liter. 5 g of ammonium sulfate per liter and 
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adenine, uracil, and all ammo acids except methionine; -Met.-Cys YNB me- 
dium was the same but without methionine or cysteine. Medium was supple- 
mented with 27c glucose (for most experiments) or with 2% ethanol (for ethanol 
experiments). Low- phosphate YJ£PO was described by Warner (28). 

ISO topic labeling of yeasl and preparation of cell olradsv Yeast strains were 
labeled and proteins were extracted as described by Garrets et a). (7, 8). Briefly, 
cells were grown to 5 X 10* cells per ml. at 30*C; 1 ml of culture was transferred 
to a fresh tube, and 0J mCi of I^SJmelhionmc (e.g., Express protein labeling 
mix; New England Nuclear) was added to this I-ml culture. The cells were 
incubated for a further JO to J5 min and then transferred to a ) _S-m) microcen- 
trifuge tube, chilled on ice, and harvested by centrifugation. The supernatant was 
removed, and the cell pellet was resnspended in 100 >il of lysis buffer (20 mM 
Tris-HCI |pH 7.6], 10 mM NaF, 10 mM sodium pyrophosphate, 0.5 mM EDTA, 
0.1% deoxychotate: just before use, phcnylmcthylsulfonyl fluoride was added to 
I mM, teupeplin was added to 1 rig/mi, pepstalin was added to 1 |Ag/mL tosyl- 
sulibnyl phenylalanvl chloromethy) ketone was added to 10 ng/mf, and soybean 
i r>pstn inhibitor was added to 10 u.g/ml). 

The icsuspcnded cells were transferred to a screw-cap L5-ml polypropylene 
lube containing 0.23 g of glass beads (0.5- mm diameter; Biospec Products) or 
0.40 g of 'zircon ia beads (0.5- mm diameter; Biospec Products). After the cap was 
secured, the lube was inserted into a MiniBeadbeater 8 (Biospec Products) and 
shafcen at medium high speed at 4'C for 1 min. Breakage was typically 75%. 
Tubes were then spun in a microcentrifuge for 10 s at 5,000 X g at 4*C 

With. a very fine pipette tip, liquid was withdrawn from the beads and trans- 
ferred to a precluded 1.5-m) tube containing 7 ul of DNase 1 (0.5 mg/ml; Cooper 
product no. 6330)-RNase A (0.25 mg/m); Cooper product no. 5679)-Mg (50 mM 
MgCK) mix. Typically 70 u.1 of liquid was recovered. The mixture was incubated 
on ice for 10 min to allow the RNase and DNase to work. 

Next, 75 pi of 2X dSDS (2x dSDS is 0.67* sodium dodecyl sulfate [SDSJ, 2% 
mercaptocihanot, and 0.1 M Tris-HCI |pH RJ) was added. The tube was plunged 
into boiling water, incubated for 1 min, and then plunged into ice. Alter cooling, 
the lube was ccnlrifuged at 4°C for 3 min at 14 ; 000 X g. The supernatant was 
transferred to a fresh tube and firmer) at -70"C About 5 ul of this supernatant 
was used for each 2D gel. 

2D pt>lyarryl:>midp gtls. 2D gels were made and run as described elsewhere 
(6-8). 

Iroagr analysis of the pels. The Quest II software system was used for quan- 
titative image analysis (20, 22). Two techniques we're used to roJtert quantitative 
data for analysis by Quest )| software. First, belore the advent of phosphor im- 
agers, gels were dried and fluorographed. Each gel was exposed to film for three 
dilJejenl times (typically 1 day, 2 wctks, and 6 weeks) to increase the dynamic 
range of the data. The films were scanned along wilh calibration strips to relate 
him optica) density to disintegrations per minute in the gels and analyzed by (he 
software to obtain a linear relationship between disrnlcgratrom per minute in ihe 
^pot^ and optical densities Oi* the film images. The quantitative data are ex- 
pressed as pans per million of the tola I cellular pTOlein. This value is tttlrulaieri 
from the disintegrations per minute of the sample loadVd onto the gel and by 
comparing the film density of each data spot with dens it)- of the film over the 
calibiation snips of known radioactivity exposed to the same film. This yields the 
disin reunions pei minute per millimeter for each spoi on the gel and thence its 
par is- per- minute value. 

Afier the advent of phosphor imaging, pels bearing ^S- labeled pioieins wcie 
exposed lo phosphor imager screens and scanned by a Fuji pho.^phorirnagei, 
iypi( ally for nv 0 exposures per gel. Calibration strips of known 'radicaciivirv were 
exposed simultaneously. Scan data from the phospboiima^er was assimilated by 
Quest 11 sol twaie, and quantitative data w f ie recorded lor thr spoii on the pels. 
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Mcnsunmrnts of protein turnovrr. Cells in exponential phase were puke- 
labeled with | j5 S)methionine, excess cold Met and Cys were added, and samples 
of equal volume were taken from the culture at intervals up to 90 win {in one 
experiment) or up to ) 60 min (in a second experiment). Incorporation of 35 S into 
protein was essentially 100% by the first sampje (JO min)L Extracts were made, 
and equal fractions of the samples were loaded on 2D gels (i.e., tbe different 
samples had different amounts of protein but equal amounts of 35 S). Spots were 
quantitated with a phosphor imaging and Quest software. 

the software was queried for spots whose radioactivity decreased through the 
time course. The algorithm examined all data points for all spots, drew a best-fit 
line through the data points, and looked for spots where this line had a statis- 
tically significant negative slope. Jn one of the experiments, there was one such 
spot. To the eye, this was a m'moj, unidentified spot seen onrv in the first two 
samples (10 and 20 min). In the other experiment, the Quest software found no 
spots meeting the criteria. Therefore, wc concluded that none of the identified 
spots (and all but one of the visible spots) represented proteins with Ion* 
half-lives. - o 

Centrifugal fractionation. Cells were labeled, harvested, and broken with glass 
beads by the standard method described above except th;it no detergent (i.e., no 
deoxychohtc) was present in the lysis bu0er. The crude h/sate was cleared of 
. unbroken cells and large debris by tentrifugation at 300 X g for 30 s. The 
supernatanl of this centrifugal ion was then spun at 16,000 X g for 10 min to give 
the pellet used for Fig. 6B. The supernatant of the J 6,000 X g, J 0-rhin spin was 
then spun at 100,000 X g for 30 min to give the supernatanl used for Fie. 6A. 

Protein abundance calculations. A haploidyeasl cell contains about A >Tj0~" 
g of protein (J, 15). Assuming a mean protein mass of 50 kDa, (here are about 
50 X JO* molecules of protein per cell. There are about 1.8 methionines per JO 
kDa of protein mass, which implies 45 X JO 8 molecules of methionine per cell 
(neglecting the small pool of free Met). We measured (i) the counts per minute 
m each spot on the 2D gels, (ii) the lotal number of counts on each gel (by 
intcgratmg counts over tbe entire gel), and <iii) ihc total number of counts 
loaded on the gel (by scintillation counting of the original sample). Thus, we 
know xvhat fraction of the total incorporated radioactivity is present in each spot. 
After correcting for the methionine (and cysteine [see below}) content of each 
protein, wc calculated an absolute number of protein molecules based on the 
fraction of radioactivity in each spot and on 50 X 10* total molecules per cell. 

The labeling mixture used contained about one-fifth as much radioactive 
cysteine as radioactive methionine. Therefore, tbe number of cysteine molecules 
per protein was also taken into account in calculating the number of molecules - 
of protein, but Cys molecules were weighted one-fifth as heavilv as Met mole- 
cules. 

mRNA abundance calculations. For estimation of mRNA abundance we used 
SAGE (serial analysis of gene expression) dara (27) and Asymetrix chip hybrid- 
ization data (29a. 30). The mRNA column in Table J shows mRNA abundance 
calculated bom SAGE data alone. However, ,he SAGE data came from cells 
growing m YEPD medium, where:.* our protein measurements were from cells 
growing m YNB medium. In addition, SAGE data for low-abundance mRNAs 
suffers bom statistical variaiion. Therefore, we pj$p used chip hybridization data 
(29a, .>0) fo» mRNA from cells grown in YNB. These hybridization data also had 
disadvantages. F,isi r I he amounts of high-abundance mRNAs were systemati- 
cally under estimated, probably because of saturation in (he hybridizations which 
used JO ug of cRNA. For example, the abundance of ADHJ mRNA was 197 
copies per cell by SAGE but only 32 copies per cell by hybridization, and the 
abundance of ENQ2 mRNA was 248 copies per cell by SAGE but only 4J by 
hybndizairon. When Ihc amount of cRNA used in the hybridization was reduced 
to l ug the apparent amounts of mRNA were similar to the amounts determined 
by SAGE (29a, 29b). However, experiments using J M g of cRNA have been done 
for only some genes (29a). Because amounts of mRNA w CI e normalized lo 
)5 r DO0 per cell, and became the amounts of abundant mRNAs were underesti- 
mated, there is a 2.2-fold overestimate of . the abundance of nonabundant 
mRNAs. We calculated this b.nor ol 72 by adding together the number of 
mRNA molecules bom a large number of genes expiessed at a low level lor both 
• SAGE daia and bybndizalion data. The sum for the same genes from hybrid- 
ization dara rs 2.?- fold greater than thai bom SAGE data. 

To take into account these diiheuhics, we compiled a list of "adjured" mRNA 
abundance as follows. For all high- abundance mRNAs of oui identified proteins 
. *e used SAGE .lata. For all ol mese particular mRNAs, chip hybridization 
suggested that mRNA abundance was the same in YEPD and YNB media For 
medium-abundance mRNAs, SAGE data weie u*ed. but when hybridization 
data showed a s,gnificant difference between YEPD and YNB ihen the SAGE 
data were adjusted by the appropriate factor. Finally, for low- abundance 
? WC U5fd d:>,r> Uom l ' h 'P h >'°' iJi " '^ns from YNB medium but divided 

by 2.2 to normalize to the SAGE results. These calculations we.c completed 
without reference lo protein abundance. 

CAJ. The codon adaptation index (CA1) w as taken from the yeast protcome 
database (YPD) (13).. fo. which calculations were made acroidine to Sha.p and 
L» <?■*). Briefly, .he index uses a lelerence set ol highly expired ccnes to av M \.„ 
a vjJue ro each codon. and ihen a score for a -ene is calculated horn the 
frequency of use of ihc vjiiou* codons in [hat gene (2^J. 

Statistical -Analysis. The IMP ping, am used with !he :il J of T Tultv The 
JMP program >hou :t ! thai nvlihei mRNA nr.r pmiein -..bundamvs weit- nm- 
mallv ifisn touted: theiefoie, Spearman i.ink cor relation i otllk i.nts (/ ) «. fll . 



calculated. The mRNA (adjusted and unadjusted) and protein, data were also 
- transformed so that Pearson product-moment correlation coefficients (r ) could 
be calculated. First, this was done by a Box-Cox transformation of l<jg- trans- 
formed data. This transformation produced normal distributions, and an r of 
0.76 was achieved. However, because the Box-Cox transformation is complcx^c 
also did a simpler JogaTiihmic transformation. This produced a normal distribu- 
tion for the protein data. However, the distribution for the mRNA and adjusted 
mRNA data was close to, but not ijuite, normal. Nevertheless, we calculated the 
r p and found that it was 0.76, identical lo the coefficient from the Box-Co* 
transformed data. We therefore believe that this correlation coefficient is not 
misleading, despite the fact that tbe log(mRNA) distribution is not quite normal 



RESULTS 

Visualization of JyJOO spots on 'three gel systems. Yeas! 
proteins have isoelectric points ranging from 3.1 to 12.8, and 
masses ranging from Jess than 10 kDa to 470 JcDa. Jt is difficult 
to examine a]] proteins on a single kind of gel, because a gel 
v(hh the needed range in pi and mass would give poor resolu- 
tion of the thousands of spots in the centra) region of the gel. 
Therefore, we have used three gej systems: (i) P H "4 to 8" with 
10% pofyacryJamide; (ii) pH "3 lo 10" with 10% poryacryl- 
amide; and (iii) nonequilibrium with 15% polyacrylamide (7, 
8). Each gel system allows good resolution of a subset of yeast 
proteins. 

Figure 1 shows a pH 4-3, 10% polyacrylamide gel. The pH 
at the basic end of the isoelectric focusing ge) cannot be main- 
tamed throughout focusing, and so the proteins resolved on 
such gels have isoelectric points between pH 4 and pH 6.7. For 
these pH 4-8 gels, we see 600 to 900 spots on the best gels after 
multiple exposures. 

The pH 3-10 gels (not shown) extend the pi range somewhat 
beyond pH 7.5, allowing detection of several hundred addi- 
tional spots. Finally, we use nonequilibrium gels with 35% 
acryjamide in the second dimension. These allow visualization 
of about 100 very basic proteins and about 170 small proteins 
(less than 20 kDa). In lotal, using all three gel systems, about 
1,400 spots can be seen. These represent about 1,200 different 
proteins, which is about one-quarter to one-third of the pro- 
teins expressed under these conditions (27; 30). Here, we focus 
on the proteins seen on the pM 4-8 gels. 

Although nearly 'aJJ expressed proieins are present on these 
gels, the number seen is limited by a problem we call coverage. 
Since there are thousands of proteins on each gel, many pro- 
teins comigrate or nearly comigrale. When two proteins are 
resolved, but are close together; and one protein spot is much 
more intense than the other, a problem arises in visualizing the 
weaker spot: at Jong exposures when the weak signal is strong 
enough for detection, the signal from the strong spot spreads 
and covers the signal from the weaker spot. Thus/ weak spots 
can be seen only when they arc well separated from strone 
spots. 

For a given gel, the number of delectable spots initially rises 
wtih exposure time. However, beyond an optimal exposure, the 
number of distinguishable spots begins lo decrease, because 
signals from strong spots cover signals from nearby weak spots. 
At long exposures, (he whole autoradiogram turns black. Thus, 
there is an optimum exposure yielding the maximum number 
of spots, and al this exposure ihe weakest spots are not seen 
Largely because of the problem of coverage, the proieins 
seen are strongly biased toward abundant proteins. All identi- 
fied proteins have a CA1 of 0.18 or more, and we have iden- 
tified no transcription factors or pioiein kinases, which arc 
nonabundant proieins. Thus, (his technology is useful for ex- 
ammrng protein synthesis, amino acid metabolism, and glyco- 
lysis but not for examining uansoipiion. ONA replication, oi 
the cell cvcJe. 
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Spot identification. The identification of various spots has 
been described elsewhere (7, 8), At present, 169 different spots 
representing 148 proteins have been identified. Many of these 
spots have been independently identified (2, 10, 23, 25) The 
main methods used in spot identification have been analysis of 
ammo, acid composition, gene overexpression, peptide se- 
quencing, and mass spectrometry. 

Pulse-chase experiments and protein turnover. Pulse-chase 
experiments were done to measure protein half-lives (Materi- 
als and Methods). Cells were labeled with [ 35 S)metbionine for 
30 mm. and then an excess of unlabeled methionine was added 
Samples were taken at 0, 10, 20, 30, 60. and 90 min after the 
beginning of the chase. Equal amounts of 35 S were loaded from 
each sample; 2D gels were nm, and spots were quantitated 
Surprisingly, almost every spot was nearly constant in amount 
of radioactivity over the entire time course (not shown). A few 
spots shifted from one position to another because of post- 
transitional modifications (e.g., phosphorylation of RpaO and 
EfbJ). Thus, the proteins being visualized are all or nearly all 
very stable proteins, with half-Jives of more than 90 min. Gygi 
el al. (10) have come to a similar conclusion by using the N-end 
rule to predict protein ha If- Jives. This result does not imply 
that all yeast proteins are siabJe. The proteins being visualized 
are abundant proteins; this is partly because they are stable 
proteins. 

Protein nuantilation. Because all of the proteins seen had 
effectively the same half-life, the abundance of each protein 
was directly proportional to the amount of radioactivity incor- 
porated during labeling. Thus, after taking into account the 
total number of protein molecules per cell, the average content 
of methionine and cysteine, and the methionine and cysteine 
content of each identified protein, we could calculate the abun- 
dance of each identified protein (Tables 1 and 2; Materials and 
Methods). About 3,000 unidentified proteins were also quan- 
tified, assuming an average content of Met and Cys 

Many proteins give multiple spots (7, 8). The contribution 
from each spot was summed to give the total proiein amount 
However, many proteins probably have minor spots thai we are 
not aware of, causing ihc amount of protein to be underesti- 
mated. 

When the proteins on a pH 4-8 gel were ordered by abun- 
dance, the most abundant protein had S,904 ppm the 30th 
most abundant had 2,842 ppm, the 100th most abundant had 

M,??™' lbC 5 ° 0th mosI abunda »' had 57 ppm, and the 
J,000th most abundant (visualized at greater than optimum 
exposure) bad 23 ppm. Thus, there is more than a 300-fold 
range m abundance among the visualized proteins. The most 
abundant 10 proteins account for about 25% of the total pro- 
tein on the pH 4-8 gel, the most abundant 60 proteins account 
lor 5U/ 0 , and the niost abundant 500 proteins account for S0% 
Since it seems likely that the pH 4-8 gels give a representative 
sampling of all proteins; *c estimate thai half of the total 
cellular protein is accounted lor bv fewer than 300 different 
gene products, principally glycolytic enzymes and proteins in- 
volved in protein synthesis. 

Correlation of protein :»bm>dnnce with mRNA nbunuSnrc 
Estimates of mRNA abundance /or each gene have been made 
by 5AGE (27) and by hybridization of cRNA to oligonucleo- 
tide arrays (30). These two methods Rive broadly similar re- 
sults, yet each method has strengths and weaknesses (Materials 
and Methods). Table 1 li.sis the number of molecules of mRNA 
per cell for each gene studied. One measurement (mRNA) 
uses data from SAGE analysis alone (27); a second imorpo 

"pm ^w-i 1 ?? b ° th SAGJE ^."^"iHMiion (30) (adjusted 
mRNA) (Table J; Materia and Methods). We counted 
piotein abundance wnh mRNA abundance (Fig. 2). Tor ad- 
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justed mRNA versus protein, the Spearman rank correlation 
coefficient, V was 0.74 (P < 0.0001), and the Pearson corre- 
lation coefficient, r py on log transformed data (Materials and 
Methods) was 0.76 (P < 0.00001). We obtained similar corre- 
lations for mRNA versus protein and also for other data trans- 
formations (Materials and Methods). Thus, several statistical 
methods show a strong and significant correlation between 
mRNA abundance and protein abundance. Of course, the cor- 
relation is far from perfect; for mRNAs of a given abundance, 
there is al least a 10-fold range of protein abundance (Fig. 2). 
Some of this scatter is probably due to post transcriptional 
regulation, and some is due lo errors in the mRNA or protein 
data. For example, the protein Yef3 runs poorly on our gels, 
giving multiple smeared spots. Its abundance has probably 
been underestimated, partly explaining the Jow protein/mRNA 
ratio of Yef3. It is the most extreme outlier in Fig. 2 

These data on mRNA (27, 30) and protein abundance (Ta- 
ble 1) suggest that for each mRNA molecule, there are on 
average 4,000 molecules of the cognate protein. For instance, 
for Actl (actin) there are about 54 molecules of mRNA per 
cell and about 205,000 molecules of protein. Assuming an 
mRNA half-life of 30 min (12) and a cell doubling time of 120 
mm, this suggests that an individual molecule of mRNA might 
be translated roughly 1,000 times. These calculations are lim- 
ited to mRNAs for abundant proteins, which are likely to be * 
the mRNAs that are translated best. 

A full complement of cell proiein is synthesized in about 120 
mm under these conditions. Thus, 4,000 molecules of protein 
per molecule of mRNA implies that translation initiates on an 
mRNA about once every 2 s. This is a remarkably. high rate; it 
implies that if an average mRNA bears 10 ribosomes engaged 
in translation, then each ribosome completes translation in 
20 s; if an average proiein has 450 residues; this in turn implies 
translation of over 20 amino acids per s, a rale considerably 
higher than estimated for mammalians (3 to 8 amino zcids per 
s) (3.8). i:hese estimates depend on the amount of mRNA Der 
cell (U, 27); F 

The large number of protein molecules that car) be made 
from a single mRNA raises the issue of how abundance' is 
controlled for Jess abundant proteins. Many nonabundant pro- 
teins may be unstable, and this would reduce the protein/ 
mRNA ratio. In addition, many nonabundant proteins may be 
translated at suboptimal rales. We have found that mRNAs for 
nonabundant proteins usually have suboptimal contexts for 
translational initiation. For example, there arc over 600 yeast 
genes which probably have short open reading fiames in the 
mRNA upstream of the main open reading frame (17a). These 
may be devices for reducing the amount of protein made from 
a molecule of mRNA. 

Correlation of codon bias wilh protein abundance. The 
mH^As for highly expressed proteins preferentially use some 
codons rather than others specifying the same amino acid (14). 
This preference is called codon bias. The codons preferred are 
those for which the tRNAs are present in the greatest amounts. 
Use of these codons may moke translation faster or more 
efficient and may decrease misincorporation. These effects are 
most important for the cell for abundant proteins, and so 
codon bias is most extreme for abundant proteins. The effect 
can be dramatic— highly biased mRNAs may use only 25 of the 
61 codons. 

We asked whether the correlation of codon bios with abuiv 
dance continues for medium- abundance proteins. There are 
various maihemalical.expie.ssions quantifying codon bias- here 
we have used the CAJ (2-1) (Materials and Methods) because 
n gives a result between 0 and J. The r, for CA1 versus proiein 
abundance is 0.S0 (P < 0-0001), similar to the mRNA-proteiri 



TABLE I. Quantitative data" 



Function 



Name 


CA1 


mRNA 


AtljUjtcti IT 


Adhl . 


0.810 


197 


197 


Adb2 


0.504 


0 


Cil2 


0.185 


] 




Eriol 


o!870 


No Nla 


Eno2 


0.892 


248 


243 


Fbal 


0.868 


179 


179 


HxkljZ 


0.500 


33 


10.5 


Jell 


0.253 


0 




Pdbl 


0.342 


5 


5 


PdcJ 


0.903 


226 


226 


mi 


0.465 


5 


5 


Pgil 


0.68) 


34 


.14 


Pycl 


0.260 


1 


0.7 


Tall 


0.579 


5 . 


5 


Tdh2 


0.904 


63 


63 


Tdh3 


0.924 


460 


460 


Tpil 


0.817 


No Ato 




Efbl 


0.762 


33 


16.5 


Eft3,2 


0.801 


26 


26 


Prtl 


0.303 


4 


0.7 


RpaO 


0.793 


246 


246 


Tin a 


0.752 


' 29 


29 


Yef3 


0.777 


36 


36 



Protein (Glu) (MP) 


Pmlrtn (TFlh\ /1fV*t 




3,230 


972 


0.79 


o 


yoj ■ 


>20 


23 


?88 


JZ 


4J0 


07d 
y /** 




650 


215 


0 33 


640 


608 




62 


46 




0 


671 


>20 


41 


33 




280 


205 " 


0.73 : 


75 


53 


0.71 


160 . 


120 


0.75 


37 


34 




110 


35 




430 


876 


NR 


1,670 


1,927 


NR 


No Mel 


No Mel- 




358 


362 




99 


54 


0.55 


12 


6 




277 


300 


036 


233 


106 


0.46 


34 


ND 





Carbohydrate metabolism 



Protein synthesis 



Heat shock 



Amino acid synthesis 



Miscellaneous 



HscS2 

Hsp60 

Hsp82 

Hspl04 

Kar2 

SsaJ 

Ssa2 . 

Ssbl,2 

. Sscl 

. Ssel 
Stil 

Adel 

Ade3 

Ade5,7 

Arg4 

Gdh) 

Glnl 

His* 

11v5 

Lys9 

Met6 

Pro2 

Serl 

Trp5 

Actl 

AdJcl 

AId6 

Atp2 

Bmbl 

Bmh2 

Cdc4S 

Cdc60 
. Erg20 

GppJ 

Gsp) 

Ippj 

LcbJ . 

Moll 

Pabl 

Psnl 

Rnr4 

Sam! 

Sam2 

Sodl 

Uba) 

YKL056 

YLR109 

YMR116 



0.581 

0.381 

0.517 

0.304 

0.439 

0.709 

0.802, 

0.850 

0.521 

0.521 

0.247 

0.229 
0.276 
0.257 
0.229 
0.585 
0.524 
0.267 
0.801 
0.332 
0.657 
0.248 
0.258 
0.319 

0.730 

0.533 

0.520 

0.424 

0.322 

0.384 

0306 

0.299 

0373 

0.603 

0.621 

0.620 

0.173 

0.423 

0.488 

0600. 

0.497 

0.494 

0.497 

0.376 

0.212 

0.731 

0.549 

0.777 



2 
9 
2 
7 

5 - 
2 

30 
50 

2 

8 

3 

4 
2 
2 
3 

30 
11 

3 
6 
4 

No N/a 
3 
2 
5 

54 
No Nla 
3 
1 

46 
3 

2 
2 
5 
36 
3 
4 

0.3 

0 

3 
15 

6 

5 

3 
36 

2 
62 
21 
4] 



2.9 
23 
1.3 
7 

10. J 

43 
5 

50 
2.6 
8 

1.1 



1.7 

1.4 

8.1 
27 
31 

3 

6 

4 
22 

3 

3.2 
5 

54 
3 

4.3 

46 
1.4 

2.4 

0.86 

5 

5 

3 

4 

0.8 

0.45 

3 

15: 

6 

5 
35 
36 

2 
62 
2! 
41 



112 

35 
52 
70 
43 
303 
233 
270 
68 
96 
25 

14 
12 
14 
41 

148 

77 
15 

J52 
32 

190 
30 
15 
28 

205 
47 
181 
76 
391 
334 
32 
6 
92 
234 
115 
254 
19 
20 
41 
348 
44 
59 
63 
631 
34 
253 
930 
J 84 



75 
82 
335 
161 
102 
421 
324 
85 
SO 
48 
44 

27 
9 
4 
41 
55 
104 
23 
109 
17 
80 
12 
8 
J2 

164 

43 
159 
109 
137 
147 

26 
2 

39 
358. 

39 
147 

40 

16 

39 
56 
37 
2] 
20 
638 
20 
332 

40 



0.67 

2.3 

2.6 

23 

2.4 

3.4 

1.5 

1.2 

1.7 



13 

1.5 
0.7 
0.52 
0.42 



0.78 



1.4 

0.72 



034 
0.58 



0.47 



0.44 
0.20 



CAI, 3 meaju.e oi codon bw, b taken bom i he YPD. mRNA. number oi mRNA molecules pci cell horn SAGF dittj my 
moleruo pc, cell based on both SAGE :mcl chip hvb.idnaiion (30) (.see Materials 3 nd Methods); Protein (Glu). number 
YND-jlucosc; Pimc.n (Eih), number o! moJecle* ,,| protem per cell in YNB-ethanoI; E/G »aiio, .alio oi protein abundance in 
not riven .1 >t w« close to J o; .1 il »-« m ,i ,tpcavMc (NR) in ;»..|ii P K- W cls. Some e ene products (e.g. Till and Til7 inn ^> * 
a pmte.n o, a „ mRNA ba*i.<; ihese «e pooled. No A'/ u . ,hr,e u,> no citable A'/nlll sire in the .V region oi ,he gene, h„J so .he" 
i he nature £ cr>e product contains no methionine*; :>n»l-w then- no ,t liable protein ditty. 



adjusted n>RNA, number of mRNA 
n( molecules o( pioirin per cell in 
eth;mol to "luense. Trie E/G ratio is 
vr ie difficult to distinguish on either 
ate no SAGE mRNA dar:t; No Met, 
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TABLE 2. Functions of proteins listed in Table 1 



YPD litlc lines 6 



AdhI 

Adh2 

Cit2 

EnoJ 

Eno2 

FbaJ 

Hxk2 

IcIJ 

Pdbl 

Pdc3 

PfkJ 

Pi>il 

Pycl 

Tail 

Tdh2 

Tdh3 

TpiJ 

EfbJ 
Eft] 
Efi2 
Vjtl 

RpaO (RPPO) 

Tifl 

Tif2 

YeD 

Hsc32 
Hsp60 
Hsp82 
HspJ04 

Kar2 

Ssal 
Ssn2 
Ssbl 
Ssb2 
Sscl 

Ssel 
Sti] 

Adel 
Ade3 
Ade5,7 
• Are4 
Gdh) 
Gin) 
His4. 

Ilv5 

hysD 
Mct6 



Pro2 
Scr] 
Trp5 

Acil 

AdkJ 

Ald6 

Aip2 

Bmhl 

Bmh2 

Cdc48 

Cdc60 

Eii;20 

Gp>] {Rbr2) 
Gspl 
Ipp) 
Lcbl 

Mol) (Thi4) 
Pabl 

Psal 
Rnr4 
Sam! 
Sam2 
Sodl 
Ubal 



i NAD* 



YKL056 

ylrh 

YMR) 



Alcohol dehydrogenase I; cytoplasmic isozyme reducm* acetaldehyde lo etbanol, regenerating J 
Alcohol dehydrogenase H; oxidi2es elbanol to acetaldehyde, glucose repressed 

Citrate synthase, peroxisomal (nonmilocbondrial); converts acetyl-CoA and oxaloacetate to citrate plus CoA 
Enolase J (2^phosphoglycerale dehydratase); converts 2-pbospbo-I>-gfycerate lo phosphoenolpyruvate in glycolysis 
Enolase 2 (2-phosphogryceraie dehydratase); converts 2-phospbo-D-gJycerate to phosphoenolpyruvate in ptycorysts 
Fructose bisphosphate aldolase II; sixth step in glycolysis . 
Hexokinase I; converts bexoses to hexose phosphates in glycolysis; repressed by glucose 

Hexokinase II; converts bexoses to hexose phosphates in glycolysis and plays a regulatory role in glucose repression 
Isqcitrate lyase, peroxisomal; ozrries out pari of the gh/oxylate cycle; required for gluconeogenesis 
Pyruvate dehydrogenase complex, EJ beta subunil 
Pyruvate decarboxylase isozyme 1 

Pbosphofruclokinase alpha subunil, part of a complex with Pflc2p which carries out a key regulatory step in glycolysis 
Glucose-6-phosphale isomerase, converts gIucose-6-phosphate to fructose- 6-phosphale . 
Pyruvate carboxylase J: converts pyruvate to oxaloacetate for gluconeogenesis 
Tmnsaldolase; component of nonoxidalive part of pentose phosphate pathway 

CIyceraldchyde-3- phosphate dehydrogenase 2; converts t>gfycer aldehyde 3-phosphale lo J^-dephosphogrycerale • 
GIyceraIdehyde-3-phosphate dehydrogenase 3; converts P-glycer aldehyde 3-phospbate lo 1,3-dephosphogrycerale 
Triosepbosphale isomerase; inlerconverls gJyccraldehydc-3- phosphate and dihydroxyacelone phosphate 

Translation elongation factor EF-Jp; GDP/GTP exchange factor for Tef Jp/Tef2p 

Translation elongation factor EF-2; contains diphthamide which is not essential for activity; identical lo Efl2p 
Translation elongation factor EF-2; contains diphthamide which is not essential for activity; identical to EftJp 
Translation initiation factor eIF3 beta sobuwit (p90); has an RNA recognition domain 
Acidic ribosomal protein AO 

Translation initiation factor 4A {e!F4A) of the DEAD box family 
Translation initiation factor 4A (e)F4A) of the DEAD box family 
Translation elongation factor EF-3A; member of ATP-binding cassette superfamily 

Chaperonin homologous to E. coli HtpG and mammalian HSPDO 

Mitochondrial chaperonin that cooperates with HspJOp; homolog of E. cofi GroEL 

Heat- inducible chaperonin homologous to£. coli HtpG and mammalian HSP90 

HC |0^(PSrj pViin cOTvcra f on f0r induCeJ lhcrn,t>It) ' eiancc and for resolubilizing aggregates of denatured proteins; important for [psi - ]- 

Heat shock protein of ihe endoplasmic reticulum lumen required for protein translocation across the endoplasmic reticulum membrane 

and lor nuclear fusion: member of the HSP70 family 
Cytoplasmic chaperone; heat shock protein of the HSP70 family 
Cytoplasmic chaperone; member of the HSP70 family 

Heat shock protein of HSP70 family involved in the translation I apparatus 
Heat shock protein of HSP70 family, cytoplasmic 

M j^&nZS l^™ot™o7^™Z$' h Tiro44p phyi 3 ch3pc,0nin IO,e " recd *« and ,oWi "« of >» 0,ein *™ 

Heat shock proiem of the HSP70 family; multicopy suppressor of mutants with hyperactivated Ras/cvclic AMP pathway 
Siiess-mduced proiem required for optimal growth at high and low temperatnre;>as letratricopeplide repents 

Phosphoribosylamidoimidazole-succinocarboxamide synthase: catalyzes the seventh step in de novo purine biosynthesis pathway 
C, letrabydrofolale synthase < [Afunctional enzyme), cytoplasmic 

Phosphor ibosybmine-glycine Jigase plus phosphor ibosvlformvIt!lvcin a mid in e evelo- lipase; Afunctional protein 
Argjmnosuccmate lyase: catalyzes the final step in arginine biosynthesis ' • 
Glutamaic dehydrogenase (NADP"*); combines ammonia and a-ketoglulnrate to form glutamale 
Gluiammc synthetase; combines ammonia lo ghitamate in ATP-drivcn reaction 

Ph hb P Wc^ pyropbosphohydrobse/histidinol dehydrogenase; 2nd, 3rd, and Jdh steps of 

K ^1i Mv^^^ aC ' d ^"^fcO'neiwe) (alpha-keio-p-hydioxylacyl) reducloisornerasc); second step in V a | " 

Sf^! 1 ? ™ aeh ?*?S«™« (NADP*, L-glulamale forming) (saccbaropihe reductase), seventh step in lysine biosvnthesis pathway 
cXb.^ (^eibylietryhydropterovl iriglutamate-bomocysieine mclbyllransfcras'e), meihionine synthase, 

7- Glutamyl phosphate irductase (phosphogluiamate dehydrogenase), proline biosynihetic enzyme 
Phasphoscrine irimsammase; involved m synthesis of serine from 3-pbospboglyceraie 
1 ryptophan synthnse, last (5th) step in tryptophan bios>-nthesis pathway 

Aclin; involved in cell polarization, endocyiosis, and olhcr cvtoskeleial functions 
Adenylate kinase f,GTP:AMP phosphotransferase), cytoplasmic 
Cytosolic jcetaldehyde dehydrogenase 

Be la subunil of PJ-ATP synthase; 3 copies are found in each F J oligomer 
Homolog of mammalian 14-3-3 piolein; has strong similarity lo Bmh2p 
Homolog of mammalian 14-3-3 protein; has strong similarity to BmhJp 

Piotein ofihe AAA family of ATPases: required for cell division and homotvpic membrane fusion 
Leuol- IK NA synthetase, cytoplasmic 

Farnesyt pyrophosphaie synibetase; may be rate-limiting step in sterol biosynthesis paihway 
DL- Glycerol phosphalc phosphatase 

Ran, a GTP-hindmg p.oicin ot (he Ras superfamily involved in tr jfheking through nuclear pores 
Inoigaiiic pyrophosphatase, cytoplasmic • . 

Component of serine C- palmitoyltransferase; hrst ^tep in biosynthesis of long-chain base component of sphingoids 
Thiamine- if pressed pint em essential lor growth in the absence of thiamine - 
P< ?«o£S °' Cytnpl ' JSm 3 " d nm,Cn ^ P : "' of 3 '-^ J RNA p.occssing complex (cleavage factor I); has 4 RNA 

M a nnose- J -phosphate guanylliansfeiasc; GDP-mannose pyrophosphorvlase 

Ribonucleotide reductase small subunil 

5-AdenosyImeihionine synthetase 1 

5-Adenosylmethiomne synthetase 2 

Coppei-zinc superoxide dismuiase 

Ubi<|uitin :(ctivaimg (EJ) enzyme 

Resembles iranslationalJv conliollcd tumor protein of animal cells and hl^het rlanls 



Y1.R109 (Ahp]^ Alley! hydroperoxide reductase 

" )6 (Ascl) Abundant protein with effects on (ransblional efficiency and cell si?e, has two WO (WD-4Q) repeats 



- AcccpieJ n:.n« bom the Socfhoromyca gennme darahasc anJ Y YD. Names in pa.enthrses represcnl rtcrn. chances 
(.oiiiic^- o\ I roieurirc. Im . icpnntcd wvih per miss ton. 
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Adj. mRNA 

FIG. 2. Correlation of protein abundance with adjusted mRNA abundance. 
The number of molecules per cell of each protein is plotted against the number 
of molecules per cefl of I be cognate mRNA, with an r p of .0.76. Note the 
logarithmic axes. Data for mRNA were taken from references 27 and 30 and 
combined as described in Materials and Methods. 



correlation, confirming 3 strong correlation between CAI and 
protein abundance (Fig. 3). The relationship between CAI and 
protein abundance is Jog linear from ab6m 1,000,000 to about 
10,000 molecules per cell. We have no data /or rarer proteins. 

It is not clear whether CAJ reflects. maximum or average 
levels of protein expression. The proteins used for the CA1 
protein. correlation included some proteins which were not 
expressed at maximum levels under the condition of the ex- 
periment (Hsc82, Hspl04, Ssal, Adel, Arg-i His4, and others). 
When these proteins were removed from consideration and 
the correlation between CAI and the remaining (presumably 
constitutive) proteins^ was recalculated, the r s was essentially 
unchanged (not shown). 

The equation describing the graph in Fig. 3 is log (proiein 
molecules/ceJI) = (2.3 X CAI) + 3.7. Thus, under certain 
conditions (a CAI of 03 or greater: a constitutively expressed 
gene), a very rough estimate of proiein abundance can be 
made by raising 10 to the power of [(2.3 X CAJ) + 3.7]. 

The distribution of CAI over the genome (Fig. 4) consists of 
a lower, bell -shaped distribution, possibly indicating a region 
where there is no select ion. for codon bias, and an upper, flat 
distribution, starting at a CAJ of aboul 0.3, possibly indicating 
a region where (here is selection for codon bias. Almost all of 
the proteins whose abundance we have measured are in the 
upper, fiat portion of the distribution. Jn the lower, bell-shaped 
region, we do nol know whether there is a correlation between 
CAI and protein abundance. 

Changes in protein abundance in glucose and ethanol. A 
comparison of cells grown in glucose (Fig. 1A) with ceJJs grown 
in ethanol (Fig. IB) is shown in Table 1. As is well known, 
some proteins ;ue induced tremendously during growth on 
ethanol. Two striking examples are the peroxisomal enzymes 
IcIJ (isocitrate lyase) and Cit2 (cilrate synthase), which are 
induced in elhanol by more lhan 300- and 12-fold, respectively 
(Fig. J; Table J). These enzymes are key components of the 
glyo^ylntc shunt, which dive its some acetyl coenzyme A 
(areryJ-CoA) from the tricarboxylic acid cycle !o gluconeogen- 
esis. S. cerevisiae requires large amounts of carbohydiate for its 
cell wall; in ethanol medium, this carbohydrate comes from 
gluconeogenesis, which depends on the plyoxylate shunt and 
on. the glycolytic pathway running in reverse. The need for 



gluconeogenesis also explains why glycolytic enzymes are 
abundant even in ethanol medium. Thus, 2D gel analysis shows 
the prominence of the glycolytic and glyoxylale shunt enzymes 
in cells grown on ethanol, emphasizing that gluconeogenesis, 
presumably largely for production of the cell wall, is a major 
metabolic activity under these conditions. : 

During gluconeogenesis, substrate-product relationships are . 
reversed for the glycolytic enzymes. One might expect that not 
all glycolytic enzymes would be well adapted to the reverse . 
. reaction. Indeed, 2D gels show that in ethanol, Adh2 (alcohol 
dehydrogenase 2) is strongly induced (16), while its isozyme 
Adhl is not greatly affected. Ad hi and Adh2 each inlerconvert 
acelaldehyde and ethanol. Adhl has a relatively high K m for 
ethanol (17 mM), while Adh2 has a- lower K m (0.8 mM) (5). 
Thus, it is thought that Adhl is specialized for glycolysis (ac- 
etaldehyde to ethanol), while Adh2 is specialized for respira- 
tion (ethanol to acelaldehyde) (5,29). Similarly, Enol (enoJase 
1) is induced in ethanol, while its isozyme Eno2 (enolase 2) 
decreases in abundance (Table 1) (4, 19). Enol is inhibited by . 
2-phosphogrycerate (the glycolytic substrate), while Eno2 is 
inhibited by phosphoenolpyruvate (the gluconeogenic sub- 
strate) (4). Perhaps Enol has a lower K m for phosphoenol- 
pyruvate than doesEno2, though to our knowledge this has nol 
been tested. Thus, the 2D gels distinguish isozymes specialized 
for growth on glucose (Adhl and Eno2) from isozymes spe- 
cialized for ethanol (Adh2 and Enol); . .. 

Many heat shock proteins (e.g.,.Hsp60, Hsp82, Hsp304, and 
Kar2) were about twofold more abundant hi elhanol medium 
than in glucose medium. This, is consistent with the increased 
heat resistance of cells grown in ethanol (3). 

Enzymes involved in protein synthesis (Eftl, RpaO, and Tifl) 
were about twice as abundant in glucose medium as in ethanol 
medium. This may reflect the higher growth rate of the cells in 
glucose: 

Phosphorylation of proteins. To examine protein phosphor- 
ylation, we labeled cells with 32 P and ran 2D gels to examine 
phosphoproteins. About 300 distinct spots, probably represent- 
ing 150 to 200 proteins, could be seen on pi) 4-8 gels (Fig. 5B). 
We then aligned autoradiograms of three gels, each with a 
different kind of labeled protein ( 32 P only |Fig. 5B], 32 P plus * 
35 S [Fig. 5A], and 35 S only [nol shown, but see Fig. 1 for . 
example]). Jn this way, we made provisional identification of 
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Codon Adaptation Index 

FIG. 3. Con clarion ol piotcin abundance whh CA), The rmmbei of mole- 
cules pci cell oJ each pioitin is plotted against iitt CAJ toi lhal protein. Noie the 
logaiiihrotc scale on the pjotrin ajus. Ddis rot the CAI aie fiom the YFD 
d;itnba>e ft?). 
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Codon Adaptation Index 

FJG. 4. Distribution of CAI over the whole genome, shown in intervals of 0.030 (i.e., there are 150 genes with a CAI between 0-000 and 0.030, inclusive; 31 genes 
with a CAJ between 0.031 and 0.060; 269 genes with a CAI between 0.061 and 0.090; 1,296 genes wiih s CAI between 0.091 and 0.1 20;. etc.). The distribution peaks 
whh 2,028 genes with a CAI between 0.121 and 0.150. 



some of the 32 P-labeled spots as particular 35 S-labeled spots. 
A)) such identifications are somewhat uncertain, since precise 
alignments are difficult, and of course multiple spots may ex- 
actly comigrate. Nevertheless, we believe that most of the 
provisional identifications are probably correct. Among the 
major 32 P-Jabeled proteins are the bexokinases Hxtct and 
BxJcZ the acidic rihosome-associated protein RpaO, the trans- 
lation factors Yef3 and EfbJ, and probably Hsp70 heat shock 
proteins of the Ssa and Ssb families. RpaO and Efol are quan- 
titatively monophosphorylated. 

Many yeast proteins resolve into multiple spots on these 2D 
gels (7). Yef3 has five or more spots, at least four of which 
comigrate with 32 P. Tpil has a major spot showing no 32 P 
labeling and a minor, more acidic spot which overlaps with 
some 32 P label. Tift has at least seven spots (7); two of these 
overlap with some 12 P label but five do not (Fig. 5). Eftt has 
at least three spots (7), and none of these overlap with 32 P, 
although there are three nearby, unidentified 37 P-labe)ed spots 
(a, c, and d in Fig. 5). Spots that seem to be extra forms of 
Met6, Pdc], Eno2, and Fbal can be seen in Fig. 6A, but there 
is lif fie 3 "P at these positions in Fig. 5. Thus, phosphorylation 
explains some but not all of the different ptotein isoforms seen. 

The cell cycle is regulated in part by phosphorylation. We 
compared 32 P-labeled proteins from cells synchronized in G, 
with a -fact or, in ceils synchronized in G t by depletion of G, 
cyclins, and in cells synchronized in M phase with nocodazole. 
Only very minor differences were seen, and these were difficult 
lo reproduce. The cell cycle proteins regulated by phosphory- 
lation may not be abundant enough for this technique to be 
applied easily. 

Centrifugal fractionation. We fractionated 3> S-labeled ex- 
tracts by centrifugal ion (Materials and Methods). Figure 6A 
shows the proteins in the supernatant of a high- speed 
(lOUOOO X 30 min) centrifugal ion, while Fig. 613 shows the 
proteins in the pellet of a low- speed (16.000 X g t ]0 mm) 
centrifugalion. Many proteins are tremendously enriched in 
one fraction or the other, while oihcis ;nc present in both. 



Most glycolytic enzymes (e.g., Tdh2, Tdh3, Eno2, Pdcl, Adhl, 
and Fbal) are enriched in the supernatant fraction. The only 
exception is PfkJ (not indicated), which is found in both pellet 
and supernatant fractions. Many proteins involved in protein 
synthesis (Efll r Yef3, Prll, Tifl, and RpaO) are in the pellet, . 
possibly because of the association of ribosomes with the en- 
doplasmic reticulum. However, Efbl is in the supernatant, as is 
a substantial portion of the Eft J. Perhaps surprisingly, several 
mitochondrial proteins (Atp2 [not shown) and JIv5) are largely 
in the supernatant. Perhaps glass bead breakage of cells re- 
leases mitochondrial proteins. The nuclear protein Gsp) is in 
the pellet fraction. The enrichment produced by centrifugalion 
makes it possible to see minor spots which are otherwise poorly 
resolved from surrounding pToteins. Figure 6B shows that the 
previously identified Tifl spot is surrounded by as many. as six 
other spots that cofrnctionnte. We observed six identical or 
very similar additional spots when we overexpressed Tifl from 
a bigb-copy-number pjasmid (not shown). Signal overlaps only 
one or two of these spots in 33 P-labeJing experiments (Fig. 5), 
and so the different forms are not mainly due lo different 
phosphorylation states. 

DISCUSSION 

Our experience with developing a 2D gel protein database 
for S. cerevisiae is summarized here. With current technology, 
we can see the most abundant J, 200 proteins, which is about 
one-third to one-quarter of the proteins expressed. The re- 
maining proteins will be difficult to see and study with the 
methods that we have used, not because of a lack of sensitivity 
but because weak spots are covered by nearby strong spots. 

Of the 1,200 proteins seen, we have identified J48, with a 
bias toward the most abundant proteins. Steady application of 
the methods already used would allow identification of most of 
the remaining proteins. Gene over expression will be parlicu- 
larly useful, since it is not affected by the lower abundance of 
the remaining visible proteins. 
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2D gels of the kind that we have used are nol suitable for 
visualization of rare proteins. However it will be possible To 
study on a global basis metabolic processes involving relatively 
abundant proteins, such as protein synthesis, glycolysis, glu- 
coneogenesis, amino acid synthesis, cell wall synthesis, nucle- 
otide synthesis, lipid metabolism, and the heat shock response. 

Gygi et a I. (10) have recently completed a study similar to 
ours. Despite generating broadly similar data, Gygi et al. 
reached markedly different conclusions. We believe that both 
mRNA abundance and codon bias are useful predictors of 
protein abundance. However, Gygi ef aL feel that mRNA 
abundance is a poor predictor of protein abundance and that 
"codon bias is not a predictor of either protein or mRNA 
levels" (10). These different conclusions are partly a matter of 
. viewpoint. Gygi et al. focus on the fact that the correlations of 
mRNA and codon bias with protein abundance are far from 
perfect, while we focus on the fact that, considering the wide 
jange of mRNA and protein abundance and the undoubted 
presence of other mechanisms affecting protein abundance, 
the correlations are quite good. 

However, the different conclusions are also partly due to 
different methods of statistical analysis and to real differences 
in data. With respect to statistics, Gygi et a), used the Pearson 
product-moment correlation coefficient (r p ) to measure the 
coyariance of mRNA and protein abundance. Depending on 
the subset of data included, their r p values ranged from 0.1 to 
0.94. Because of the low r p values with some subsets of the 
data, Gygi et al. concluded that the correlation of mRNA to 
protein was poor. However, the r p correlation is a parametric 
statistic and so requires variates following a bivariate normal 
distribution; that is, it would be valid only if both mRNA and 
protein abundances were normally distributed. In fact, both 
distributions are very far from normal (data nol shown), and so 
a calculation of r p is inappropriate. There was no statistical 
backing for the assertion that codon bias fails to predict pro- 
tein abundance. 

We have taken two statistical approaches. First, we have 
used the Spearman rank correlation coefficient (r y ). Since this 
statistic is n on parametric, there is no requirement for the data 
to be normally distributed. Using the r Jf we find that mRNA 
abundance is well, correlated with protein abundance (r } = 
0.74), and the CA1 is also well correlated with protein abun- 
dance (r, = 0.80) (and also with mRNA abundance (data not 
shown]). For the data of Gygi et al. (30), we obtained similar 
results, though with their data the correlation is not as good; r s 
~ .0.59 for the mRNA-to-protein correlation, andr^ = 0.59 for 
the codon bias-to-protein correlation. 

Jn a second approach, we transformed the mRNA and pro- 
tein data to forms where they were normally distributed, to 
allow calculation of an r p (Materials and Methods). Two trans- 
formations, Bo.x-Cox and logarithmic, were used; both gave 
good correlations with our data |e_g.. r p - 0.76 for lbg(adjusied 
RNA) to Iog(protein)]. Wc were nol able to transform the data 
of Gygi et al. to a normal distribution. 

Finally, there are also some differences in data between the 
two studies. These may be partly due to the different measure- 
ment techniques used: Gygi et al. measured protein abundance 
by cutting spots out of gels and measuring the radioactivity in 
each spot by scintillation counting, whereas we used phospho- 
r imaging of intact gels coupled io image analysis. We com- 
pared our data io i heirs for the proteins common between the 
studies (but excluding proteins whose mRNAs are known to 
differ between rich and minimal media, and excluding Tin, 
which wasanomalous in differing by 100- fold between the two 
.data sets). The r f between the two protein data sets was 0.88 
[P < 0.0001). Although this is a strong coi relation, the fact thai 



it is less than 1.0 suggests that there may have been errors in 
measuring protein abundance in one or both studies. After 
normalizing the two data sets to assume. the same amount of 
protein per cell, we found a systematic tendency for the protein 
abundance data.of Gygi et al. to be slightly higher than ours for 
the highest-abundance proteins and also for the lowest-abun- 
dance proteins but slightly lower than ours for the middle- 
abundance proteins. These systematic differences suggest some 
systematic errors in protein measurement. Although we do.nol 
know what (he errors are, we suggest the following as a rea^. 
sonable speculation. For the highest-abundance proteins, we 
may have underestimated the amount of protein because of a 
slightly nonlinear response of the phosphorimager screens. For ■ 
the lowest-abundance proteins, Gygi et al. may have overesti- 
mated the amount of protein because of difficulties in accu- 
rately cutting very small spots out of the gel and because of 
difficulties in background subtraction for these small, weak- 
spots. The difference in the middle abundance proteins may be 
a consequence of normalization, given the two errors above. 

The low-abundance proteins in the data set of Gygi el al. 
have a poor correlation with mRNA abundance. We calculate 
that the is 0.74 for the top 54 proteins of Gygi et al. but only 
0.22 for the bottom 53 proteias, a statistically significant dif- 
ference. However, with our data set, the r s is 0.62 for the top 33 
proteins and 0.56 (not significantly different) for the bottom 33 
proteins (which are comparable in abundance to the bottom 
53 proteins of Gygi et al.). Thus, our data set maintains a good 
correlation between mRNA and protein abundance even at 
low protein abundance. This is consistent with our speculation 
that protein quantification by phosphorim aging and image 
analysis may be more accurate for small, weak spots than is 
cutting out spots followed by scintillation counting. Our rela- 
tively good correlations even for nonabundant proteins may 
also reflect the fact that we used both SAGE data and RNA 
hybridization data, which is roost helpful for the least abundant 
mRNAs. In summary, we feel that the poor correlation of 
protein to mRNA for the nonabundant proteins of Gygi el al. 
may reflect difficulty in accurately measuring these nonabun- 
dant proteins and mRNAs. rather than indicating a truly poor 
correlation in vivo. It is not surprising that observed correla- 
tions would be poorer with less- abundant proteins and 
mRNAs, simply because the accuracy of measurement would 
be worse. 

How well can mRNA abundance predict protein abun- 
dance? With r p = 0.76 for logarithmically transformed mRNA 
and protein data, the coefficient of determination, (r p ) 7 , is 0.58. 
This means that more than half (in Jog space) of the variation 
in protein abundance is explained by variation in mRNA abun- 
dance. When converted back to arithmetic values, protein 
abundances vary over about 200-fold (Table 1), and (r p ) 7 = 
0.58 for the Jog data means that of this 200-fold variation, 
about 20-fold is explained by variation in the abundance of 
mRNA and about 30- fold is unexplained (but could be due 
partly to measurement errors). For proteins much less abun- 
dant than those considered here, we imagine the in vivo cor- 
relation between mRNA and protein abundance wilJ be worse, 
and other regulatory mechanisms such as protein turnover will 
be more important. . 

Some important conclusions can be drawn from this sam- 
pling of the pioteome. First., there is an enormous range ol 
protein abundance, from nearly 2,000 : 000 molecules per cell 
for some glycolytic enzymes lo about 100 per cell lor some cell 
cycle pioicins (26a). Second; about half of all cellular protein 
is found in fewer than 100 different gene products, which are 
mostly involved in carbohydrate metabolism or pjotcin svnthc- 
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sis. Third, the correlaiion between prolein abundance and CAI 
is Jog linear as far as we can see, which is from about 10,000 
prolein molecules per ceil to about 1 ; 000,000. This is somewhat 
surprising, because it implies that selective forces for codon 
bias are significant even at moderate expression levels. It also 
means that codon bias is a useful predictor of protein abun-. 
dance even for moderately low bias proteins. Fourth, there is a 
good correlation between protein abundance and mRNA 
abundance for the proteins that we have studied. This validates 
the use of mRNA abundance as a rough predictor of protein 
abundance, at least for relatively abundant proteins. Fifth, for 
these abundant proteins, there are about 4,000 molecules of 
protein for each molecule of mRNA. This last conclusion 
raises questions as to how the levels of nonabundarit proteins 
are regulated and suggests that protein instability, regulated 
translation, suboptimal rates of translation, and other mecha- 
nisms in addition to transcriptional control may be very impor- 
* tant for these proteins. " ■ ^ 
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Pre-transiational ^regulation of cytochrome P450 genes is responsible for 
drrhoS PeC ' 0f iadiyid ™ 1 P450 enz y mes a ™»g Paints with 

GeorgeJ, LjddjeC, Murray M , BythK, FarreU GC 

We have recently reported that disease-specific differential alterations in the hepatic 
express.cn of xenohotic-metabolizing cytochrome P450 (CYP P450) enzymes occur in 
patients wr* advanced liver disease. In order to ^irL^J^^JZls 
n CYP pro ems are modulated at pre- or pbst-translatiorial levels, we have now exS«I 
the he P at,c levels of mRNA for CYPs 1A2,2C9,2E1 and 3A4 b ; solution ^yTriSon 
m the same hvers of 20 controls (surgical waste from histological normal lS 32 

mRNA and CYP1 A immunoreactive protein were both reduced in livers with 
hepatocellular and cholestatic types of cirrhosis. In contrast, CYP3A4 mRNA and protein 
were reduced only in livers from patients with hepatocellular disease! FoT^ Tald 3A4 
there were significant correlations between mRNA species and the respective^*! 
contents (rS 1 A2 = 0.74, rS3A4 = 0.64, P < 0.0001). CYP2C9 mRNA was reduced^n 
patients with both cholestatic and hepatocellular types of liver dSatTut 2C wl 

c;si?rv with r estatic dysfonction - ^ <^^2^: m 

CYP2C9 mRNA and protein, was also significant (rs = 0.36, P < 0.005) but mRNA levels 
accounted for only 13% of the variability in protein rankings. This is probabryT 

antibody CYP2E1 mRNA and protem were reduced in patients with cholestatic liver • 

decreased. CYP2E1 mRNA was significantly correlated with CYP2E1 protein but 
lotted ( for only 18% of the variability in ^Idn („ -oSLT^SSs) 

mtboiS^ 

metabohzmg CYP enzymes among patients with cirrhosis is due, at least in part to pre- 
translational mechan.sms. The lack of a strong correlation between CYP2E 1 mRNA and 
protem suggests that tins gene, like its rat orthologue, may be subject to pre-translalnM ' 
as well as translation^ and/or post-translational regulation. transiauonal 
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Cyclooxygenasc-2 expression in macrophages: modulation by protein 
kinase C-alplia. 

Ginmx M , Descoteaux A 
SIT" 

Cyclooxygenase-2 (COX-2) is an inducible enzyme responsible for high levels of PG 
production during inflammation and immune responses. Previous studies with 
pharmacology inhibitors suggested a role for protein kinase C (PKC) in PG production 
poss.bly by regulating COX-2 expression! In this study, we addressed the role of PRC- 
alpha in die modulation of COX-2 expression and PGE2 synthesis by (he overexposing 

Ia^T7 (DN) mUtant ° f this iS ° en2yme in the mouse macrophage cell line 
f± W rr^r > nve f f *d the effect of various stimuli on COX-2 expression, namely 

A Vr^T" 13 ' 'n^cellular parasite Leishmania donovani. Whereas LPS- 

induced COX-2 mRNA and protein expression were down-regulated in DN PKC-atoha- 
overexpressing clones, IFN-gamma-induced COX-2 expression was up-regulated in DN 
f K.C-alpha-overexpressing clones with respect to normal RAW 264 7 cells 
Measurements of PGE2 levels revealed a strong correlation between PGE2 secretion and 
IFN-garnma-mduced COX-2 mRNA and protein levels in DN PKC-alpha-overexpressing 
clones. Taken j together, these results suggest a role for PKC-alpha in the modulation of 
LPS-^and IFN-gamma-induced COX-2 expression, as well as in IFN-gamma-induced . 
PGE2 secretion. 
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Transcript profiling of human platelets using microarray and serial 
analysis of gene expression. 

GnatenkoDV, DujmJJ, Mc Corkle SR, Weissmann D , Perrotta P L, BahouWF. 

Sj^S ProPam ' n ° CnetiCS ' StatC U ™™"y of New York, Stony 

Human platelets are anucleate blood cells that retain cytoplasmic mRNA and maintain 
funcuonally mtact protein translational capabilities. We have adapted compleZSrt 
techniques of m.croan-ay and serial analysis of gen e expression (SAGE) for genetic 
proGlmg of^gWy punfied human blood platelets. Microarray analysis using the 
™, >?i^ AV2 a PP r0Ximate, y 12 600-probeset maximally rdentiffed the 
220/^ lect i <™«J.. 13 *-' J*) Platelet-expressed transcripts, with approximately 
22 /. collectively mvolved .n metabolism and receptor/signaling, and an 
^presentation of genes with unassigned function (32%). Ln contrast, a nidified 

laZ P 22 h^T/ " S u restnCti0 » e -3' me (generating 21 -base pair 

[bp] or 22-bp lags) demonstrated that 89% of tags represented mitochondrial (mt) 
tonsenpts (ennched in 16S and 12S nbosomai RNAs), presumably related to persistent 
mt-fransenpuon in me absence of nudear-derived transcripts. The frequency of non-mt 

S ? F average difference values (relat,ve ex P ression ) *> r 

abundant transenpts as determined by microarray analysis, establishing the 
concordance of both techniques for platelet profiling. Quantitative reverse transcnp.ion- 
polymerase cha.n reaction (PCR) confirmed the highest frequency of mt-derived 
uraxsenpts, along with the mRNAs for neurogranin (NGN, a protein kinase C substrate) 
and the complement lysis inhibitor clusterin among the top 5 most abundant transcripts 
For confirmatory characterization, immunoblots and flow cytometric analyses were 
performed, estaW.shmg abundant cell-surface expression of clusterin and intracellular 
express.on of NGN. These observations demonstrate a strong correlation between high 
transcript abundance and protein expression, and they establish the validity of transcript 

pathologic platelet (and/or megakaryocyte) functions. 
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Transcript profiling of human platelets using microarray and serial analysis 
of gene expression 

Dmitri V. Gnatenko, John J. Dunn, Sean R. McCorkle, David Werssmann, Peter L. PerroMa. and Wadie F. Bahou 



Human platelets are anucleate blood cells 
that retain cytoplasmic mRNA and main- 
tain functionally intact protein transla- 
tions! capabilities. We have adapted 
complementary techniques of microarray 
and serial analysis of gene expression 
(SAGE) for genetic profiling of highly 
purified human blood platelets. Microar- 
ray analysis using the Affymetrix HG- 
U95Av2 approximately 12 600-probe set 
maximally identified the expression, of 
2147 (range, 13%-177.) platelet* expressed 
transcripts, with approximately 22% col- 
lectively involved in metabolism and re- 
ceptor/signaling, and an overrepresenta- 
tion of genes with unassigned function 
(32%). In contrast, a modified SAGE proto- 
col using the Type IIS restriction enzyme 

Introduction " 



M/neJ (generating 21-base pair [bp] or 
22-bp tags) demonstrated that 89% of 
tags represented mitochondrial (ml) tran- 
scripts (enriched in 16S and 12S ribo- 
somal RNAs), presumably related to per- 
sistent mMranscription in the absence of 
nuclear-derived transcripts. The fre- 
quency of norwnt SAGE tags paralleled 
average difference values (relative expres- 
sion) for the most "abundant" transcripts 
as determined by microarray analysis, 
establishing the concordance of both 
techniques for platelet profiling. Quantita- 
tive reverse transcription-polymerase 
chain reaction (PCR) confirmed the high- 
est frequency of mt-dertved transcripts, 
along with the mRNAs for neurogranin 
(NGN, a protein kinase C substrate) and 



the complement lysis inhibitor clusterin 
among the top 5 most abundant tran- 
scripts. For confirmatory characteriza- . 
tion, immunoblots and flow cytometric 
analyses were performed, establishing 
abundant cell-surface expression of clus- 
terin and intracellular expression of NGN. 
These observations demonstrate a strong 
correlation between high transcript abun- 
dance and protein expression, and they 
establish the validity of transcript analy- 
sis as a tool for identifying novel platelet 
proteins that may regulate normal and 
pathologic platelet (and/or megakaryo- 
cyte) functions. (Blood. 2003;101:2285-2293) 
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Human blood platelets play critical roles in normal hemostatic 
processes and pathologic conditions such as thrombosis, vascular 
remodeling, inflammation, and wound repair. Generated as cytoplas- 
mic buds from precursor bone ma now megakaryocytes, platelets 
are anucleate and lack nuclear DNA, although they retain 
megakaryocyte-derjved roKNAs. 17 Platelets contain rough endo- 
plasmic reticulum and polyribosomes, and they retain the ability 
for protein biosynthesis from cytoplasmic mRNA. 3 Quiescent 
platelets generally display minimal ifansJaiional activity, although 
newjy formed platelets such as those found in patients with 
immune thrombocytopenic purpura (ITP) synthesize various o-gran- 
1)1 e and membrane glycoproteins (GPs) : including GPJb and 
GPJJb/JIJa (a }Jb p 5 ). Furthermore, stimulation of quiescent platelets 
by agonists such as a-lhrombin increases protein synthesis of 
various platelet proteins, including Bcl-3. 4 Like nucleated ceils, the 
rapid translation of preexisting mRNAs may be regulated by 
iniegrin ligation to extracellular matrices. ■ )n the case of platelets, 
the primary integrin involved in. this process appears to be a, It> p 3 
with cooperative signals mediated by the collagen receptor cr,0, ,. 6 - 7 



Jntegrin-mediated platelet protein synthesis appears to be regulated 
at the level of translation initiation involving the eukaryotic 
initiation factor 4E (cJF4E). Instead of directly influencing elF4E 
activity via postradiational modifications (ie, phosphorylation), 
platelet e)F4E activity best correlates with its spatial redistribution 
to the mRNA-enriched cytoskeleton. 8 Furthermore, because pro- 
tein translation is partially inhibited by the immunosuppressant 
rapamycin, it suggests that adhesion- and/or aggregation- induced 
outsidc-in-signaling function to regulate protein synthesis through 
the mTOR (mammalian target of rapamycin) pathway.*- 8 * 9 

Despite the biologic importance of platelets and their intact 
protein synthetic capabilities, remarkably Tittle is known about 
platelet mRNAs. Younger platelets contain larger amounts of 
mRNA with a greater capacity for protein synthesis, as determined 
by using fluorescent nucleic acid tlyes such as thiazole orangc. ,u 
This assay has been used as a quantitative determinant of younger 
or "'reticulated" platelets (RPs). Indeed increased reticulated plate- 
lets are typically found in patients with conditions associated with 
rapid platelet turnover such as )TP; typically RP percentages in 
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such patients approach J0% to 20% of all platelets, considerably 
higher than in healthy control subjects." Interestingly^ high KPs 
have been associated with enhanced thrombotic risk when identi- 
fied in patients with thrombocytosis, 10 suggesting that quantita- 
tively increased mRNA levels may be associated with the prothrom- 
botic pbenorype. Whether this is related to globally altered gene 
expression profiles or to select changes more evident during 
situations of rapid platelet turnover remains unknown. Certainly, 
technical limitations of this assay limit its utility in defining 
prothrombotic genotypes, 1 °" n and it cannot identify differentially 
expressed genes that may be causally implicated in disordered 
platelet pheno types. 

Toward the goal of defining - the molecular anatomy of the 
platelet genome, we have adapted complementary techniques of 
microarray and serial analysis of gene expression (SAGE) for 
genetic profiling of highly purified human blood platelets. Microar- 
ray technology represents a "closed" profiling strategy Jiroited by 
the target genes imprinted onto gene chips. In contrast, SAGE is an 
"open" architectural system that can be used to identify novel genes 
and to quantify differentially expressed mRNAs. rJ ' r5 The sequence 
of each tag along with its positional location uniquely identifies the 
gene from which it is derived, and differentially expressed genes 
can be identified in a quantitative manner because the tag frequency 
reflects the mRNA level at the time of cellular harvest and analysis. 
By using both technologies, we have identified a number of 
previously uncharacterized genes that appear to be expressed in 
human platelets, while simultaneously establishing the dominant 
frequency of mitochondrial- expressed genomes comprising the 
platelet mRNA pool. These observations provide a panoramic 
overview of the plnielel transcriptome, while additionally provid- 
ing insights into the molecular pathways regulating platelet (and/or 
megakaryocyte) function in norma) and pathologic conditions. 



Materials and methods 

R&aoents and supplies 

Thermus atfuaiitus (Tnq) polymerase was purchased from (Roche, Indianap- 
olis. IN). T'l DNA lipase was purchased fionVlnvjirogen (Carlsbad, CA), 
and resiricrion enzymes were from New England Biobbs (Beverly. MA), 
except for <VtW). which was obtained from the Center for Technology 
Transfer (Gdansk, Poland). Alt oligonucleotides were synthesi2ed on an 
Applied Diosysiems (Foster City, CA)3-channel synthesizer and are listed 
in Table I. Monoclonal antibodies' used for flow cytometric analysis 
included ihe F1TO (fluorescein isorhiocyanate^onjugaied anti-CD*! I 
(ojioPj) immunoglobulin GJ (IgG); Jmmunotech, Miami. FL); phyco- 
eryihrin (PE> conjugated amif lycophorin (lgG2; Bceion Dickinson Pharm- 
ingen. San Diego. CA): and pcrrJimn chlorophyll protein (PERCP)- 
conjmratcd anti-CDJ <■ (IgG I : Breton Dickinson Pharrningcn). 

Platelet isolation, purification, and immunodetection 

All human subjects provijed informed .ronseni for an IRB (Institutional 
Review 13 o:i id)- approved pi o loco I completed in conjunction with the 
General Clinical Research Cenici ai Stony Brook University Hospital. 
Peripheral blood (?0 mil from healthy volunteers drawn into 2 mL of 4% 
sodium riti ate t0.'i% vol- vol final concentration) was used to isolate 
erythrocyte* by difTcrenual crnlrihrjjatron (1S0O&) or to isnhre pure 
leukocytes by ilnt.wh -"radii ni ccnirifugalion as previously tlesci tbed. "' 
Platelets collected (i..oi healthy volunteers by n phoresis were used wiihin 
24 hours of collet t»on AMn addition of 2 mM ED7A (elhylcocdiammeict- 
raaceiic acid), aphcicsisdr- rived platelets (torn a sinele donor were 
centriliiged at Mfcy for IS minutes at ?5°C To minimize leukocyte 
contamination, only the uppei 9.' 10 of the pbrclel-rieh plasma |PRP) was 



used for gel filtration over a BioGel A50M column (1000 mL total volume) 
equilibrated with HBMT (HEPES-buffered modified Tyrodes buffer 10 
mM HEPES (^2-hydtoxyethVlpir«rozine-A''T2-elbonesulfonic acid) pH 
7.4/150 mM NaCI, 2.5 mM KCI. 0.3 mM NaH 2 P0 4 , 12 mM NaHCOj, 
0.2% bovine serum albumen [BSAJ, 0.1% glucose, 2 mM EDTA). 
Gel-filtered platelets (GFPs) were subsequently filtered through a 5-pm 
nonwetting nylon filament filler (BioDesign, Carmel, NY) at 25°C and 
harvested by centrifugation at J500g for 10 minutes at 25°C. Platelets were 
gentry and thoroughly resuspended in JO mL HBMT buffer and incubated 
with 120 >L murine monoclonal anti-CD45 antibody conjugated to 
magnetic microbeads (Miltenyi Biotec, Bergisch Gladbach, Germany) on a 
rotating platform, for 45 minutes at 25°C. Magnetic separation columns 
were used to capture CD45* cells (leukocyte fraction) by positive selection 
(MACS 11; Miltenyi Biolee). Purified platelets were concentrated by 
centrifugation at 1 500g and immediately used for total RNA isolation. 

The efficiency of platelet purification was documented at each step by 
flow cytometry." Briefly, aJiquols containing 2 X 10* platelets were 
incubated with saturating concentrations of FITC- conjugated arifvCD4I, 
PE-conjugated antiglycophorin, and PERCP-conjugatcd anti-CD45 for 15 
minutes in the dark at 25°C, washedwith phosphate-buffered saline (PBS), 
and fixed in PBS71% formalin. Samples were analyzed using a FACScan 
(fluorescence- activated cell sorter scan) flow cytometer (Becton Dickinson) 
using CELLQuesl software designed to quantify the number of CD45 + and 
glycophorin-positive events in the sample (expressed as the number of 
events per J 00 000 CD4J + events). For some experiments, fixed platelets 
werepermeabilized with 0.1%Triton-X/PBS for 30 minutes at 25"C prior 
to the addition of primary antibodies, all as previously described 17 

Platelet protein detection was completed by sodium dodecyl sulfate 
(SDS)-polyacrylamide gel electrophoresis (PAGE) and immunoblol analy- 
sis as previously described, using the species-specific horseradish pcroxidase- 
conjugated secondary antibody and enhanced chemi luminesce nee.'* Anti- 
bodies included the anticluslerin monoclonal antibody (Quidel, Santa Clara, 
CA; 1:1000 primary and I:)0 000 secondary) and the nnlincurogranin 
rabbit polyclonal antibody (Chernicon International, Temecula, CA; J: 1000 
primary and 1:10 000 secondary). 

Molecular analyses and microarray profiling 

Purified, individual cell fractions were resuspended in 1 0 mL Trizol reagent 
(Invitrogcn), transferred into diethylpyrocarbonate (DEPCHreated Corcx 
(Springfield, MA) tubes, and serially purified and precipitated by using 
isopropanol essentially as previously described. ".Total cellular RNA was 
harvested by centrifugation at J2 500g for 20 minutes at 4°C, washed 2 
times with 75% ethanol (JO mLitube), and resuspended in 100 pX 
DEPC- treated water. Platelet mRNA quantitation was performed by using 
fluorescence- based real-time PCR (polymerase chain reaction) technology 
(7aqMari Real-Time PCR; Applied Biosystems, Foster City. CA). Oligonu- 
cleotide primer pairs were generated by using Primer3 software (www- 
genome. wi.mil. edn), designed to generate approximately 200-base pair 
I bp) PCR products at (he same annealing temperature, and are outlined in 
Table I. Purified platelet mRNA (4 p.g) was used for first-strand cDNA 
synthesis using oligo(dT)and Superscript II reverse transcriptase (Invitro- 
gcn). For real-time reverse transcription )TO>PCR analysis, the RT 
reaction was equally divided among primer pairs and used in a 40- cycle 
PCR reaction for each target gene by using ihc following cycle: 9-1 °C for 30 
seconds, 55°C for 30 seconds, 72°C for I minute, and 7TC for io seconds 
(40 cycles total). mRNA levels were quantified by monitoring real-time 
fluoromciric intensity of SYBR gieen 1. Relative -mKNA abundance was 
determined from triplicate assays performed in parallel for each primer pair 
and was calculated by using the comparative rhrcshold cycle number (A-O 
method) as previously described. ?< 

Gem- expression profiles were completed by usrnr the approximately 
l?600-probe set HG-U95Av2 gene chip (A flymen i-V. Santj Clara r CA). 
Total cellular RNA (5 pg) was used 7or cON'A synthesis by using 
Superscript Choice system (Life Technologies. Rockv"t{le ; MD) and an 
olieojdl) primer containing the 77- polymerase recognition sequence 
I Primer Si; Tabic I), followed by cDNA purification using GF,\ spin 
columns. In vilio tianscription was completed in ihe presence of bioiinyl- 
ated ribonucleotides by using a BioArray High Yield RNA Transcript 
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Table 1. Oligonucleotide primers 



Gene and primer 
* direction 



SI 


Oiigo(dT) 


Cassette A 


SAGE 


Cassette B 


. SAGE 


S2 


SAGE . 


S3. 


SAGE 


GP4 


Glycopr olein IIB |FJ 


GP5 


Glycoprotein UB |R| 


GP6 


Glycoprotein MA IF) 


GP7 


Grycoprotein IIIA [RJ 


PAR18 


PAR1(F) 


PAR 19 


PARI [RJ 


SR8 


16S rRNA [F) 


SR9 


16SrRNA|R) 


NADH10 


NADH2JFJ 


NADH11 


NADH2 IR| 


THYM12 


Thymosin 04 f FJ 


1HYM13 


Thymosin B4 (RJ 


CLUS14 


ChJStefin|F) 


CLUS15 


Cluster in [RJ . 


NRG 16 


Neuroorantn |FJ 


NRG 17 


Neurogranin (RJ 




T- cell receptor 


7CR18 


p-charn [FJ 




T-cell receptor 


7CRJ9 


B-chain |RJ 


CD4520 


CD45 |FJ 


CD4521 


CD45 fR) 



Sequence (5' - 3') 



Nucleotide 
Position 



5'- im-GGCCAGTGAATTtrrAATACGACTCACTATTvGGGAGGCGG- (dTi J( 
y ■. 

S TTTGGATTTGCTGGTCGAGTACAACTACGCTTAATCCGACftTG- 3 ' 
3*- •CCTAflAO»COU;CTCATGTTGATCCGAATAAGGCTp-S' 
5 ' - pTTCATGGCGGAGACGTCCG C(^CTAGTGTCGCAA CTGACTA • - 3 ' 
3 ' - MN AA GT AC CGCCTCTG CAGG CGGTGATC AC AGCGTT G A CTG A T - 5' 



' - Bn - GGATTTGCTGGTCGAGTACA - 3 ' 
'-Bn- TAGTC AGGTCCGACACTAGTGGC - 3 ' 
' - AGGGCTTTGAGAGA CT CATCTGTA - 3 ' 
' - ACAATCTTGCTCTTTCGATTCTG- 3 ' 
' - TAT7VAAGAGGCCACGTCTACCTTC' 3 ' 

- C^CTTCCACATACTGACATTCTCC- 3 ' 

- AATGTC AGTTCTGATATGGAAG CA - 3' 

- CCCAAATGTTCAAA C 1 ICrTTAGC- 3 ' 

- TGCAAAGGTAGCATAATCACTTGT- 3 ' . 

- GTTTAGGA CCTGTGGG1 TTGTT AG - 3 ' 

- CTAGCCCCCATCTCAAATCATATAC- 3 ' 
AATC^TTATGTTAGGGTTGTACGG- 3 ' 
AAGACAGAGACGCAAGAGAAAAAT- 3 ' 
G CAGCA CAGTCATTTAAACTTG AT - 3' 
CCAACAGAATTCATACGAGAAGG- 3' 
CCTTATATTTCCTCCTCAACCTCT- 3 ' 
GCCCTTTTAGTTAGTTCTGCAGTC - 3 * 
TTTTCTTTAACTGAGTGTGCTTGG - 3 " 



S ' - CCACAA CTATGTTTTGGTATCGT- 3 ' 

5 ' - CTAGCACTGCAGATGTAGAAGCT- 3* 
5 ' - GCTCAGAATGGACAAGTA - 3 ' 
5 ' - CACACC CA TACACA CATACA - 3 ' 



2094^2117 
.2301-2279 
2335,2358 
2532-2509 

2585- 2608 
2776-2753 

2586- 2609 
2785-2762 
4B75-4B98 
5075-5052 

135-158 
336-313 • 
1006-1028 
1222-1199 
1351-1374 
1567-1544 



332-310 
3771-3788 
4280-4261 



|F| indicates forward (sense) sliand; |RJ. reverse (antisense) strand; Bn. blotin; p. a phosphorylaled 5' end (cassettes A and 8); 
arrews. corresponding sequence for S2 and S3 within cassettes A and B. respectively; bold, the Wmel site: and N. A. C. T. oi G, 
accession numbers: glycoprotein )IB (J02764). grycoprotein IDA (M3S999). PARI (M62424). 16S rRNA and NADH2 (NC 001807) 
neurogr3nin(X99076).TCRp-chain(AF043182).CD45{Y00638). ~ 

"Indicates an amino- modified 3' end in both cassettes; — . not applicable. 



underlining. Nlbitt sties in cassettes A and B; 
nucleotide position based on the following 
thymosin B4 (Ml 7 733). cluster in (M25915). 



Labeling Kit (Enzo Diagnostics, Farmingdale, NY), and. after meial- 
inchical fragmentation, )5 u.g biormylaied eRNA was hybridized to the 
HG-U95Av2 oligonucleotide probe array for 16 hours at 45°C. After 
washing, the cRjNA was delected with streptavidi>i-phycc*7ylhrin {Molecu- 
lar Probes, Eugene, OR) and analysis was completed by using a Hewlett- 
Packard Gene Array Scanner (Affyrocirix). The average di (Terence value 
(AD) for each probe set was quantified using MAS 4.0) software 
I Affy metric), calculated as an average of fluorescence differences for 
perfecily matched versus singlc-nucleotide mismatched 25-mer oligonucle- 
otides (16 to 20 oligonucleotide pairs per probe set). The software is 
designed to exclude "positive calls" in the presence of high average 
differences with associated high mismatch intensities. 

SAGE profiles 

Plate lei SACF. libraries were general ed essentially as previously de- 
scribed. 1 ' modified as otH lined in Figuie I for the use of Mme\ as the 
trigging enzyme." This type US restriction enzyme cleaves 20 of 18 hp past 
its imnpalindtomic (TCCRAC) recognition sequence, thereby generahng 
longer tags (71- or 22-mcr) than those obtained using B*mf\ as the standard 
t:>ggin- en/ytnc (13-14 bp tags), These longer ,t/me l-genciatcd tags 
potentially provide for more definitive "tag-io-gene" identification and are 
particularly useful in characterizing expiession pnums in the absence of 
complete genomic sequence data (comprehensive me i hods detailed in Dunn 
ct 3l l? ). Briefly. poly(A) rnRNA was isolated from 10 ng fool platelet RNA 
using (he ofpgo-tdT) Sf primer conjirgaied to magnetic beads (DynaJ 
Bioicch. Lake Success. NY), followed by cDNA synthesis using Supei- 
Script IJ K-vetic transcriptase Mnviuogen). The eDNA was ihrn digested- 



with the restriction enzyme Nla 111 (anchoring enzyme), ligatcd to cassette A 
using T4 DNA ligase, and. a/ler the beads were extensively washed, the 
cDNA was digested with Mmc\ to release the tags from the beads. After 
purification, tags were ligated to degenerate cassette B linkers (specifically 




f'tQui* 1. Schema outlining the modified SAGF piolocol used in plaletfl 
analyses. Ihe hnat lays ate flanked by the AtoMI (rtneboting enzyme) CA1G 
sequence, thereby providing tao-io gene identification when exported to a relational 
database (refer to "Biointormaiic analyses' and lat-le l loi details). 
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designed to anneal to the nonuniform Mmel overhangs), and PCR-amplified 
using biolinylated primers S2 and S3 for 30 cycles (95°C for 30 seconds; 
58°C for 30 seconds; 72X for 30 seconds) using Platinum Taq DNA 
polymerase (Gibco BRJL). A fraction (20%) of the pooled PCR products 
were then subjected to one round of linear amplification using primer pair 
S2/S3, followed by a second round of 25 amplifications using primer S2 
alone (95°C for 30 seconds.. 5Jt°C for 30 seconds, 72°C for 30 seconds). 
Primer S3 was subsequently added for one cycle (95*C for 2.5 minutes, 
58°C for 30 seconds, 72°C for 5 minutes): the latter steps were collectively 
adapted to exclude bcleroduplex formation.'" Unincorporated primers were 
removed by incubation with 200 U Escherichia coli exonuclease I for 60 
minutes at 37°C. PCR products were then pooled ami digested with AVoIU 
to release tags, and biotinylated linker arms were cleared using streptavidin- 
coaled immunoaffiniiy magnetic beads (Dynal Biotech). Tags were coo- 
. catamerized using 5 U/pX T4 DNA Jigase, and products more than 100 bp 
were isolated by size- fractionation in low-melting agarose gels. The DNA 
was purified by GFX spin columns, and the concatamers were cloned 'into 
the ^pAI site of pZero (Invitrogen). A tier transformation into E coli TOP) 0 
cells, recombinant clones were isolated and sequenced in 96-weII microliter 
plates using, an ABJ 377 sequencer and ABI Prism BrgDye terminator 
chemistry (Perkin-Elmer Applied Biosystems. Branchburg, NT). 

Bioinformatic analyses 

Functional grouping of genes determined to be present by AfTymetrix MAS 
4.01 software was performed using a dChip program linked to the National 
Center for Biotechnology LocusLink, which is an annotated reference 
database for genes and their postulated functions. 70 Of the approximately 
12 600-probe sets represented on the AfTymetrix HG-U95Av2 Gene chip/ 
functional annotations exist for approximately 8100 with the remainder 
categorized as unknown. Microarray data were visuafoed an d analyzed 
using BRB-ArrayTools software (Version 2.1), kindly developed and 
provided by Dr Richard Simon and Amy Peng (limis.nci.nih.gov/BRB- 
ArTayTools.ht)nI)- A logarithmic (base 2) transformation was applied to the 
. average difference values for individual data sets for determination" of 
microarray concords ncies. Discordancy was defined as a 2-Iog difference in 
the maximum log intensities between individual experiments. 

SAGE tags were extracted by using in-house.SAGE software uniquely 
modified to identify M mK \ lags. The software ensures that only unambigu- 
ous 21- to 22-bp tag sequences are extracted for transcript profiling. Tags 
with ambiguities (Ns), lengths other than 21 or 22 bp p or with ambiguous 
orientations were extracted to separate files for manual editing or further 
examination. Finalized data were exported to a regional database for lag 
quantification and genetic identification/ 0 




PRP 



Figure 2. Determination of platelet purity. (A) Total cellubr RNA (1.6 ^g) from 
platelet-rich plasma (PRP) or purified platelets from a single apheresis donor wef e 
anaryzed by RT-PCR (35 cycles) using ofigonucteo&te primers specific for glycopro- 
tein )tb (GPlIb). T-ceB receptor p-chain (T CRp), or CD45; 10 uL of the 50 reactions 
were analyzed by ethioWstained agarose get eleclrophoiesfe. Minimal to no 7 CRp 
gene product was visual evident only En PRP. Size markers corresponding to 
Haelll-restricled *X174 DNA are showa (B) ReaJ-lime RT-PCR was completed by 
using 1.8 M9 total RNA and TCRp-specific oKoonocteolkte primers optimized tor 
quantitative analysis by reat-timo PCR," On the basis of parcel determinations 
using RNA isolated from Known amounts of purified leukocyte standards, the 
leukocyte-depletion protocol represents an approximate 2.5-tog purification from the 
starting PRP. Results are representative of on© complete set of experiments repeated 
on 2 separate occasions, and data points represent tho mean from triplicate wefts 
with standard errors of the mean fSEM) less than 1% [not shown) 



2). The platelet recovery was nearly 65% of the starting material, 
yielding approximately 2.3 X J0» platelets from an initial aphere- 
sis pack conlaining approximately 3.6 X 10" platelets. The bulk of 
the losses occurred during the initial cenlri rogation and filtration 
steps. The purification protocol was less effective at removing 
eryihrocy tes, although there were less than 50 glycophorin- positive 
cells per 1 X 1 0 5 pi ate lets aflcr the final purification step. Nonethe- 
less, these cells represent unlikely sources for contaminating 
cellular RNA (see "Cellular microarray analysis" below). 



Cellular microarray analysis 



Results 

Platelet purification 

To ensure that the RNA profiles accurately represented those of 
circulating blood platelets, a number of complementary methods 
were implemented lo remove contaminating nucleated leukocytes. 
Purification methods incorporator- gel filtration, a 5-,im leukocyte 
reduction filler, and magnetic OX* nmm.nodeplelion allowed "for 
the cumulative enrichment of highly purified platelets. The efficacy 
' of this purification method was initially established by using 
peripheral blood platelet-rich plasma as the starting material. The 
final product contained no more than 3 to 5 leukocytes per I X I0 5 
platelets as detci mined In p: n,tfel flow cytometric analysis, 
representing an approximate -iMJ-fohl reduction of nucleated 
leukocytes. These results eonelaied well with molecular evidence 
lor leukocyte depletion as determined by RT-PCR using both CD45 
nnd T-cell receptor fi-eham flCRf}) primers (sec Figure 2). 
Because the iol.nl RNA yield hom peiipheial blood platelets was 
insufficient for mieroanay studies, wc adapted the protocol to 
platelet apheresis doners wi,h nearly identical final purity (Figure 



The purified platelet RNA was sufficient for microarray studies and 
was used for cRNA generation and hybridization to the AfTymetrix 
HG-U95Av2 OeneChip. The anatomic profile of plalelet RNAs 
from 3 healthy male donors was determined by using AfTymetrix 
software. Of the 12 599 probe sets imprinted onto the chi p> a 
maximum of 2147 (17%) transcripts were computationally identi- 
fied as "present" by the AfTymetrix software, 152 (1.2%) were 
equivocal, and nearly 82% were absent. As a fraction of the total 
genes present on the chip, the percentage of platelet- ex pressed 
genes (J5%-I7%) was generally lower than that obtained from 
other hrtman cell types in which 30% to 50% of genes are present as 
determined by AfTymetrix software (J. Schwedes, personal commu- 
nication, May 2002). The "limited number" of platelet-expressed 
transcripts presumably reflects the lack of ongoing gene transcrip- 
tion in the anuclcate platelet. Because less than 1% of circulating 
red blood cells contain residual RNA, it is unlikely that any of these 
transcripts are ery throcyte derived, although this was formally 
addressed by isolating total cellular RNA from 20 mL of .whole 
blood (corresponding to an ~3-log fold excess of erythrocytes than 
that identified in our final sample). The total ceiluJar yield of RNA 
from this starting material was approximately 250 ng ; suggesting 
that less than I m>. eryi hi cxiyie- derived RNA was present in the 
purified platelet preparations. Despite this, however, both o- and 
(3-globin transcripts— alone with both the ferritin heavy and light 
chains— w C r«* identified as abundant transcripts (Table 2). Al- 
though the most parsimonious explanation would be ircioW contami- 
nating reticulocytes, ibis is not supported by our erythrocyte com ami- 
notion estimates, and their significance remains unresolved. 

As a means of better dissecting the molecular anatomy of tfic 
platelet, expressed genes were grouped on the basis of assigned 
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Table 2. Top 50 human platelet-expressed genes ' 



Accession no. 


Gene symbol . 


AD values, range* 


Gone transcript} 


Leukocyte expression} 


M 17733 


TMSB4X 


140142-307 852 


Thymosin 64 mRNA,. complete cds 


+ 


X99076 


NRGN 


101510-148279 


Neurogranin gene 


+ 


M25079 


HBB 


40 839-229 556 


B-gbbio mRNA. complete cds 


4 


M25915 


CLU 


84 720-140 246 


Complement cytolysis inhibitor fclusterin) complete cds 




J04755 


FTHP1 


82 980-148 621 


Ferritin H processed pseudogene, complete cds 




D78361 


OA21 


73 098-118140 


mRNA (or ornithine decarboxylase anlizyme 




X04409 


GNAS 


77 761-94 78t 


mRNA Tor coupling prole** G(s) o-subunit (alpha-Si) 




, M25897 


PF4 


6281M26 908 


Plalelet factor 4 mRNA, complete cds 




AB0212B8 


B2M 


• 61689-108 921 


P 2- microglobulin 


• + - 


X00351 


ACTB 


25 143-73 775 


mRNA lor p-aclin 




D2I261 


TAGLN2 


76 687-101 931 


in*\r*r\ r\i/v\u gene 


■f 


ALO31670 


FTLL1 




Ferritin, light polypeplide 1 


■ ' + 


■ U59632 


GPHB 


At a r\A- 1 1 n Qon 
** i huh- i iu j£ty 


Platelet glycoprotein IbB chain mRNA 


— 


M21121 


CCL5 


47 308-106 399 


T- cell- specific protein (RANTES) mRNA, complete cds 




• X13710 


GPX1 


41 31B-96 878 


IMspficed mRNA lor glutathione peroxidase 




J00153 


HBA1 




Alpha gtobtn gene cluster on chromosome 1 6 


+ 


M22919 


MYL6 


AC I^T '-IfH? ml 

4 b 33/ - iUd 833 


Nonrmisdesternoolh muscle alkali myosin light chain gene 




L20941 


FTH1 


52 787-74 763 


Ferritin heavy chain mRNA. complete cds 




J 03 040 


SPARC 


51 156-74 261 


SPARCVosteonectfn mRNA, complete cds 


- 


- Vienna 


GNAS 


45 543-72 096 


GSA mRNA lor a subunit ol GsGTP binding protein ' 




X58536 


H 


31 1B3-82 613 


mRNA tor major HLA class 1 locus C heavy chain 


+ 


M54995 


PPBP 


46 571-67 169 - 


Connective tissue activation peptide lit mRNA 


- 


U34995 


GAPD 


35 095-70 250 


Normal fceraiinocyte subslr action library mRNA clone H22a 




L40399 


MLM3 


32 107-73 364 


Clone zapl 12 (mutL protein homolog 3) mRNA 






NCOA4 


■ 31 452-61 036 


cDNA tor RFG (RETprolo-oncogene RET/P7C3) 




U90551 


H2AFL 


■ 35 086-51 892 


Hislone 2A-fike protein (H2A/I) mRNA 




Ml 1353 


H3F3A 


31 614-55 813 


H3.3 hislone class C mRNA 




212962 


RPL41 


36 003-54 853 


reRNA lor homologue to yeast rfbosoma! protein L41 




X 06956 


7UBA1 


20.988-61 798 


HALPJ-IA 44 gene lor o-tuboRn 




AB02895O 




24 571-58 611 


mRNA for KIAA 1027 protein 


- 


Y1271 1 


PGRMCl 


33 680-43 1 74 


niRNA lor putative progesterone binding protein 




MI6279 




30 894-48 166 


Integrated membrane protein (MIC2) mRNA 




D76577 


YWMAH 


. .24 785-50 437 


Drain 14-3-3 protein p-chain 




. AF 070585 


70P3B 


20 027-67 945 


Clone 24675, unknown cDNA 




AA5?4802 


Unknown 


23 846-39 461 


CDNA. IMAGE clone 954213 




AD009010 


UBC 


28 745-38 389 


mRNA lor polyubiquitin UbC 


-t 


X57985 


H2AFO 


21 678-52 108 


Genes lor hfetones H2B.1 and H2A 




X54304 


MIXB 


25 733-34 109 


mRNA lor myosin legutatory fight chain 




M 14539 


F13A1 


23 691-48 474 


Factor XH1 subunit n polypeptide mRNA. 3' end 




A1540958 


Unknown 


24 872-11 110 


cDNA, PEC 1.?_ !5_HOI.r 5' end/cton 




AL05039& 


FLNA 


13 634-55 235 


cDNA DKF2p 586K1720 




X56841 


HLA-E 


12 890-49 327 


Nonciassical MHC class 1 antigen gene 




M2625? 


PKM? 


15450-47 786 


TCB (cytosolic thyroid hormone- binding protein) 




M 14630 


PIMA 


19 314^15 088- 


Piothymosin alphamRNA 




AF045??9 


RGStO 


19156-34 243 


Regulator ol G piotcin signaling 10 mRNA 




AA477898 


Unknown 


16863-44 756 


cDNA. 2S34108.rl5' end 




X95404 


FL1 


15 216-37 456 


mRNA foi nonmuscle type cofilin 




M34480 


n GA28 


8 627-45 4Q5 


Platelet glycoprotein lib (GPHb) mRNA 




283738 




18 001-31 306 


HH2B/e gene 




1 19779 


H2AFO 


17 319-38 951 


Histone H2A.2 mRNA. complete cds 





«e {AD} value (marched versus mismatched otigonucleolrdes) lor each piobe set using Attymelrn 

micioannys is shown: the normalization vntue to 3ll micoanay analyses was 250. 

software by log intensities ol AD values obtained Iron 3 different healthy donors; 33 ot the top 40 



wt- i^iiummiii ifurt.iiiiH^jDLHib were carcuiaieu as me average drllerei 
Ge'reChip soitwate. version 4.01. the range ol values from 3 distinct piaiele 

pranscripts are r;+nk- ordered (highest to lowest) using BRB-Anayloois 
tronscripls wert* listed among Ihe top 50 in all 3 mic/oai ray sets, ' 

H euKocyte e> pressiop was uetennined by microarray analysis using pu.if.ed peripheral blood leukocytes, followed by construction ol rank- intensity plots lor comparison to 
platelet .op SO ftansenpis "lop leukocyte- derived t,onscrip,s identified will*, the ranked top Mphtfelei transcripts are depicted by a | * ) present or (- ) absent 

cds mdic^tes cod»ng seguencc. 



gene nnnott lions, and this analysis was used to provide 3 pan 
oiainic dehniiion ol" ihe platelet transcriptome.*Ot" the gems ib.it 
could be cataloged within resigned "cliiMers." those involved in 
metabolism (iJ%) .nnd teccptor/signalint; (1J%) icprcscnicd the 
hrprsi groups. Also evident in these analyses is ibe icbltvelv buije 
peiccotDpe ol u*enrs involved in funciions unrelated to these key 
groups lie. miscellaneous, 25%).. and the oven e present a lion of 
eenes with unknown function (3?%) as annotated bv /MTymeiiix 



atnl RcfSeq dalabases. ?l These results identify a vasi array (neajly 
one half) of" plaielct genes (and gene producis) that presumably 
have important but pootly charade; i7ed funciions. in plalelel 
and-'ot megakaryocyte biology. 

Although micioanay analysis is nol imly (juantiiaiive. tank- 
oidering using the mean log-intensilies bom 3 independent microar- 
ray analyses allowed fot the categorization of ibe lop platelei 
nanscripis (Table 2). Computational analyses demonstrated thai 
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only J 0 of the top 1 00 genes weie discordant among the 3 pbteiet 
microarrays, although 71 of J 00 genes were discordant between 
platelet and leukocyte arrays. An inventory of the top 50 platelet 
genes is listed in Table 2, which also delineates those found to be 
highly expressed in peripheral blood leukocytes by parallel microar- 
ray experiments with this purified cellular fraction (data not 
shown). Further analysis of these cell subsets demonstrated thai 
approximately 25% (n = 547) of the total platelet transcripts were 
platelet restricted. Furthermore, only 10 of the 50 most highly 
expressed genes were found to overlap,, confirming the distinct 
cellular profiles of each transcriptome. Of the 12 overlap genes, 3 
corresponded to globin or ferritin chains (again suggesting the 
presence of contaminating reticulocytes in both purified fractions), 
and another 4 were involved in aclin cytoskeletal reorganization 
and human leukocyte antigen (HLA) expression, gene products that 
regulate critical functions in both cell types. Given the importance 
of cytoskeletal reorganization in downstream platelet activation 
events, it is not unexpected that components of the actin machinery 
system would demonstrate prominent transcript expression. Previ- 
ous estimates suggest that 20% to 30% of the total platelet 
pTOleome is comprised of actin with other components such as 
actin- binding protein, mysosin, and talin accounting for an addi- 
tional 2% to 5% of the total protein.'- 22 The mRNAs encoding the 
actin-related machinery are overrepresented in our mieroarray 
analysis, with 8 such transcripts found among the 50 highest 
platelet-expressed genes. Interestingly thymosin 04 demonstrated 
the highest expression pattern. In unstimulated platelets, 30% to 
40% of actin is polymerized as F- actin, 22 whereas the balance of 
actin monomers (G-actin) are polymerization inhibited by seques- 
tering proteins such as profilin (100 pM) and thymosin 04 (600 
u-M). 7 - 1 The high thymosin 04 transcript expression not only 
correlates with its known abundance in platelets but also supports 
the importance of actin inhibitory proteins in maintaining the 
nonstimuJated state of circulating platelets. 
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Figure 3. Schema of the mitochondrial genome with SAGE lag distributions 
(only lags with identical matches are displayed). The abundance of ihe SAGE 
tags [n = 1B00) at individual Mailt sites (arrows) within the mitottondrial fteavy 
strand is shown on Ihe bottom, whereas thoso tags conespdnding to tho mitochon- 
drial light strand are delineated above the arrows (the presence of an unaccompanied 
arrow implies no SAGE tags at that Mtolll site). The gene products ol mt-DNA 
(RelSeq accession no. NC_00t807) are cteRneated by the open rectangles, whereas 
stippled boxes represent tRNA genes and control regions (the single lag represented 
by, Ihe ("J refers to mitochondrial transfer RNA-serho). Note thai MADH6 is encoded 
by the tight strand and that there are no Nfelll sites within Ihe ATPaseB gene segment 
COfnJ. cytochrome c oxidase subunit; Cyt b, cytochrome b. 



Platelet SAGE analyses 

Although these initial studies identified the distribution and relative 
expression patterns of the genes within the A/fymetrix data set, 
they do not allow for analyses of genes that arc unrepresented by 
these oligonucleotide chips. Unlike closed microarray profiling 
strategies, SAGE is an open architectural system that is ideally 
suited for novel gene and pathway identification. Accordingly, the 
platelet RNA used for microarray studies was used for platelet 
SAGE. A total of 2033 lags were initially cataloged, of which J 800 
(89%) corresponded to mitochondrinl-derived genes. These results 
were quite different from those obtained by microruray analyses, 
but ihe discrepancy can be resolved by the nomepresenlnlion of the 
mitochondrial genome on the gene chip. The mitochondrial ge- 
nome is a compact approximately I6.6-kilobase (kb) sequence 
encoding 13 genes and 2 ribosomal subunits/* Primary mitochon- 
drial imnscripts are polycislronie and typically contain premature 
termination or unpredictable splice site*, resulting in multiple 
polvadenylated transcripts from individual genes. ^ Indeed, iht- 
overall distribution of platelet- derived mitochondrial SAGE tags is 
nunc .sjmiiru to ih;,t found in muscle. 7 - 1 All 13 genes containing 
AV.iJIJ sites were detected, whereas neither on he non-AV/jllf 
coniaining genes were identified (nicotinamide adenine dinucJco- 
hdc | NA 01 1 J dehydrogenase subunit 41 ;mU adenosine triphos- 
■ phalasc |ATPaseJ S). Most of ihe lags were from ihe I6S and 12S 
ribosomal RNAs— which collectively accounted for 68% of the 
total mitochondrial lags— with ihe fewest lags represented by 
NAOlt dehydrogenase subnniis 3. 5. o ; and cytochrome c oxidase I 



(Figure 3). The NADH dehydrogenase subunit 6 RNA is the only 
mRNA encoded by ihe light (L) strand of mitochondrial DNA and 
was Ihe least abundantly detected transcript. 

The . unusually high preponderance of mitochondrial- derived 
genes is not inconsistent with the known enrichment of these 
genomes in human platelets,'- 3 * and presumably reflects persistent 
transcription from Ihe mitochondrial (mt) genome in the absence of 
nuclear-derived transcripts. This overrepresentation of mtDNA in 
platelets is considerably greater than that of its closest cell type 
(skeletal muscle), in which mt genomes represent approximately 
20% to 25% of all SAGE tags. 25 Interestingly, the energy metabo- 
lism of platelets is not dissimilar from that of skeletal muscle, both 
cell types actively using glycolysis and large amounts of glycogen 
for ATP generation.^ Like muscle, platelets are metabolically 
adapted to rapidly expend large amounts of energy required for 
aggregation, granule release, and clot retraction. Similar to the 
situation in all eukaryotic cells, platelet mitochondria represent the 
primary source of ATP, which is generated from oxidative phosphor- 
ylation reactions occurring wilhin these organelles. Mitochondria 
are also responsible for most of the toxic reactive oxygen species 
generated as by-products of oxidative phosphorylation and are 
central regulators of Ihe apoptotic process in other cellular types. 
The mlDNA encodes polypeptides found within 4 of the 5 
multifunctionaJ complexes that regulate oxidative phosphorylation 
within the platelet mitochondria." Whether the continued genera- 
tion of these polypeptides has a role in platelet energy metabolism 
ami/or the. apoptotic mechanisms regulating platelet survival re- 
mains speculative, all hough not inconsistent with our observations. 

Comparative analysis of SAGE and microarray 
transcript abundance 

Complete SAGE libraries require the sequencing of up to 30 000 
tags for an exhaustive cataloging of individual mRNAs, especially 
those with limited copy numbers. 0 --' Grven the preponderance of 
mi-derived iianscripis. comparable sampling would have required 
sequence analysis -of nearly 300 000 SAGE lags, an inordinate 
number for comprehensive analysis of the pbtelei iranscriplome. 
f or platelets, alternative methodologies .incorporating subtractive 
SAGE .wil| be required for more comprehensive transcript profil- 
ing. Our initial sampling of mmmitochomlrinj genes remains 
informative, however, and entirely consistent with, the results of 
platelet microarray s indies. As shown in Table 3„ SAGE lags for the 
genes encoding thymosin p-1 ; |J2- microglobulin, neurogranin, and 
the platelet glycoprotein Ibft polypeptide were among the most 
frequently i den li lied platelet genes, similar lo the rank-ordered 
results deiet mined by mjcroanay analysis. To formally confirm the 
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Table 3. SAGE-idcntified nonmitochondrial tags 


Frequency 


CATG -t SAGE lags' 


Accession no.t 


Gene 


Microarray? 


26 


GTTG7 GGTTAA7 CT GGT 


NM_ 004048.1 


p2-microglobuftn (B2M), mRNA 


PPP 


21 


7TGG7GAAGGAAGAAGT 


NM_021 109.1 


Thymosin fj4 ; X chromosome (TMSB4X). mRNA 


P 


8 


AGCICCGCAGCCAGGTC 


NM.002620.1 


Platelet factor 4 variant 1 (PF4VI), mRNA - 


f> 


8 


AGCTCCGCAGCCGGGTT 


NM_002619.1 


Platotei tectw 4 (PF4), mRNA 


P 


7 


TGTA7AAAGACAACCTC 


NM_0O2704.1 


Proplatetet basic protein (pvthrombogJobuKn) 


Pp 


5 


GGGCACAATGCGGTCCA 


NM_OO0407.» 


GVcoprolein lbp polypeptide. mRNA 


P 


3 


AGGTAA7AAAAGGTAAT 


NM_0035t2.1 


H2A Wslone lamiry, member L {H2AFL). mRNA 


P 


3 


AG7 GGCAAG7 AAA7GGC 


NM_02t914.2 


Cofilin 2 (muscle) {CFL2). mRNA 


N/A . 


3 


7 GAC7G7GC7GGGT7 GG 


NM_006176.1 


Neurogranin (protein kinase C substrate, RC3) mRNA 


P 


3 


7TGGGG7T7CCTTTACC 


NM_002032.1 . 


. Ferritin, heavy polypeptide 1 (FTH1), mRNA 


P 


2 


CCCT7G7GAC7ACCTA7 


NM_02515B.1 


Hypothetical protein FU22251 (FLJ22251). mRNA 


N/A 


2 


CC7G7AACCCCAGCTAC 


NM_032779.T ' 


Hypothetical protein FU14397 (FLJ14397), mRNA 


N/A 


2 


CT7GTAG7 CCCAGCTAC 


NM.017962.1 


Hypothetical protein FLJ20825 (FU20825), mRNA 


. N/A 



Unique tags identified more than once. .' 

t Refers to the RetSeq accession no.*' Note J hot ihb number does not necessarily correspond to the accession no, provided by Alrymelrw'soRware annotations (Table 1). 

^Presence (P) or absence (A) is based on results from 3 distinct platelet microanay experiments. Capitalized *P" designates a gene that is in the top 50 on all 3 microarray 
experiments, whereas small *p" designates those banscripls not in the top 50. Two of the genes (p2-micfogk>buBn and fMhromboQlobti£n).are represented by 3 and 2 probe 
sets, respectively, Dn the HG-U95Av2 gene chip; lor 02-M. all 3 probe sets were in the top 50 genes, whereas lor thymosin (J4 1 of 2 was in the top 50 lor ad experiments (the 
other probe set was in rbe top 75 tor alt experiments). N/A indicates oligonucleotide not present onAHymetrix HG-U95Av2 gene chip. 



results independently obtained by SAGE and microanay analysis, 
quantitative RT-PCR was completed by using oligonucleotide 
primers specific for 2 abundant mitochondrial transcripts, )6S 
iRNA and NADH2 thymosin p4 (high- abundance by microanay 
and SAGE), 2 incompletely characterized high-abundance tran- 
scripts (neurogranin and clusterin; see "Protein immunoanarysis of 
platelet clusicrin and ncurocranin"), a low-abundant transcript 
(T-celJ receptor polypeptide), and the genes encoding proteins 
with well-established quantitative determinations (ie, glycoprotein 
a iinPs (~~50 000 receptors/platelet]; protease- activated receptor ) 
(PAR I) |-1800 receptors/platelet]).' As shown in Figure 4, these 
analyses reveal excellent concordance between SAGE and microar- 
ray studies, demonstrating the predominant frequency of the 
mitochondrinl-derived 16S rRNA/NADH2 transcripts, with incre- 
mentally lower expiession of other transcripts as initially demon- 
strated by microarray (16S > NADII2 > thymosin f}4 > neurogra- 
nin > clusterin > n lto fo > PAR 1 > TCRfJ)- 

Givcn the small number of nonmilochondrial SAGE lags 
available for analysis (n = 233), limited conclusions can be drawn 
using traditional (nonsubtraction) platelet SAGE libraries as prc- 




" Figure 4. Quantitative rc.tllimp R1-PCR analysis ot platelet transcripts. Real- 
lime RTPCR completed by us inn. puntreit platelet RNA and oligonucleotide 
piimet po«s specifically designed us»iy Pitmei3 software to generate simitady-sized 
(-?00-bp) PCR products, optimtrcd to the come annealing temperature. In graph. 
(□) represents ttb. (O) rr-pi eton is lib. (^presents PAR l, (X) represents 1&S 
»RNA. ) »epre*enls NAUM7. (▼) »e p»eienii thymosin, (O) represents clusterin, (♦} 
represents nemogtanitt. onO (•) pppitLcmj. lCR[i Curves are representative ol one 
complete set ol e>pe*' fT1 * | repeated iwir.e). and line plots reflect average 
deleiminalirmj hum 3 v.r!!j. t-wtnmifcf in pmyUel with SEM less than 1% l or all 
drita points. 



sented here. Overall, a total of 126 unique tags were identified, the 
majority of which (94) were represented only once. Of the tolaJ 
unique tags, nearly one half represented novel genes not present on 
the AfTymetrix U95Av2 GeneChip. Of the genes with unique lags 
identified more than once, there was excellent concordance with 
microarray expression analysis, with nearly all of the SAGE tags in 
Table 3 corresponding to platelet top 75 microarray iranscripts. The 
pi ale let factor (PF) A variant represents a single aberration because 
this was rank-ordered approximately 350 by microarray, although, 
its SAGE tag frequency was identical to that of the predominant 
PF4 transcript. The lack of extensive nonmilochondrial SAGE 
sampling precludes any further extrapolations from this apparent 
aberration. Of note, a subset of these tags had long poly(A) tracts; 
although they all corresponded to genes identified in the RefScq 
database/' We cannot exclude the possibility of a SAGE artifact 
for this small subset of, tags (~2%, representing 46 of 2033 tags), 
although the authenticity of the vast majority of tags (—98%) 
clearly validates the methodology. These tags are most likely 
explained by the unique biology of the platelet (ie, mRNA decay in 
the absence of de novo transcription) or lo mRNA degradation 
occurring during the extensive purification methods. In summary., 
even with a remaikably limited sampling, the power of this 
approach in gene identification of relatively abundant and less 
abundant iranscripts is evident. It is clear, however, given the 
unique molecular anatomy of the platelet (ie. abundance of 
mitochondrial transcripts), (bat SAGE adaptations will be required 
for more comprehensive genetic profiling. 39 

Protein imm»irioan3lysis ol platelet cluslerin and newogranin 

Although most of the "most abundant" transcripts would conform 
to n priori predictions for platelet-expressed iiiRNAs, a number of 
transcripts were identified that had been poorly characterized in 
human platelets. To further establish the authenticity of highly 
expressed iranscripts .such as clusterin and neurogranin, confirma- 
tory protein analvses were completed. As shown in Figure 5, both 
proteins were clearly delected in purified platelet lysates: fun her* 
more, their cellular platelet distributions conformed to those 
predicted based on previously proposed functions. Noic for ex- 
ample that eluMerin — functionally characterized as a complement 
lysis inhibitor able to block the terminal complement icnction — is 
primarily expressed on the extracellular platelet niembiane. J " 
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Figure 5. tmmurwanarysis of ptetotet neurogranin and 
clusterin. (AO) GeMi&ered pteteleta wore e&hw feted in 
3,7% forrnatfehycte (nonpefrneabifeed) or fixed with perme- 
abtSzaSon in Iho presence of 0.1% TritorvX; toDCwed by Uow 
cytometric anafysis using ontkiustwin. an^fib/lIJa, or an6r>tn> 
rogranin antibodies and (b» FITC^onftigared species- 
specific secondary antibody (in ; C the F^^C^»niu9aU>d 
an&rabbtt and antimouse controls are'essonfofly aperim- 
posett). (E-F) Ten rmexograrns of sdub&ed MepG2 ccBs 
(nepatocyte ceB fine), human brain, a pujfied pbtotet tysates 
were anafyzed by SDS^AGE,™ end irnrmjncUot analysis 
were compteted by using 1:1000 dUi^oris of eitha anfineuro- 
granin (18% SDS-PAGE) cf anfidustenn (8% SDS-PAGE) 
anybodies. The arrlidusionn antibody recognized 2 platelet 
immunoreaclive species under, shorter exposure. Although 
the retetrve neurogranin and duslom protein abundances 
a/e suboptrnany quarOjfiod by those, a^bryses. pbtoto dus- 
terirt appears to demonstote conskferabte expression when 
compared with thai previously identified in hepatocytes.^ 



Given the importance of complement activation in platelet destruc- 
tion, the prominent expression of eel I -surface cmslerin might 
suggest a role , for this protein in normal and pathologic events 
regulating platelet survival. Interestingly, a clusterin-deficient 
knockout mouse has been generated that demonstrates enhanced 
cardiac dysfunction in a model of autoimmune myocarditis.*' 
Although these mice apparently have normal baseline hemograms 
(B. Aronov/, personal communication, October 2002), it remains 
unestablished if they would be predisposed to immune type 
thrombocytopenia in systemic models of autoimmunity. 

Similarly, the gene encoding an intracellular effector protein 
that may have key roles in downstream platelet activation events 
lias now been demonstrated to have abundant transcript expression 
in human. platelets. Neurogranin is a highly expressed platelet 
transcript with its gene product demonstrating a primarily intracel- 
lular pattern of distribution. Neurogranin is generally described as a 
brain- specific. Ca ?4 -sensitive calmodulin-binding phospboprotein 
that is preferentially expressed in neuronal cell bodies and den- 
drites "'^ It is a specific protein kinase C (PKC) substrate that can 
also be modified by nitric oxide and other oxidants to form 
intramolecular disulfide bonds. Both its phosphorylation and 
oxidation state attenuate its binding affinity for calmodulin. 33 In 
Stimulated platelets, PKC generation is linked to various activation 
pathways such as calcium- regulated kinases, mitogen- activated 
protein (MAP) kinases, and receptor tyrosine kinases.' Thus, these 
observations suggest that platelet neurogranin may function as a 
previously unidentified component of a PKC-dependcnt activation 
pathway coupled to one (or more) of these effector proteins. 



Discussion 

These data provide documentation for a unique platelet mRNA 
profile thai may provide a tool for analy?ing platelet molecular 
networks. Nonetheless, the molecular analysis of the platelet 
transeriptome may be confounded by rhe constant decay of 
mRNAs in the absence of new gene transcription, a si mat ion 'hat 
may. for example, limit the identification of Imv- abundance 
transcripts. Similarly, because the circulating platelet pool contains 



a mixed population of variably aged platelets, a "static" mRNA 
profile represents an average of this heterogeneous blood pool. 
Despite these potential limitations, the combination of genomic 
and proteomic technologies are lively to provide powerful tools for 
the global analysis of platelet function. Current strategies for 
cataloging "whole cellular proteomes" are generally accomplished 
by using 2 developing methodologies:./ 1 ) high resolution 2-dimen- 
sional polyacrylamide gel electrophoresis (2-DE) with mass spec- 
trometric sequence identification, 34 and (2) miciocapillary liquid 
chromatography with tandem mass spectrometry (pJLC-MC/ 
MC). 35 Further modifications of both procedures have been devised 
for direct comparative studies between 2 cellular proteomes. The 
introduction of 2-DE differential gel electrophoresis has now made 
it possible to detect and quantify differences between experimental 
sample pairs resolved on the same 2-dimensional gel. 36 Likewise, 
the application of isotope-coded affinity tags to pLC-MC/MC 
represent a novel means of quantitative analyses between cellular 
proteomes. 37 The success of both approaches relies on the availabil- 
ity of comprehensive genomic databases and mathematical algo- 
rithms for optimal protein identification. Indeed, mathematical 
modeling studies have demonstrated the need to delineate both 
protein and mRNA expression levels for optimal definition of 
intracellular networks 38 Our data present an initial framework for 
delineating platelet function by defining the molecular anatomy of 
human platelets, information that is likely to provide important 
clues into the dynamic protein interactions regulating normal and 
pathologic platelet functions. Furthermore, because the platelet 
iransciiptome minors the mRNAs derived from precursor 
megakaryocytes, these analyses may provide insights into the 
biochemical and molecular events regulating megakaryocytopoi- 
esis and/or proplatelet formation. 
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Overexpression of a DEAD box protein (DDXl) in neuroblastoma and 
retinoblastoma cell lines. 

Godbout R . Packer ML Bie W. 

Department of Oncology, Cross Cancer Institute and University of Alberta 1 1 560 
University Ave., Edmonton, Alberta T6G I Z2, Canada. 

The DEAD box gene, DDX1, is a putative RNA helicase that is co-amplified with 
MYCN in a subset of retinoblastoma (RB) and neuroblastoma (NB) rumors and cell lines. 
Although gene amplification usually involves hundreds to thousands of kilobase pairs of 
DNA, a number of studies suggest that co-amplified genes are only overexpressed if they 
' provide a selective advantage to the cells in which they are amplified. Here, we further 
characterize DDX1 by identifying its putative transcription and translation initiation sites. 
We analyze DDXl protein levels in MYCN/DDX1 -amplified NB and RJB cell lines using 
polyclonal antibodies specific to DDXI and show that there is a good correlation with 
DDXl gene copy number, DDXl transcript levels, and DDXl protein levels in all cell 
lines studied. DDXl protein is found in both the nucleus and cytoplasm of DDX1- 
amplified lines but is localized primarily to the nucleus of nonarnplified cells. Our results 
indicate that DDXl may be involved in either the formation or progression of a subset of 
NB and RB tumors and suggest that DDXl normally plays a role in the metabolism of 
RNAs located in the nucleus of the cell. 

PM1D: 9694872 (PubMed - indexed for MEDLINE] 
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rnstitute of Biophysics Carlos Chagas Filho, UFRJ, Brazil. 

eel L fo r S l t ' I u° W h,gh C( ? nflue «<* states, and upon treatment of the 
fPKcf Cel^ T T h aCt ' Vat0rS ° f Pr ° tein Unme A *A) protein kinase C 

ScS Yel oT r "T Were « readUy C0UP ' ed ' " determined b > ^ 
ra^iSr' ^ .Tf; ^ "fiance, the cells uncoupled. Western blots and RT- 
PCR revealed ttat connexm 43(Cx43) was abundantly expressed in TM3 Leydig ceUs 

Tr PKcTndTT r deCreaSed aftCT * e Cd,S aChieVed S^ulSon of PKA 

1^5 J deCrc " SC m CeH " Ce " ^"nication. Staurosporin, an inhibitor of 

IT leases, mcreased coupling and was able to prevent and reveree theun^unlL 
actions of d.butyryl cAMP and 12^^^ 

LcreasS it ThtS , 6 membrane ,abeli «g for Cx43, whereas staurosporin 

mcreased iL These data show a strong correlation between functional coupling and the 
membrane d.stnbut.on of Cx43, implying that this connexin has an ia^SSS in 

secretion n response to lutemizing hormone was accompanied by a decrease in 
PM1D: 12740021 [PubMed - indexed for MEDLINE] 
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Expression level of Ubc9 protein iu rat tissues. 
Golcbiowski F, Szulc A, Sakowicz M . Szutowtcz A . Pawelczvk T , 

Department of Molecular Medicine, Medical University of Gdansk 80-2 1 1 Gdansk 
Poland. ' «"oi>, 

Ubc9 is a homologue of the E2 ubiquitin conjugating enzyme and participates in the 
cova lent linking of SUMO-I molecule to the target protein. In this report we describe a 
simple and efficient method for obtaining pure human recombinant Ubc9 protein The 
punfied Ubc9 retained its native structure and was fully active in an in vitro sumoyiation 
assay with the promyelocytic leukaemia (PML) peptide as a substrate. In order to better 
understand the physiology of Ubc9 protein we examined its levels in several rat tissues 
lmmunoblot analyses perfoimed on tissue extracts revealed quantitative and qualitative 
differences .n the expression pattern of Ubc9: The Ubc9 protein was present at a high 
level in spleen and lung. Moderate level of Ubc9 was detected in kidney and liver Low 
amount of Ubc9 was observed in brain, whereas the 1 8 kDa band of Ubc9 was barely 
visible or absent in heart and skeletal muscle. In heart and muscle extracts the Ubc9 
antibodies recognized a 38 kDa protein band. This band was not visible in extracts of 
other rat tissues. A comparison of the relative levels of Ubc9 mRNA and protein 
indicated that the overall expression level of Ubc9 was the highest in spleen and lung In 
spleen, lung, kidney, brain, liver and heart there was a good correlation between the 1 8 
kDa protein and Ubc9 mRNA levels. In skeletal muscle the Ubc9 mRNA level was 
unproport .onally high comparing to the level of the 1 8 kDa protein. The presented data 
indicate that in the rat the expression of the Ubc9 protein appears to have some degree of 
tissue specificity. & 
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ABSTRACT 

Motivation: Protein abundance is related to mRNA 
expression through many different cellular processes. 
Up to now, there have been conflicting results on how 
correlated the levels of these two quantities are. Given that 
expression and abundance data are. significantly more 
complex and noisy than the underlying genomic sequence 
information, it is reasonable lo simplify and average them 
in terms of broad proleomic categories and features (e.g. 
Junctions or secondary structures), for understanding their 
relationship. Furthermore, it will be essential to integrate, 
within a common framework, the results of many varied 
experiments by different investigators. This will allow one 
to survey the characteristics ol highly expressed genes 
and proteins. 

Results: To this end, we outline a formalism lor merging 
and scaling many different gene expression and protein 
abundance data sets into a comprehensive reference 
set, and we develop an approach lor analyzing this in 
teims ol broad categories, such as composition, function, 
structure and localization. As the various experiments are 
not always done using the same set ol genes, sampling 
bias becomes a central issue, and our formalism is 
designed to explicitly show this and correct tor it. We apply 
our formalism to the cunenlly available gene expression 
and protein abundance data lor yeast. Overall, we found 
substantial agreement between gene expression and 
protein abundance, in terms ol the enrichment ol slructuial 
and functional categories. This agreement, which was 
considerably greater than the simple correlation between 
these quantities tor individual genes, reflects the way 
broad categories collect many individual measurements 
into simple, robust averages. In particular, we found 
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that in comparison to the population of genes in the 
yeast genome, the cellular populations of transcripts and 
proteins (weighted by their respeciive abundances, the 
transcriptome and what we dub the translators) wore both 
enriched in: (i) the small amino acids Val, Gly, and Ala; 
(ii) low molecular weight proteins; (iii) helices and sheets 
relative to coils; (iv) cytoplasmic proteins relative to nuclear 
ones; and (v) proteins involved in "protein synthesis,' 'cell 
structure/ and 'energy production.' 

Supplementary information: http://genecensus.org/ 

expression/translatome 

Contact: maik.gerslein@yale.edu 

INTRODUCTION 

High throughput experimentation, measuring mRNA 
(Schena et ai r 1995; Eisen and Brown, J999; Ferea 
and Brown, J 999; Lipsbut2 er al., J999) and protein 
expression (Anderson and Seilhamer, J 997; Futcher ei al., 
1999; Gygi er al, 1999a; Ross-Jvlacdonald et ol., J 999; 
Lopez, 2000; MacBeaih and Schreiber, '2000; Nelson 
et al. T 2000; Zbu er al t 2000) are currently the single 
richest source of genomic information: However, how to 
best interpret this data is still an open question (Bassetl 
et al., J996; Wines and Friedman, J 999; Zhang, J 999; 
Gerstein and Jansen, 2000: Searls, 2000; She; lock, 2000; 
Clavenc, 1999; Einarson and Golcmis, 2000; Epstein and 
Buiow, 2000; Shapiro and Harris, 2000). Understanding 
how protein abundance is related to mRNA uanscript 
levels is essential for interpreting gene expression/protein 
interactions, structures and functions in a cellular .sys- 
tem (Bnt'/imanikalis et ah, 1999). Moreover, as protein 
concentration is the more relevant variable with respect 
to enzyme activity, it connects genomics to the physical 
chemistry of the cell (Kidrl et oL, 2001). Protein abun- 
dance may also be invaluable for diagnostics and for 
determining drug ta/gers (Corthals et a!. 2000). 
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Previously, we surveyed ihe population of protein 
features — such as folds, amino acid composition, and 
functions — in yeast, and other recently sequenced 
genomes (Gerstein, 1997, J998a,b; Gerstein and Hegyi, 
1998; Hegyi and Gerstein, 1999; Das and Gerstein, 
2000; Un and Gerstein, 2000), and we extended this 
concept . to compare the population of features in the 
yeast transcriptome to that in the genome (Drawid et 
ai, 2000; Jansen and Gerstein, 2000). Others have also 
done related work (Frishman and Mewes, 1997; Tatusov 
et ai y 1997; Jones, J 998; Wallin and von Heijne, 1998; 
Frishman and Mewes, 1999; Wolf et ai, 1999). Here, we 
present a new methodology to compare the features of the 
. mRNA expression population with the protein abundance 
population. 

Precise terminology is essential for this comparison. 
Unfortunately, 'proteome* is used inconsistently. Pro- 
teome can logically be used to describe all the distinct 
proteins in the genome (Qi et ai, 1996; Cavalcoli et ai, 
1997; Fey et ai, 1997; Garrels et ai, 1997; Caasterland, 
J 999; Jones, 1999; SaJi, 1999; Tekaia et oi, 1999; 
Bairoch, 2000; Cambilhu and Claverie, 2000; Doolitlle, 
2000; Pandey and Mann, 2000; Rubin et ai, 2000) and, 
in this context, it is equivalent to what others may refer 
to as the coding part of the genome. However, in papers 
on two-dimensional (2D) electrophoresis, it is often used 
to describe the sum total of proteins in a cell, taking 
into account the differeni levels of protein abundance 
(Shevchenko et ai, 1996; Gygi et ai, 2000a; Lopez, 
2000; Washburn and Yates, 2000). In an effon io be clear, 
we propose l he. term 'translatome'' for this second usage 
of proteome. 

With this definition, we are able to refer compactly to 
three different cellular populations. These are illustrated 
in Figure I. 

(i) We use" the term genome when we refer to the 
population of open rending frames, where each ORF 
counts once. 

(ii) We use the term transcriptome when we refer to 
ihe population of mRNA transcripts. This term was 
originally coined by Velculescn etnt. (1997). Note 
thai each ORF may give rise io- different numbers 
of transcripts. Consequently, ihe iranscrjptome is 
essentially the same as ihe genome but with each 
ORF weighted by its expression level. 

(iii) The next level is the cellular population of proteins. 
As each protein represents a translated transcript, 
we make an analogy with the term transcriptome 
and use the term translatome as described above 
to describe this third population. Thus, the trans- 
latome is a subset of the genome where each 
ORF. is weighted by its associated level of protein 
abundance. 



Note that one could also, less compactly call the trans- 
latome a 'weighted proteome.* However, doing so assumes 
one of the two aforementioned definitions of proteome. To 
avoid ambiguity, we studiously avoid the use of proteome 
altogether in the paper. 

Differences between the translatome and the transcrip- 
tome exist given that transcripts from different genes 
can give rise to different numbers of proteins, due to 
different rates of translation and protein degradation. 
Post- transcriptional modifications further affect the 
translatome. 

In our analysis of the transcriptome and translatome, we 
focus on global protein features rather than the compari- 
son of individual genes. Previous analyses have shown that 
differences between mRNA expression and protein abun- 
dance levels can be quite dramatic for individual genes. 
This may either be due to the noise in the data or to funda- 
mental biological processes. However, our analyses show 
that the variation between transcriptome and translatome 
is much smaller for global properties that are computed by 
averaging over the properties of many individual genes. 

METHODS 

Dal o sources used 

For our analysis we culled many divergent data sets, 
representing protein abundance and mRNA expression 
experiments and also other sources of genome annotation. 
These are all summarized in Tabic I. 

Biases in the data 

The databases thai annotate the specific genes may 
nol always be accurate (Jshii et at., 2000). Gene Chip 
experiments suffer with regard to cross hybridization 
and ihe saturation of probes. SAGE data degrades for 
lowly expressed mRNAs. 2D gels are unable lo resolve 
membrane proteins (approximately 30% of ihe genome) 
and basic proteins (Gerstein, 1998c; Krogh et ai, 2001). 
Jn addition, the procedures for identification and quan- 
tification of the protein spots are subjeel ^.uncertainties 
(Hayncs and Yates, 2000). Human biases include the 
lack of low abundance proteins (Fey and Larsen, 2001; 
Gygi et oi, 2000b; Harry et ai, 2000) and the differences 
between laboratories in sample preparation. On/ reference 
expiession data set attempts to resolve these problems. 

Oat a set scaling 

A reference set for mRNA expression. With many differ- 
eni mRNA expression data sets available, it is worthwhile 
io integrate them into a single unified reference set. with 
the intention of reducing the noise and errors contained in 
the individual daia sets and io obtain a unified estimate of 
ihe normal expression state in a cell. 
We adopt an iterative scaling and merging formalism. 
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Fig. 1. Schematic overview of the analysis. On (he left-side we outline the terms we use to describe the process of gene expression. The 
coding section of the genome is transcribed into a population of mRNA transcripts called the 'trahscriptome/ The transcripts in turn are 
translated (o a population of proteins; we use the term 'translatome' for this piotein population rather than the alternative *proteome* because 
the latter term may be confounded with the protein complement of the genome (which is not necessarily associated with a quantitative 
abundance level). 

The matrix in the middle schematically shows an analysis of the three stages of expression. In general, we define a protein 'population* as 
a set of genes associated with a corresponding number of expression or abundance levels ('weights'): In the matrix each row represents a 
weight and each column a gene set. In particular, we differentiate between the mRNA reference expression set (GmjRNA = ^Ctn)> which 
essentially covers the complete genome, and the reference protein abundance set (Gp lot ) which contains the proteins in data sets 2-DE if 1 and 
2-DE #2 (see Table I) because the protein abundance set is a significantly smaller subset of the genome. By definition, this subset contains 
only proteins that can be identified by 2 D gel electrophoresis and is therefore biased in this sense. The enrichment figures throughout this 
paper, through a comparison of the right- and left-sides of this figure, show the results of the experimental biases of 2D gels on the data set. 
Each pie chart represents a composition of a particular proteinfealure F (for instance, an amino acid composition) in a population '(represented 
by the symbol /i). We can further look at the 'enrichment' of this fearure in one population relative to another (represented by the symbol A, 
see Section 'Methods* for an explanation of the formalism). 



which we summarize below. We present n more detailed 1 ^ 
review of ihe methods on our web sile. 

We start with the values of one gene chip data sel 0, 
where i is used throughout as a subscript to denote gene 
number. We I hen transform [he values of the next Gene 
Chip data sel X t lo Y t with the following non- linear regres- 
■ sion: min£ ( - [Y t - V;) 7 with Y; = AX? where A and B 
are the parameters of the regression'. Note thai two Gene 
Chip sets may not be defined for the same sel of genes, 
so we have lo perform "he fit only over ihe genes com- 
mon to both sets. The motivation for scaling is that the 
dynamic range of obseived expression levels varies some- 
what between different data sets, although cell types and 
growth conditions ;ue very similar. Reasons lor dispar- 
ity may include diffeicnt calibration procedures for relat- 
ins fluorescence intensity to a cellntoi concentration (mea- 
sured in copies of transcripts per cell) or different pro- 
tocols for harvesting and reverse- transcribing the cellular 
mRNA. 

We (hen merce an J average the data to create a new 



reference set V as follows: 



If Ui and Y; are both defined for gene i and 



}Yi-Ui\ 



Then Vf = UY, ; + (A) 

Eke if only >',- exists, V; = >', 

Eke V, = Ui. 

As presented above, where only one data set has a value 
for the corresponding ORF, we incorporated that value 
and did not exclude it. When both data sets have values 
for an ORF, we averaged the values if they were within 
153- of each other; otherwise, we jusl stayed with the 
original chip data set (,',. We used cr — 15% in order to 
prevent outliers from skewing the result. This )5?c value is 
a reasonable threshold for excluding outliers though other 
values (e.g. JO or 2Q?c) would give similar results (data 
not shown). Other data sets are subsequently included in 
the same procedure, continuing the iteration from the new 
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expression values V t . The initial iteration starts with the 
Young Expression Set, as since we have the highest 
confidence in its accuracy. 

The SAGE data (Velculescu et oi, J 997) was not 
included in the above procedure since it is of a funda- 
mentally different nature. An advantage of the SAGE 
technology over Gene Chips, is that there is no. possible 
signal saturation for high expression levels, as is possible 
for chips (Futcher et a/., 1999). Conversely, SAGE values 
are less reliable for lowly expressed genes since there 
is a chance that one might not sequence a SAGE tag 
corresponding to such a gene altogether. Therefore, if 
after the last iteration, the average Gene Chip expression 
level Vj was both above a certain threshold and below 
the SAGE expression level S; for the same gene, it was re- 
placed with the SAGE value; otherwise the average Gene 
Chip value was kept. This gave us our final expression 
set w mRNA . Our treatment of the SAGE data is modeled 
after that in Futcher et at (J 999), and like them, we used 
0=16. 

This incorporation of the SAGE data into the reference 
data set ensures that the highly expressed outliers are as 
accurate as possible. 

Rather than plain arithmetic averaging, this overall scal- 
ing procedure with the or cutoff avoids 'artificial averages' 
that combine very different values for a particular gene. 
Some expression values might be statistical outliers. In . 
additional may be possible that the expression levels of 
a variety of genes can only be within mutually exclusive 
ranges or modes, such as when two alternative pathways 
are switched on or off. Simply averaging these would give 
values that are less representative of the particular mode 
values. This situation is analogous to that in averaging 
together an ensemble of protein structures (i.e from NMR 
structure determination). Each structure could be stereo- 
chemical^ correct, with all side-chain atoms in predefined 
rotamer configurations. However, an average of all struc- 
tures could yield one that is stereochemically incorrect if 
this involved averaging over particular side- chains in dif- 
ferent roiameric slates. 

With regard lo our regression analysis, we have investi- 
gated both non- linear and linear fits but found a non linear 
procedure lo be more advantageous. The non-linear rela- 
tionship between different expression daia sets perhaps 
reflects saturation in one or more of the Gene Chips — not 
an uncommon phenomenon. This non-linearity is imme- 
diately evident on scatter plots of two data sets against 
one another (see website). Accordingly, the non- linear 
fit produces a smaller residual than the linear fit: 98 297 
(non- linear) versus J 22 IS? (linear) for the scaling of the 
Chmch data set and 59828 (non-linear) versus 67-162 
(linear) for the Samson data set. 



A reference set for protein abundance. We followed a 
similar procedure to calculate a reference protein abun- 
dance set from the two gel electrophoresis data sets. We 
first scaled the two data sets against the mRNA expres- 
. sion reference data set, getting regression parameters Cj 
and Djz 

i 

where the subscript j indicates the data set 2-DE #1 or. 
2-DE #2 respectively; P; j is the protein abundance value 
in data set j t and u> m RRft;; the corresponding reference 
expression value, and Cj and Dj are the parameters of 
the non-linear regression. 

Using these parameters, we transformed the values of set 
2-DE #2 onto 2-DE #1. Then we combined both sets into 
the reference protein set vfptot by averaging them, if both 
values existed. Otherwise, by using the existing value, viz: 

wpioU = {Pi.) + Qij)f2 if both Pi j and Q t2 exist. 
Else if only P t j exists, wp f0t j = Pij 
Else if Q; 2 exists, Wpjw ; = Qj.2- 

Enrichment of features 

Formalism. In the next part of our analysis, we want 
to group a number of proteins together into various 
categories based on common features and characterize 
those features that are enriched in one population relative 
to another, i.e. the translatome population of proteins 
as measured by 2D gels relative to the transcripiome 
population of transcripts or the genome population of 
genes. To this end, we set up a formalism that could 
be applied universally to all the attributes that we were 
interested in. Due io the limitations of the experiments, 
the translatome, transcripiome, and genome populations 
are defined on different sets of genes, and sometimes we 
want to remove this 'selection bias' by forcing ihem lo be 
compared on exactly the same set of genes. This is a key 
aspect of our formalism* as presented in Figure I . 

We call an entity like |w, 0) a 'population/ where C 
is a set describing a particular selection of genes from the 
genome and w is vector of weights associated with each 
element of this population. In particular, we focus On three 
main populaiions here: 

(>) |J ; O'ci)} is the population of genes in the genome, 
all 6280 genes weighted once (w — J); 

(ii) h v mRNA; GmRNA ) is the observed population of the 
transcripts in the transcripiome. i.e. the 62-19 geries 
in the reference expression set weighted by their 
reference expression value: 
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(iii) [w Frol , Gp IO i] is the observed cellular population of 
the proteins in the transJatome, i.e. (he 181. genes 
in Ihe reference abundance set weighted by their 
reference abundance value. 

(The set of genes in the genome Gocn js approximately 
equal to the genes in sel G m RM A , such thai we can use 
both symbols interchangeably.) We can also use this nota- 
. tion to describe specific experiments— e.g. [w^, G, 3C z) 
describes (he gene set and weights relating to the transpo- 
son abundance set. 

Furthermore, we define Fj as the value of a feature F 
in ORF j. For example, F could be the composition 
of leucine (a real number) or a binary value (0 or 1) 
indicating whether an ORF contains a trans-membrane 
segment. Given these definitions, the weighted average of 
feature F in population [w, G] is: 

mC% I>v, G.]) s — ^ 

The weighted averages of two populations [w, G] and 
[v, S) can be compared by simply looking at their relative 
difference A: 

A(7\ Iv, SI K G)) = ^KCT-^.^Gl) 
' ■/*(/=•. KCJ) 

where v and w are weights for the sets of ORFs S and G 
respectively. We call A the 'enrichment' of feature F 

because it indicates whether F is enriched (if A is 
positive) or depleted (if A is negative) in population [v, S) 
relative to (w, G). 

Usually, ihe gene sel G is defined by ihe particular 
.experiment, for which the weight w was measured. 
However, it is also possible to combine the gene set 
associated with one experiment wjib expression levels 
from another set. One may want to do this to compute 
the enrichment only on the genes common to both 
populations, for which there are defined values for both w 
and v, viz: A{F. \v, S D G). \w, $ O GJ). In practice, 
this is most relevant for comparing Gi> IOI and G mR N A . 
Since Gp lol is completely a subset of G m R^ A( we need 
not explicitly deal with intersections if we calculate all 
statistics directly over Gp, 01 . 

One can adjust the weight vectors to take into account 
different types of averaging. For instance, when com- 
puting the amino acid composition {F - on) from the 
amino acid compositions of individual ORFs Fy ~ ooj 
(V; e G). we weight by ORF length. Jn the case of - 
expression weights, we have: 

Wj = Njw mR *iA.j v 7 € G 

where A'; is a me asm e of the length of ORF / (such as the 
number of amino acids). 



On the other hand, when computing the average molec- 
ular weight per amino acid, we need to normalize by the 
number of amino acids per ORF, which is equivalent to 
choosing Ihe following weights: 

. Application of methodology lo quantitative 
abundance sets 

Having defined our formalism, we applied it to a diverse 
set of protein features in yeast. 

. Ammo acid enrichment. As shown in Figure 2a, we used 
our methodology to measure the enrichment of individual, 
amino acids in both the transJatome arid the transcriptome 
relative to the genome. We found that three amino acids — • 
valine, glycine and alanine — were consistently enriched in 
both transcriptome and transJatome populations. 

In Figure 2a we compare different gene sets. In Fig- 
ure 2b we focus mainly on the variation in enrichments 
when all the comparisons are restricted to the set of 1 8 J 
genes (Gp l0t n G m RNA = Gp ;0 |) common to all data sets. 
Thus, the differences between the populations now only 
reflect the effects of differential transcription of certain 
genes and differential translation of certain transcripts. 
We find here an enrichment specifically of cysteine in the 
translatome in relation to the transcriptome. 

To measure ihe statistical significance of the results on 
amino acid enrichment, we have performed a control anal- 
ysis on a randomized data set (Figure 2d). We randomly 
permutated the expression values of (he ORFs 1000 times 
and then recomputed the enrichments. This allowed us to 
compute distributions for the amino acid enrichments and, 
from integrating these, one-sided />- values indicating the 
significance of the observed enrichments. 

Amino acid enrichment in Transposon data set. We 
also tried to extend our methodology, ineffectively, lo 
cope with the semi-quantitative Transposon set. We used 
only those 450 ORFs that consistently yielded either no 
expression or high expression, as binary data, on or off. We 
show the enrichments of amino acids computed from this 
filtered Transposon abundance set in Figure 2a. Overall, 
the enrichments from this set seemed to be attenuated in 
comparison to other data. 

Bicmoss enrichment. A corollary to amino acid enrich- 
ments is the determination of the aver ape biomass of the 
transcriptome and translatome populations (shown in Fig- 
ure 2c). We found that the average molecular weight of 
a protein in both populations \v;ts. on average, lower than 
in the genome population. These preliminary observations 
suggest a cell preference to use less energetically expen- 
sive proteins for those that an- highly-transcribed or trans- 
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Fig. 2. Amino acid and biomass enrichment, (a) Shows the amino acid enrichments between different populations as indicated by the legend 
to the right of the plot (the legend is ordered in the same way as the schematic illustration in Figure 1). The bars indicate the enrichment of the 
transcriptome relative to the genome, whereas the circles indicate the enrichment of the U a ml a tome relative to the genome. In addition, we 
also show the enrichment for protein abundance from the Transposon abundance set, represented by the circles with the line through them, 
(b) Shows a different view of amino acid enrichment from that contained in (a), now focusing on changes, and thus restricting the comparison 
to the genes common to all the data sets. The graph is ordered according to the enrichment from transcriptome to translatome (black squares). 
We focus here only on ihe changes for the abundance gene set (Gp, 0 i) to exclude the effects that arise from looking at different subsets. )n this 

; view the enrichments from genome to transcriptome (while squares) and from genome to translalome (white diamonds) look more similar 
lhan do the analogous sets in (a). To make comparison with (a) easier we again show the enrichment from genome lo the transcriptome for 
the complete gene set (Gcen. shown in bars), (c) Shows biomass enrichment. The left panel depicts the average molecular weight per ORF 
(in units of kDa) and the right panel, the average molecular weight per amino acid (in units of Daltons) in each of the three stages of . gene 
expression. The numbers inside the circles indicate the average molecular weights. The values next to the arrows indicate the enrichments 
in biomass between different populations. Both the circle diameters and the arrow widths are functions* of the corresponding values (ihe 
hollow arrow indicates □ positive value). It is very clear that the average molecular weight per ORF is much lower in the translatome (by 20 
or 15%) and transcriptome (by 29%) lhan in the genome, This relative depletion of biomass mainly takes place as a result of transcription; the 
effect of translation is less clear, depending on the populations compared. On the other hand, the depletion in the average molecular weight 
per amino acid (-3.3?c from genome lo translatome) is an order of magnitude smaller than in the average weight per ORF This shows 
that the yeast cell favors the expression of shorter ORFs over longer ones, and agrees with our earlier observation that there is a negative 

• correlation between maximum ORF length and mRNA expression (Jansen and Gcrstein. 2000); it seems that this effect mainly takes place 
during transcription lathci than tianstaiion. (d)This plot shows thai the amino acid enrichments are statistically significant. We have assessed 
significance by landumlv permuting the expression levels among the genes and then recomputing the amino acid enrichments. This procedure 
can be repeated and used 10 generate distributions of random enrichments that can then be compared against the observed enrichments. )n 
ihe plot the gray bars represent ihe observed enrichments already shown in Figure 3a. On top of the gray bars we show standard boxplois 
of enrichment distributions based, on 1000 random permutations. (The middle line represents the distribution median. The upper and lower 
sides of the box coincide with the upper and lower quariiies. Outliers are shown as dots and defined 35 data points that are outside the range 
of the whiskers, the length of which is 1.5 the interquartile distance.) Based on the random distnbuiions, we can compute one-sided p- values 
for the observed .-nrkhrwms. Amino acids for which the p~ values arc less than 10" 3 ate shown in bold font. 



toted. However.. uc also found thai the .average molecu- 
lar weigh l pet ammo act J differed much Jess between the 
ttanscrjpiome an J the nanshiome on the one ha»d : ;inij the 



genome on the orher hand (ibough ii was still slightly Jess). 
This finding indicates that lower molecular weights in the 
translatome and transoiptome relative 10 the cenome aie 
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predominantly due to greater expression of .shorter pro- 
teins rather than the incorporation of smaller amino acids. 

Scco?\dary. structure composition. We also used our 
meihodology to study the enrichment of secondary- 
structural features. Secondary structural annotation was 
derived from structure prediction applied uniformly to all 
the ORFs in the yeast genome as described in Table J. 
As shown in Figure 3a, all three populations — genome, 
transcriptome, and translatome — had a fairly similar 
composition of secondary structures — sheets, helices, and 
coils. The differences between populations were marginal 
and based only on the small subset of genes. 

We also found that Transmembrane (TNI) proteins 
were significantly depleted in the transcriptome (see 
website and caption). These results are consistent with 
om previous analyses (Jansen and Gcistein. 2000). The 
protein abundance data does not have any membrane 
proteins. . 

Subcellular localization. Figure 3c shows the enrich- 
ment of proteins associated with the various subcellular 
compartments. For clarity, we divided the cell into five 
distinct subcellular compartments, (see Table 1). We 
round that, in comparison to the genome, both ihe iraiv 
scriptome and translatome are enriched in cytoplasmic 
proteins. This is true whether we make our comparisons in 



relation to the relatively large reference mRNA expression 
set or the smaller reference protein abundance set. As 
Figure 3c shows, the 2D gel experiments are clearly 
biased towards proteins from the cytoplasm. However, in 
the biased subset Cp l0t transcription and translation lead 
to an even higher fraction of cytoplasmic proteins in the 
translatome. 

Functional categories. Finally, we compared the enrich- 
ment of various functional categories in both the trans- 
latome and the transcriptome (see Figure 3b). This gives 
us a broad yet informative view of the cell as a whole. As 
described in Table I, we used the top-level of the MIPS 
scheme for the functional category definitions. We found 
broad differences between the various populations, with 
some of the functional categories showing strikingly high 
enrichments. 

DISCUSSION AND CONCLUSION 

We developed: (i) a methodology for integrating many dif- 
ferent types of gene expression and protein abundance into 
a common framework and applied this to a preliminary 
analysis; (ii) a procedure for scaling anil merging different 
mRNA and protein sets together; and (iii) an approach for 
computing the enrichment of various proteomic features in 
the population of transcripts and proteins. Wc showed that 
by analyzing broad categories instead of individual noisy 
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Fig. 3. Breakdown of the tianscriptome and tianslatome in terms of broad categories i elating to structure, localization, and function. All 
of the subfigures arc analogous 10 the schematic illustration in Figuje ). (a) Represents the composition of secondary structure in the 
different populations, (b) Represents the distribution of subcellulaj localizations associated with proteins in the various populations. We used 
standardized localizations developed earlier (Diawid and Gei'stein, 2000), which, in turn, were derived from the MJPS, YPD, and SwissProt 
databases (Bairoch and Apwcilei, 2000; Costamo et ai, 2000; Mewes et aL, 2000). The subcellular localization has been experimentally 
determined for less than half of the yeast proteins, so our analysis applies only to this subset, (c) Shows the division of ORFs into different 
functional categories (accoidjng to the MJPS classification) in the various populations. Only the largest functional categories of the top level 
of the MIPS classificaiion are shown. The group 'other contains the smaller top-level categories lumped together. This 'other' group is 
different from the gioup "unci as sifted,' which contains genes without any functional description. 



data points, we could find logicnl trends in the underlying 
data. For example, individual transcription factors might 
have highet oi lower pioiein abundance than one expects 
from I heir mRNA expression, but the category transcrip- 
tion factors" as a whole has a similar representation in the 
trnnscriptome ami trans la tome. 

We found, as previously described (Futcher et al. : J 999; 
Gygi ei ni, 1999b; Greenbanm et oi. a weak 

conelation between individual measinemems of mRNA 
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and protein abundance. The outliers" of this correlation 
lend lo be associated with cellular organization. One 
might conceive of using these outliers (i.e. those with 
significantly different transcriptional and uanslationa) 
behavior) to rind consensus regulatory sequences. One 
possible method would involve using predicted mRNA 
structures f.laegei et nl. t 1990; Zuker : 2000) to find and 
investigate consensus structural elements in these outliers 
to which the yeast transitional machinery is known to be 



mRNA expression and protein abundance data 



TaMe I. Data sets 



Description 



Size lORFs} 



Reference ■ 



mRNA expression 
Young 

Church 1 

Samson 

SAGE 

Reference expression 

Protein abundance 
2-DE#I 

2-DE#2 
Transposon - 

Reftience abundance 

Annotation 

Annotated localization 

TM segments- 

MIPS functions 

COR secondary structure 



Gene chip profiles yeast cells with 
mutations that affect transcription 
Gene chip profiles of yeast cells 
under four different conditions 
Comparing gene chip profiles for 
yeast cells subjected to alkylating agent 
Yeast cells during vegetative growth 

Scaling and integrating the mRNA 
expression set into one data source 



Measurement of yeast protein 

abundance by 2D gel 

electrophoresis and mass spectrometry 

Similar to 2-DE set #1 

Large-scale fusions of yeast genes 
with locZ by transposon insertion 
Scaling and integrating the 2- DE 
data sets into one data souice 



Subcellular localizations of yeast 
pioteins 

Predicted TM and 
soluble proteins in yeast 
Functional categories foi yeast 
ORFs 

Predicted secondary structure 
yeast ORFs . 



5455 

6263 

6090 

3778 
6249 

156 

71 
1410 



2133 
(6280) 

2710 
(6280) 

3519 
(6)94) 

6280 



Hoktege/ioUI99B) 
Roth rt ol. (1998) 
Jclinsky and Samson (1999) 
Velcufcscti et al. (1997) 

Gygi tt at. (1999a,b) 

Futcher et al (1999) 
Ross-MacdonaJd et ol (1999) 



Drauid and Geistein (2000) 
Gersttm(J998a.b,c) 
Mewes et at. (2000) 
Gerstein(1998a f b.c) 



This table provides an overview of the data sets used in out analysis. The table is divided into . hrce sections. The rop section lists different mRNA cxpressron 
sets The middle section shows the protein abundance data sets used. The botlom section contains different annotations of piotein features. The column "Data 
set' Hits a shorthand reference 10 each data set used throughout this paper The next columns contain a brief description of .he data sets, the number of ORFs 
contained in each of them, and the literature reference. In contrast to the other data we investigated, the refeience expression and abundance data sets have 
been calculated (or the purpose of out analysis (see text). An expanded version of the table is available on our web site. 
Some further information on. the genome annotations: 

Localization. Protein localisation information from YPD. MIPS and SwissPiot were merged, filtered and standardised (Bai.och and Apweilei. 2000; 
Costanzo tt ol.. 2000; Mewes n ol., 2000) imb five simplified compartments— cytoplasm, nucleus, membrane, extracellular (including proteins in ER and 
golgi). and mitochondrial-according to ibe protocol in Drawid , r al. (2000). This yielded a standardized annotation of protein subcellular localization lor 
2 1 33 out of 6780 ORFs/ 

m itsmrnts. In 2710 out of 6280 yeast ORFs TM segments arc predicted to occur, ranging from low to high confidence (732 ORFs). The TM prediction was 
perf onned as follows: the values from the scale lot amino acids in a window of size 20 (the typical size of a TM helix) were averaged and then compared 
against a cuioff of -1 kcal mol" 1 . A value under this cutoff was taken to indicate the existence of a TM helix. Initial hydrophobic soeiches conespond.ng to 
sijna] sequences for membrane insertion were excluded. (Theie have the pattern of a charged residue within the first seven, followed by a soetch of 14 with 
in overage hydrophobic ify under the cutoff.) These parameters have been used, tested, and tc fined on surveys of membrane protein in genomes. -Sine' 
membrane proteins had at Iras, two TM-segmcn.s with an average hytbophobicil v less .firm - 7 V cal mol" 1 (ROM et nl. 1 995; GeiMf in " al.. 2000. San.on. 
fin/.. ?(XX>; Senej 1 1 til. . 2000). 

l umiians. MIPS functional categories have been assigned lo35l9 out ol 6194 ORFs. (TV remainder arc assigned lootcgory ?8' or W which 



corresponds -to unclassified function.) 

sensitive (McCarthy, 1998). 

In jctolioM to functional categories, we found three 
trends that were particularly notable: (i) lne 'cellular 



organization.' 'protein synthesis/ and 'energy pioduciion' 
categories were increasingly enriched as we moved from 
genome ,( > tianseriptome to transbtome. )n (he tianicrip- 
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tome and translatome population relative to the genome; 
(ii) proteins with 'unclassified function* are significantly 
depleted, perhaps reflecting a bias against studying them; 

proteins in the 'transcription' and 'cell growth, cell 
division, and DNA synthesis' categories were consistently 
deputed. This reflects the fact that many of these proteins, 
such as transcription factors, act as 'switches' such that 
onfy small quantities of the protein are necessary to 
activate or deactivate a process. These results concur with 
previous calculations (Jansen and Gerstein, 2000) wherein 
we found the transcriptome is enriched specifically with 
proteins involved in protein synthesis and energy. 

Limitations given the small size of the protein 
abundance data 

Even with the extended . coverage made possible by 
merging many data sets together into reference sets, the 
analysis is still limited by the minimal data. This was 
most applicable to the protein abundance measurements, 
potentially biasing our statistical results towards certain 
protein families. Moreover, the J 81 proteins in Cp f0 i do 
not represent a random sample. They are skewed towards 
highly expressed, well-studied proteins. Our methodology 
attempts to control for ihis gene- selection bias through 
our enrichment formalism, which allows one to rather 
precisely gauge various aspects of the bias. Conversely, 
many protein features in both the translatome and the 
transcriptome are dominated by highly expressed proteins. 
Under these circumstances, it is often sufficient to look at 
this smaller number of dominating proteins to characterize 
the whole population. This is similar to the development of 
the codon adaptation index lor yeast {Sharp and Li, 1987). 
While based on only 24 highly expressed proteins, it has 
proven to be robust in predicting expression levels for the 
entire genome. 

We believe thai the essential formalism and approach 
that we develop will remain quite relevant for future data 
sets (Smith, 2000). 

ACKNOWLEDGEMENT 

M.G. thanks the Keck foundation for support. 

REFERENCES 

An,H.. Scopcs.R.K. et cl. (1991) Gel elecirophorctic analysis 
ot Zymomonas mobilh glycolytic and fermentative enzymes: 
identification of alcohol (tehytliogcna.se II as a sire.ss pi ore in. J. 
Bncieriol.. J 73, 5975-59S2. 

Anderson,!., and SeilhamcrJ. (1997) A comparison of jckcted 
n)RNA and protein abundances in human livei. Electrophoresis, 
)S, 533-537. 

Bairoch.A. (7000) Serendipity in bioinformatics, the tribulations of 
a Swiss bioinfomiarician thtotigh e.xciiing times f Bioinjnrnuirics, 



BairoclnA. and Apweilerji. (2000) The SWJSS-PROT protein 
sequence database ami its supplement TiEMBL in 2000. Nucleic 
. Acids Res., 28,45-48. 

Bassett JXE Jr., BasraiJUA. et ol (1 996) Exploiting the complete 
yeast genome sequence. Curr. Opin. Genet. Dev., 6,763-766. 

BafteJ., Benito,V.A. et al (1 992) A possible in vivo mechanism 
of intermediate transfer by glycolytic enzyme complexes: steady 
state fluorescence aoisotropy analysis of an enzyme complex 
formation. Arch: Btochem. Biophys., 296, 654-659. 

Ca/nbil)au,C and Oaverie^.M. (2000) Structural and genomic 
correlates of hyperthermost ability. J. Biol. Chem.,215, 32383- 
32386. 

CavalcoJU.D., VanBoge)en,R.A. et ol (1 997) Unique identification 
of proteins from small genome organisms: theoretical feasibility 
of" high throughput proleome analysis. Electrophoresis, 18, 
2703-2708. 

Oaverie^I.M. (J 999) Computational methods for the identification, 
of differential and coordinated gene expression Jin process 
citation]. Hum. Mot. Genet., 8, ] 821-1832. 

Corthah^G.. Wasingej.V.C. HochsirasscT^D.F- and SarrchezJ.C. 
(2000) The dynamic range of protein expression: a challenge for 
proteomic research. Electrophoresis, 21 r 1 104-1 115. 

Costanzo,M.C., HoganJ.D. et al (2000) The Yeast Pioteome 
Database (YPD) and Ca'enorh'abditis elegans Proleome Database 
(WormPD): comprehensive resources for the organization and 
comparison of model organism protein information. Nucleic . 
Acids Res., 28, 73-76. 

Das,R. and Geistein,M. (2000) The stability of thermophilic pro- 
teins: a study based on comprehensive genome comparison. 
Funct. Int. Genom., 3,33-^5. 

DooIilile.W.F. (2000) The naiwe of (he universal ancestor and the 
evolution of the proleome. Curr. Opin. Struct. Biol., 10, 355-358. 
Drawid.A. and Gersicin.M. (2000) A Baycsian system integrating 
expression data, with sequence patterns /or localizing proteins: 
comprehensive application to the yeasi genome. J. Mot. Biol., 
301, J 059- 1 075. 

Drawid.A., Janscn.R. et al. (2000) Gene expression levels are 
correlated with protein subcellular localization. Trends Genet., 
10, 426-^130. 

Einarson t M. and Golemi.s.E. (2000) Encioaching genomics: adapt- 
ing large-scale science to small academic laboratories. Physiol. 
Genom., 2, 85-92. 

Eisen.M.B. and Brown.P.O. (1999) DNA arrays for analysis of gene 
expression. M eth. Entympl, 303, 179-205. 

Epstein r C. and Butow,R. (2000) Microarray technology — enhanced 
versatility, persistent challenge. Curt. Op in. Bio techno!.. I J, 36- 
41. 

Ferea.T. and Brown.P. 0999) Observing the living genome. Cu/r. 

Opin. Genet. Dev.. 9, 715-722. 
Fey.SJ., Nawrockj,A. et at. (1997) Proirome nnatysis of Sncchu- 

romrces cerevisioe: a methodological out line. F.lcarophnreiii, 

n, 1361-72. 

Fey. S.J. and La r sen, P.M. t?00l ) 2D nr not 2D. Two dimensional eel 
electrophoresis. Curr. Opin Client. Bioi.S. 26-33. 

Frishman.D. and Mcwcs.H.W. (1997) Piotrin structural classes in 
five complete genomes |)eiitij. A'.-j/. ?.inia. Biol., t. 626-628. 

Frishman.D. and Mcwes.H.W. (1999) Genome- based structural 
biolopy. Pru$. Biophys. Mol Biol.. 17. )- J 7. 



mftNA expression and protein abundance data 



Futcher3-. Latter ,G. etoL (1999) A sampling of the yeast proteome. 

Mot. Celt Biol, 19. 7357-7368. 
Gaasterland/T. (1999) Archaeal genomics. Curr. Opirt. Microbiol., 

2,542-547. 

Garrels IJ., McLaughKn.CS. et al. (1997) Proleoroe studies of 
Soccharomyces cerevisiae'. identification and characterization of 
abundant proteins. Electrophoresis , 18, 1 347- 1 360. 

Gerstein,M. (1997) A structural census of genomes: comparing 
bacterial, eukaryottc, and archaeal genomes in terms of protein 
. structure. J. Mo). Biol , 274, 562-576. 

Gerslein,M. (1998a) How representative are the known structures of 
the proteins in a complete genome? A comprehensive structural 
census. Fold. Des., 3, 497-51 2. 
• GersteinJM; (1998b) Patterns of protein-fold usage in eight micro- 
bial genomes: a comprehensive structural census. Proteins, 33, 
518-534. . 

Gerslein,M. (1998c) Patterns of protein-fold usage in eight micro- • 
bial genomes: a comprehensive structural census. Proteins^ 33, 
518-534.. 

GersleirUvL and HegyiJl. (1998) Comparing genomes in terms of . 
protein structure: surveys of a finite parts list. FEMS Microbiol. 
Rev., 22, 277-304. 

Gerstein,M. and Jansen,R. (2000) The current excitement in bioin- 
formaties, analysis of* whole- genome expression data: how does 
it relate to protein structure and function. Curr. Op'm. Struct. 
Biol, 10, 574-584. 

Gerstein,M., Lin J. et al: (2000) Protein folds in the worm genome. 
Pac. Symp. Biocomput. , 30-4 1 . 

Grecnbaum,D., Luscombe.N. et al. (2001) Interrclaiing different 
types of genomic data, from proteome to secretome: 'oming in 
on function. Genome Res., J J, I463-M68. 

Gygi.S.P., Rist.B. et al (1 999a) Quantitative analysis of com- 
plex protein mixtures using isotope-coded affinity lags. Nature 
Bintechnol, 17,994-999. 
■ Gygi.S.P, Rochon r Y. et al. (1999b) Correlation between piotcin and 
mRNA abundance in yeast. MoL Cell Biol, 19, 1720-1730. 

Cygi.S.P, Conhals.GL. et al. (2000n) Evaluation of two- 
dimensional gel electrophoresis- based prole omc analysis tecb- 

. nology. Proc. Nat} Acad. Sci USA, 97, 9390-9395. 

Gygi.S.P., Rist.B- ct al. (?000b) Measuring gene expression by 
quantitative proteome analysis. Curr. Op'm. Biotechnol, J J, 396- 
401. 

Harry J. L, WiJkins.MR. i-t a I. (2000) Pioicomics: capacity versus 

ut i) > ly. Elecfroplwreisis , 2 1 , 1 07 1 - 1 08 1 . 
Hat2irnoniliaiis,V. p ChoeJ.Jl. et nl. (1999) Proteomics: theoretical 

and experimental considerations. Biotechnol. Prog., 15. 312- 

318: 

Havnes.PA. and Yaics ,J.R. (2000) Proteome profiling- pit falls and 

progress. Yean. 17, St-87. 
Hegvi.H. and Gerstfin.M. (1999) The relationship between protein 

structure and fwrction: a comprehensive juivey with application 

to ihe yeast grnomr. / Mot. Biol. 28*. I-t 7- 164. 
Holste^e.F.C, JmninpsX.G. rt al. (1998 J Dissecting ihe regulatory 

circuit! y of a iid.aryotic genome. Ceil. 95. 7 I7-72S. 
lshii.M., Hashinioio.S. ft al. (2000) Diiect comparison of gencchip 

and SAGE on ihr tponliiativr .iccm.icv in nansoipr profiling 

analysis. Grnomin, 68, 13b- 143. 
Jto.T. Tashiio T K. ft al. (2000) Toward a protein- pi oiein interaction 



map of the budding yeast: a comprehensive system to examine 
two-hybrid interactions in all possible combinations between the 
yeast proteins. Fwc. Natl Acad. ScL USA, 97, 1 143-J 147." 
Jaeger,J.A., Turner,D.R et al. (1990) Predicting optimal and 
suboptimal secondary structure for RNA. Meih. Enzymol., 183, 
281-306. 

Jansenjt. and GersteinJM. (2000) Analysis of the yeast transcrip- 

tomc with structural and functional categories; characterizing 

highly expressed proteins. Nucleic Acids Res. , 28, 148 1-1488. 
leliusJry^S.A. and Samson JLD. (J 999) Global response of Saccka- 

romyces cerevisiae to an alkylating' agent. Proc. Natl Acad. Sci. 

USA, 96,1486-1491. 
Jones,D.T. (J 998) Do transmembrane protein superfolds exist? 

FEBS LetL.423, 281-285. 
Jones ,D.T. ( 1 999) GenTHREADER: an efficient and reliable protein 

fold iccognition method for genomic sequences. J.. MoL Biol, 

287,797-815. 

KiddJO. et al (2001) Profiling serine hydrolase activities in complex 

proteomes. Biochemistry, 40, 4005-4015. 
KloseJ. (1975) Protein mapping by combined isoelectric focusing 

and electrophoresis of mouse tissues. A novel approach to 

testing for induced point mutations in mammals. Humangenerik, 

26,231-243. 

KroghA et al. (2001) Predicting transmembrane protein topology 

with a hidden Markov model: application to complete genomes. 

J. MoL Bid, 305, 567-580. 
VmJ. and Gersiein,M. (2000) Whole- genome trees based on the 

occurrence of folds and orthologs: implications for comparing . 

genomes on different levels. Genome Res. r 30, 808—818. 
Lipshutz,R.F. S„ Cingcras.T.R. and Lockhan.DJ. (1999) High 

density synthetic oligonucleotide arrays. Nature Genet., 21, 20- 

24. 

Lopez ,M.F. (2000) Better approaches to finding the needle in a 

haystack: optimizing proteome analysis through automation. 

Electrophoresis, .21 , 1 082- 1 093. 
MacBeaih.G. and Schjeiber.S.L. (2000) Printing proteins as mi- 

croarrays for high-throughput function determination. Science, 

289, 1760-1763. 

MattonJD.P., Constabel.P. et al. ( 1 990) Alcohol dehydrogenase gene 

expression in pot a to following e lie i tor and stress treatment. Plant 

MoL Biol, 34, 775-783. 
McCanhy.J.E. (1998) Posttranscriptional control of gene expression 

in yeast. Microbiol MoL Biol Rev., 62, 1492-1553. 
Mewes.H.W., Frishman.D. et al. (2000) MIPS: a database for 

genomes and protein sequences. Nucleic Acids Res., 28, 27-40 
MHIai.A.A., Olive.M.R. ct al (1994) The expression and anaerobic 

induction of alcohol dehydrogenase in cotton. Biochem. Genet., 

32, 279-300. 

MolIoy.M.P (2000) Two dimensional electrophoresis of membrane 
proteins using immobilized pi 1 gradients. Anal. Biochem., 280, 
1-JO 

Nauchitrl.V.Y :md Somo»jai,R.l.. 1)994) SpaiiaJ and free energy 
disuibmion patterns of amino acid residues in water inlublc 
proteins. Biophyi. Chrm.. 51. 327-336. 

Nelson.R.YY. NedclV.ov.D. ft al (2000) Biosensor chip mass Spec- 
trometry: a chip- based proieoinics approach. Electrophoresis, 21. 
I 155-1163. 

O'FaneJI.P.ll. 11975) High resolution two-dimensional elcc- 



595 



D.Greenibaum et 



trophoresis of proteins. J. Biol Chem., 250, 4007-4021. 

PandeyA and MannjVj. (2000) Proteornics to study genes and 
genomes- Nature, 405, 837-846. 

Qi,S.Y., MoirA et ol (19%) Proteome of Salmonella typhimurium 
SL1344: identification of novel abundant cell envelope proteins 
and assignment to. a two-dimensional reference map. J. Bacte- _ 
rial, 178. 5032-5038. . 

Ross-Macdonald.P., Coelbo,P.S. et at (1999) Large-scale analysis 
of the yeast genome by transposon tagging and gene disruption. 
Nature,402, 413-418. 

Rost,B., Casadio,R. _tt ol (1995) Transmembrane helices predicted 
at 95% accuracy. Protein Sci.,, 4, 521-533. 

Roih>F.P., Hoghes^D. et ol (1998) Finding DNA regulatory motifs 
within unaligned noncoding sequences clustered by whole- 
genome roRNA quantitation. Nature Biotechnol. r 16. 939-945. 

Rubin.G.M, Yandell,M.D. et al (2000) Comparative genomics of 
the eukaryolcs. Science, 287, 2204-2215. 

Sali,A: (1999) Functional links between proteins.. rvVj/ur-r, 402, 25- 
26. 

Santoni.V., Molloyjvl. et ai (2000) Membrane proteins and pro- 
teornics: un amour impossible? Electrophoreisis, 2), 1054-1070. 

Schena,M, ShalonJO. et al (1995) Quantitative monitoring of gene 
expression patterns with a complementary DNA micioarray. 
Science, 270, 467-470. 

SearlsJD.B. (2000) Using bioinformatics in gene and drug discov- 
ery. Drug Discos Today, 5, , 1 35r- 1 43. 

Senes,A„ Gerslein,M et al (2000) Statistical analysis of amino 
acid patterns in transmembrane helices: the GjuxG motif occurs 
frequently and in association with beta-branched icsidues at 
neighboring posirions. J. Mol Biol, 296, 92 1-936. 

Shapiro.L. andHarris.T. (2000) Finding function through structural 
genomics. Cttrr. Opin. Biotechnol, 11,31-35. 

Sharp.RM and U.W.H. (1987) The codon adaptation index— a 
measure of directional synonymous codon usage bias, and its 
potential applications. Nucleic Acids Res., 15, 1281-1295. 

Sherlock.G. (2000) Analysis of large-scale gene expression data. 
Cutt. Opin. Immunol. 12, 201-205. 



ShevcbenkoA. Jensen,O.N. et al (1996) Unking genome , and 

proteome by mass spectrometry: large-scale identification of 

yeast proteins from two dimensional gels. Proc. Natl Acad. Sci. 
93, 14440-14445. 
Smith,R.D. (2000) Probing proteomes-seeing the. whole picture? 

Nature Biotechnol , 1 8, 1 04 1 - 1 042. 
. Tatusov^R.L, Koomn,E.V. et al (1997) A genomic perspective on 

protein families. Science, 278, 631-637. 
TekaiaJ*:, LazcanoA et al (1999) The genomic tree as revealed 

from whole proteome comparisons. Genome Res.* 9> 550-557. 
VelcuIescu,V.E., ZhangX* etal (1997) Characterization of the yeast 

transcriptome. Cell, 88, 243-251 . 
Wallin,E. and von Heijne,G. (1998) Genome- wide analysis of 

.integral membrane proteins from eubaclerial, archaean, and 

euk aryotic organisms. Protein Sci , 7, 1 029- 1 038. 
Washbum.M.R, Wblters.D. et al (2001) Large-scale analysis of 

the yeast proteome by multidimensional protein identification 

technology. Nature Biotechnol, 19. 242-247. 
WashbumXR and Yates J.R. 3rd (2000) Analysis of the microbial 

proteome. Curr. Opin. Microbiol, 3, 292-297. 
Wiliest- and Friedman JTP (1999) Searching for evidence of 

altered gene expression: a comment on statistical analysis of 

microarray data [editorial; comment]. J. Natl Cancer. Inst., 91, 

400-401. 

Wolf.YJ., Brenner.S.E. et ol (1999) Distribution of protein folds in 
the three superkingdoms of life. Genome Res., 9, 17-26. 

Young, K.H. (1998) Yeast two-hybrid: so many interactions, (in) so 
little time Biol Reprod., 58, 302-31 1. 

ZhangJvl.Q. (1999) Large-scale gene expression data analysis: 
a new challenge to computational biologists (published erra- 
tum appears in Genome Res., 1999, 9, 1 156). Genome Res., 9. . 
681-6S8. 

Zhu.H., KJemicJ.F. et al (2000) Analysis of yeast protein kinases 
using protein chips. Nature Genet., 26, 283-289. 

Zuker,M (2000) Calculating nucleic acid secondary structure. Curr. 
Opin. Struct. Biol, 10, 303-310. 



£>96 



^iSamlSSS^ 1 A P^):999-I009. . Remote, Li „ ks 



Maryland 20889, UsT ^ * **. Nat,0naI NavaI Me ^» Center, Bethesda, 

50% growth iXbition ^SS^S"^ ^ * ^ * e 

database. EXj^Jt^AL^^^^^^^^ . 
conversion of [(3)H]thymidine to IY3 ^HTTMP to ^, ^ } aCtmty Was asWBed by 
expression was deteLCd byW^^^f"^ 
dehydrogenase (DPD) mRNA exnrSI! ^hydropyrimidine 
transcription-P^R RESULTS' TwS" ^""P?™* h V quantitative reverse 
GI(50), 20.8 micmM^^.^^^^.^^ « f°"ow S: 5-FU 
nmol/min/mg (0 1 6-5 7V in J&l ? Q 75 m,cr °M (0.25-237); TK, 0.93 
(0.12-6.41); H ^ **** 1-05 

noted between 5-FU and FdU d ^^fr* wh * T COrrdati ° n W2S 
iwas seen between TS mRNA an Srf„ L • ' ^ 635 3 weak -™<x»^rte correlation 
norTKactivity oondS^K ^SS?^"^^^" 1 ^ 
expression tended to be more sensitive tb 5 FU W ' ^ lmeS With DfD 
wild-type P 53 were sienificantlVmn^ J - Ines Wlth faster doub,in g times and 

The lack ^con^JS^S^SS^^ FDURD: CONCLUSIONS: 
such as p53, the observation thatfc mo™ v *e ".fluence of downstream factors 
also had higher TS leve^ ^d me^tan^Wl !!J 6 _ 68 fester doubling times 
short (48-h) dragexposl ^ ° f thescre « lhat "« a relatively 

PMID: 1 1309351 [PubMed - indexed for MEDLINE] 



G 



CT: |^?^ F * ,5:,, ^ ):li7 - 39 - ■■ Recces. L in ks 



Galauin in pituitary adenomas. 

iTLT^^^^ LundbladL, Aii£garri^ EncsonJK, Aman 
11, Landry M, Schmidt WE, Hokfelt T . Hutting AL 

S-^Ts^^^ ^ D,abc ' eS Unit > Ka "> linska Hospita., 

o i ll to otockholm, Sweden. Eva.Grenbaek@ks.se 

s^X'lA l C ITf ^ meaSUrcd ' n fr ° m human P ,tUlt ^ ade ~ using a 

specific RIA method for monitoring human galanin. Twenty-two out of twenty-four 

levTin^ "I 8a,amn Wlth fl ° tabIy high |CVCIS in adenomas vaSng 

Sn and ACTH'rr: ^.""V™ levels » secreting adenomas. SLfr 
galarun and ACTH contents were closely correlated ,n all tumors. In four young patients 

tot mo" 03 . ^ , hi ' ghly aCt, ' Ve ^ CUSbJn8 0™" 8 a,ani " inversefy?eS 
to tumor volume. The mo.ecu.ar form of tumor galanin, studied with reverse-phase 

mfmS Z 7^ ?T S W,,h "I 6 maj ° rity ° f tVm0r galanin coduti "g with standard 
human galanin. In the tumors analysed with in situ hybridization there was a good 
correlator, between galanin peptide levels and galanin mRNA expression. In some 
tomors galamn mRNA and POMC levels coexisted, in others they were «sen£Sy in 
different cell populates. Levels of plasma galanin-LI were not related to tumor ga anm 

pi^rlT"' "? 83 bnin IeVe,S WCre m thC Same ra " 8C S1 " US P et — tote 
pituitary venous drainage as ,n peripheral blood: Corticotrophin releasing hormone 

pXTo" ^ ACTH ' "° de,eCt3ble release 

Sf.f ■ ' ' *™ nsfrate that corticotroph, but no. GH adenomas, express high 
evels of galanin, m addit.on to ACTH,.and that in some tumors both polypeptides are 

Z uZZ ,"7, r" 1 ! P ° PU,ati0n - HOWCVer > 8a,anl " leVCls P'-nJw'ere no . 
influenced by the rumor galanin content. 

PMID: 14700749 [PubMed - indexed for MEDLINE} 



307: Tnt J Oncol. 1998 Aug;l3(2):379-83. Related Articles. Links 

Protein abundancy and mRNA levels of the adipocytc-typc fatty acid 
binding protein correlate in non-invasive and invasive bladder transitional 
cell carcinomas. 
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The adipocyte type fatty acid-binding protein (A-FABP) is a small molecular weight fatty 
ac.d-bmd.ng protein whose expression correlates both with the grade of atypia and the 
stage of bladder transitional celt carcinomas (TCCs). To determine if the protein 
abundancy correlates with the mRNA levels in non-invasive and invasive lesions we 
haye.ana ysed fresh TCCs (grade II, Ta; grade in, T2-4) by two-dimensional ' 
polyacrylam.de gel electrophoresis (2D-PAGE) and measured the mRNA levels using the 
reverse transcription linked polymerase chain reaction (RT-PCR) Overall the results 
showed a good correlation between protein abundancy and mRNA levels, indicating that 
the lack of expression of the protein observed in some lesions reflects low levels of 
transcnption of the A-FABP gene rather than translational regulation. In addition our 
studies showed that the loss of A-FABP protein observed in some tumors is not ' 
compensated by an mcrease in the skin fatty acid-bindingprotein PA-FABP as is the 
case in the A-FABP knockout mice. ' 
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Abstract The adipocyte type fatly acid-binding protein (A- 
FABP) is a small molecular weight fatty acid-binding protein 
whose expression correlates both with the «rade of atypia and 
the stage of bladder transitional cell carcinomas (TCCs). To 
determine if the protein abundancy correlates with the 
mRNA levels in non-invasive and invasive lesions, we have 
analysed fresh TCCs (grade II. Ta; grade III. T^J by two- 
dimensional polyacrylarnide gel electrophoresis (2D-PAGE) 
and measured the mRNA levels using the reverse transcription 
linked polymerase chain reaction (RT-PCR). Overall, the 
results showed a good correlation between protein abundancy 
• and mRNA levels, indicating that the lack of expression of 
the protein observed in some, lesions reflects low levels of 
iranscripiion of the A-FABP gene rather than translaiional 
regulation. In addition, our studies showed thai the loss of A- 
FABP protein observed in some tumors is not compensated 
by an increase in the skin fatty acid-binding protein PA- 
FABP. as is the case in the A-FABP knockout mice. 

Introduction 

Bladder cancer accounts for 4.7% of ^11 human cancers 
diagnosed. The spectrum of bladder tumors is broad with 
various histological types thai include transitional cell 
carcinomas (TCCs), squamous cell carcinomas (SCCs), and 
adenocarcinomas (1-3). TCCs are by far the most prevalent 
tumors as they represent nearly 95% of all bladder cancers in 
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the Western Hemisphere. At first presentation, about 70% of 
the urinary bladder TCCs are diagnosed as differentiated 
superficial lesions that are confined either to the mucosa 
(Ta), or to trie underlying connective tissue (T,). The rest 
correspond to highly invasive, poorly differentiated tumors. 

Non-invasive TCCs occur as two distinct growth patterns, 
papillary and non-papillary (flat) lesions (1,2); that _ display 
significant differences in their tnaJignanr potential and that 
are believed to originate from different genetic alterations 
(4-6). Papillary carcinomas usually correspond lo low-grade 
lesions which frequently recur multiple times. These tumors 
begin as areas of hyperplasia that later undergo a process of 
dedifferentiation (grades MV). Invasive tumors may arise 
from these lesions, but often are derived from non-papillary 
carcinoma in situ that usually is of high grade al presentation 
and tend to invade and progress to muscle invasion and 
. metastatic disease. 

To date, many attempts have been made to pinpoint genetic 
changes that underly progression of bladder cancer as well as 
to identify molecular markers that correlate with tumor 
progression. Cytogenetic studies and molecular gene uc data 
have shown thai chromosomes 3p, 4p, 4q. 5'q, 8p. 9p, 9q, 
Hp. I3q. 14q, I7p and I8q are frequently altered in bladder 
urothelial tumors (4,5 and refs. therein), and as a whole they 
have supported the notion thai bladder cancer is a mulustep 
process. Recently, Spruck et al (6) showed that chromosome 
9 alterations occur early during development, while p53 
mutations appear later in the process and confer invasive 
properties. The situation however is reverse in the case of 
Cis, as a large fraction of these lesions contain P 53 mutations 
(5,6,8,9). Besides pointing towards two divergent pathways of 
bladder tumor progression, these studies implied that the order 
in which the genetic changes occur is important in determining 
the outcome of the lesion. 

Assessment of bladder cancer »s based on a thorough 
pathological examination of biopsy material which establishes 
the histological type of the tumor, its degree of differentiation 
(srade), and depth of invasion of the bladder wall (staging) 
( II 0- 12). In spite of strict criteria for the pathological assessment 
Of these lesions, there exist a significant inicr-palhologisi 
variation a fact that emphasises the need for objective 
markers that may aid their classification. With this in mind 
we ore exploring ihc possibility of using proteome (13) 
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expression profiles of these lesions as fingerprints to define, 
their grade of alypia and eventually their stage (314). So far, 
more than 400 tumors of various grades and stages have been 
analysed by two-dimensional poly aery lamide gel 
electrophoresis (2D PAGE), and preliminary experiments 
have shown that even though the overall protein expression 
profiles of tumors of the same grade and stage are very similar, 
there are important differences suggesting that morpho- 
logically 'identical' TCCs may be further subdivided (I). Of 
the biomarkers of TCC progression identified so far, the 
adipocyte-type fatty acid binding protein (A-FABP) is 
perhaps one of the most interesting as the levels of this 
polypeptide have been shown to correlate both with the grade 
of atypia as well as with the stage of the disease (3). Given 
the putative importance of A-FABP as a progression marker, 
and since Anderson and Seilheimer (15) recently showed that 
post-transcriptional regulation of gene expression is a frequent 
phenomena in higher organisms, we have compared the levels 
of A-FABP mRNA and protein in non-invasive and invasive 
bladder TCCs expressing and lacking this protein. 

Materials and methods 

Tumors. Fresh bladder tumors were obtained immediately 
after transurethral resection.. The grade and clinical stage of ■ 
the tumors were determined by the pathologist .at the Aarhus 
Municipal hospital. Clean tumors devoid of blood clots were 
divided into small pieces for 2D PAGE and DNA, and RNA 
preparation.. The latter were immediately frozen in liquid 
nitrogen and store at -JJO'C until use. 

l J *S]-methionine labeling and 2D- PAGE. In a few cases, 
small tumor pieces were labeled with PS}-methionine as 
previously described (3). 2D- PAGE was performed 
according to published procedures (16; see also http: 
. //biobase.dk/cgi-bin/celis). 

RT-PCR. Frozen tumor samples were ground lo powder in 
liquid nitrogen and total RNA was isolated using the acid 
guanidium isothiocyanate/phenol chloroform extraction 
procedure (17). The samples were treated with RNase-free 
DNases I (Pharmacia) to eliminate contaminating genomic 
DNA using the protocols recommended by Ihe supplier. 
Poly(A)* RNA was prepared using Poly (A) : Quick columns 
according lo the manufacturers instructions (Stratagene). The 
synthesis of cDNA for RT-PCR reactions were carried out 
using the Oibco BRL Superscript Kit. Two pg of total RNAs 
was mixed with oligo -dTll primer, PCR buffer. MgCl, 
(25 mM), 0.1 M DDT and 10 mM dNTP The mixture was 
incubated at 42*C for 5 min followed by the addition of 
Superscript ]) reverse transcriptase and funhci incubation at 
42'C for 50 min. The reaction was terminated by raising the 
temperature to 70'C for 15 mm, followed by additional 
incubation at 37*C for 20 min in the presence of RNase H to 
deplete the RNA. Primers for known and cloned genes were 
purchase from Pharmacia as follows: A-FABP, Upper (from 
186-208 bp) 5--GATCATCACTGTCAATGGGCAT-37 
lower (from 374-397 bp) 5* CATCCTCTCGTTlTCTCTiT 
ATG-3"; B-actin upper 5 - GAGGTIGGCTCTGACTGTaCC 
AC 3/lower 5-CTCA7TCAGCTCTCGGAACATCTCG-3'. 



Tabic I. Expression of A-FABP in noa-invasive and invasive 
bladder TCCs: 
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"The levels of a A- ; FABP protein were determined based on the 
visual analysis of Coomassie Brilliant Blue stained gels and 
represent the average estimate of at least two different runs. Tumors 
scored as positive differed significantly with respect to the levels of 
the protein, and therefore are indicated with either four (very high), 
three (High), two (medium) and one (low) cross (see also Fig. I); 
'The mRNA levels were determined based on the intensity of 
Eihidium Bromide stained cDNA bands separated on agarose gel 
using the Bio-Rad Ce) Doc 1000 system and represent the average 
estimate of ai least ihree independent experiments. Corresponding 
mRNA levels are represented by crosses as described above. 



PCR reactions were performed in a Progene thermal cycler 
using the Advantage Klen Tag Polimerase (Clontech). The 
cycling parameters consisted of 30 sec of denaiuralion at 
94"C, with annealing of 30 sec at 60X for B-aclin or at 
64*C for A-FABP. The extension time was for 2 min at 68"C 
for 29-40 cycles with the final extension of 7 min at 68*C The 
PCR products were separated on 1.5% agarose gel 
electrophoresis followed by eihidium bromide staining and 
photography under UV light. 

Results 



A-FABP protein levels in non invasive and invasive TCCs. 
One hundred suspected TCCs removed at the Department of 
Urology, Skejby Hospital, were analysed by high resolution 
2D PAGE and Coomassie Brilliant Blue staining. Of these, 
10 grade J). Ta TCCs (Table 1) were chosen to correlate A- 
IABP protein and mRNA levels as these lesions yielded 
acceptable protein profiles both in terms of their purity as 
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Figure I 1EF 2D gels or whole cellular extracts fiom non-invasive and 
invasive TCCs. A. TCC 532-1; B. TCC-692- 1 ; C TCC 763- I; D.TCC 709-1 
and E. TCC 711-1. Only ihc relevant area of the gels are shown. 



assessed by monitoring for the absence of vimentin 
(contamination with connective tissue) and desmin 
(contamination with smooth muscle cells), as well as 
polypeptide resolution. In addition, reasonable amounts of 
these tumors were available for mRNA preparation. 

Table J shows the levels of A-FABP protein expression in 
the 10 tumors analysed by 2D PAGE The data were scored 
entirely based on the visual analysis of Coomassie Brilliant 
Blue stained gels and represent an average estimate of at 
least two different runs. Tumors scored as positive differed 
significantly with respect to the levels of this protein; and 
therefore are indicated with either four (very high), three 
(high), two (medium) and one cross (low). Representative 
examples of Coomassie stained 2D gels of tumors exhibiting 
very high (TCC 532-1, Fig. 1A\ high (TCC 692-1; Fig. IB), 
medium (TCC 763-1, Fig. 1C) and undetectable levels (TCC 
709-1 and TCC 711-1 Fig. ID E) of A-FABP are shown in 
Fig. 1 (only the relevant area of the gels are shown). 

A-FABP mRNA levels in non- invasive grade 1 1 To TCCs. 
Since in many instances only a limited amount of fresh tumor 
was available, we used RT-PCR to determine the levels of A- 
FABP mRNA in the ten TCCs analysed by 2D PACE (Fig. I). 
Following amplification, the PCR products were analysed by 
conventional 1.5% agarose gel electrophoresis and visualised 
by eihidium bromide staining as shown in Fig. 2. The amount 
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Figure 2. RT-PCR analysis of A-FABP mRNA expression in non-invasive 
iGrll. Ta) and invasive TCCs (Gr 111, T r T,). For RT-PCR analysis, the ss e- 
DNA was synthesized by Reverse Transcriptase using lolal RNA. and used 
for RT-PCR amplification. The PCR "products were resolved on 1:5* 
agarose gels and visualised under UV light following eihidium bromide 
staining. The A-FABP panels show the results of amplifications where the 
pair of gene specific primers was used to generate the 220 bp DNA 
fragment. AmpbTrcatiori of A-FABP was obtained after 30 cycles of PCR. 
The B-atlin panels represent the amplification of the B-actin gene, which 
was used as an internal control to confirm thai equal amounts of c-DNA were 
used in each reaction. ■ • 



of cDNA in each lane was normalised using several house- 
keeping genes so as to achieve a more accurate assessment of 
the expression of the A-FABP mRNA. As shown in Fig. 2, 
TCC 532- 1 exhibited the highest amount of A-FABP mRNA, 
followed by TCCs 692-1,763-1, 616-1. 581-1, 154-1. 166-5 
and 533-1 Undetectable levels of A-FABP mRNA were 
observed in the case of TCCs 607-1 and 709- ! (Fig. 2). 
Relative mRNA levels for the ten TCCs are given m Table ). 

A-FABP protein and mRNA levels in invasive grade III T„ 
TCCs. Of the invasive TCCs (grade 111. T 2 J analysed by 2D 
PACE only six yielded reasonable protein profiles for further 
study. As shown in Table I. none of these lesions expressed 
delectable levels of A-FABP as determined by Coomassie 
Brilliant Blue staining (Fig. IE, TCC 711-1). In line w,th these 
results, the RT-PCR analysis of these tumors also revealed 
undetectable level of A-FABP mRNA (Fig. 2. Gr IB T.-T,; 
Table 1)- 

Loss oj A-FABP proiein is not compensated by on increase in 
PA-FABP. Recent studies of A-FABP knockout mice have 
shown that the loss of A T FABP in fat tissue is compensated 
by an increase in the skin fatty acid-binding protein mall (W- 
Our studies, however, indicated that the human homologue 
of mall, PA-FABP (19), did not compensate for the loss ol 
A-FABP either in the non invasive or the invasive tumors 
analysed in this study (Fig. ID and E). In addition, Fi*. 3 
shows 2D gels of V } S}- methionine labeled proteins from two 
o.ade I). Ta TCCs ( 192-4. T,; F,g. 3A and 192-4, T ; . Fig. 3B). 
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Figure 3 Levels of A-FABP and PA-FABP protein in * JM )c II, Ta tumors 
resected from Ihe same patient. The iWo upper panels show iht 2D «l 
aoton,dK, gra ms of I^Sj-melhionme labeled proteins from TCCs (grade » 
Td) resected from ihe same patient. A, TCC 192-9 tumor 4 and B - TCC 
192-9 lumor I. Only the relevant area of the audiograms are sho^n The 
low panel shows the RT-PCR analysis of A-FABP mRNA expression in .he 
same lumors (see also legend to Fig. 2). 



whtch dtffer significantly in their levels of A-FABP prolein 
and mRNA (Fig. 3, low panc |). As shown in Fig 3 the 
decrease in A-FABP observed in TCC 129-4, T, is not 
accompanied by an increase in the PA-FABP protein (Fig. IE). 

Discussion 

Of the TCC progression markers identified to date A-FABP 
is perhaps one of the most interesting as its presence correlates 
both with the grade of ntypin (p=0.0006) and the stage of the 
disease (p=0.0269) (3). A-FABP ,s a low molecular weight 
protein belonging to a cytosolic multigene family of lipld- 
bmdmg proteins that include heart, liver, intestinal, muscle, 
brain, skm and epiihelialisoforms (20). Members of the 
farTul * 3re hi 8 hl r expressed in differentiated cells and 
show narrow tissue distribution. Their precise function ts at 
present unknown, although there is evidence suggesting thai 
they may play roles in intracellular lipid transport and 
metabolism,, signal transduction (21,22) as well as *rowth 
control and differentiation (23). The role m signal .ransduction 
has been inferred from the fact that long-chain Tatty acids and 
their metabolites can act as primary and second messengers 
»n specf.c s.gnalling pathways (24). Recently. It has been 
shown that A-FABP may play a central role in the pathway 
rhat links obesity with insulin resistance, most likely by 
connecting the fauy acid metabolism with the expression of - 
TNF-a (18). Furthermore, there is evidence indicaung thai 
ihe A- FA BP gene conta.ns sequence information necessary 
tor differentiation-dependent. expression in adipocytes (25) 
Our own data denved from the study of TCCs arid normal 
urothehum suggest that A-FABP may be retired for normal 
urothehum drllerentiaiion (I). as may be .he case lor PA- 
FABP m ihe skin (19) 



Considering the potential prognostic value : of A-FABP 
protein and/or mRNA in TCC progression it was important io 
determine if the levels of both type of macromolecules 
correlated both in the non-invasive and the mvasive lesions 
expressing and lacking A-FABP. The need for such correlation 
was underlined by recent studies of Anderson and Seilhamer 
(15), who reported a lack of overall coirelapon between the 
mRNA and protein levels of. 45 rat proteins analysed by 2D 
PACE in combination with cDNA arrays. Tneir data yielded a 
correlation coefficient of 0.45 which is hair way between weak 
and perfect correlation. Clearly, our data showed a very'good 
correlation between the protem and mRNA levels of A-FABP 
in all tumors analysed indicating thai the loss of A-FABP 
protein observed in. some tumors is not due to posi- 
transcriplional regulation. 

Recently, knockout mice carrying a nuli mutation in the 
aP2 gene encoding for A-FABP was produced (18). These 
animals do not show an obvious morphological or metabolic 
phenotype. but exhibit a 20-fold increase in the levels of the 
kerauriocyte type FABP {maU\ which may compensate for 
the loss of the deleted gene (18). The human homologue of 
the mat} gene, PA-FABP, was cloned in our laboratory and 
has been shown to be highly upregulated in psoriatic skin as 
well as m abnormally differentiated primary keraiinocytes 
(19). PA FABP is expressed in normal urotbelium together 
with A-FABP (3), and ongoing studies in the laboratory have 
shown that its level decreases significantly as tumors progress 
Interestingly, the studies reported in this article did not revealed 
a compensatory up- or down-regulation of PA-FABP. in the 
TCCs analysed, supporting the contention that PA-FABP may 
also play a role in cell growth and differentiation (19). . 
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Restored expression and activity of organic ion transporters i OAT1 
rOAT3 and rOCT2 after hyperuricemia in the rat kidney. 
HabuY, YanoJ, Qkuda m, Fukatsu A . fning 

Department of Pharmacy, Kyoto University Hospital, Faculty of Medicine, Kyoto 
University, Sakyo-ku, Kyoto 606-8507, Japan. 

We previously reported that in hyperuricemia rats, renal impairment occurred and organic 
ofr^Z act ! Vlt ^ ecre ^ accompanied with a specific decrease in the ex'LsfoT 
of ratorgan,caruon transporters, rOATt and rOAT3, and organic cation transporter 
rOCT2. In the present study, we investigated the reversibility of the organic ion transport 
actmty and express.on of organic ion transporters (slc22a) during recovery frTm 

• Hy P erut i cemia ^s induced by the administration of a chow containing 
unc acd and oxomc acd, an inhibitor of uric acid metabolism. Four days after 

teSTST 06 !? ^ Ch ° W ' 'J' P,aSma Ulic ^ co "™ io n ^"ied to the normal 
aXi t l ^ Ctl °" S ft M CrCatirU > e C,earance BUN >™*= were restored 
although the recovery of tubulomterstitial .injury was varied in sites of the kidney ' 
nmi - a UP oS f P™ inohi PPurate (PAH) and tetraethylammonium (TEA) and both 
protem and mRNA levels ofrOATl, rOAT3 and rOCT2 in the kidney gradually 
improved dunng 14 days of recovery from hyperuricemia. Basolateral PAH transport 
showed a bgl.er conation with the protein ieve. ofrOATl (r(2)=0.80)Vhahr6A?3 

^^S^T > ? ml h " anSPOrt Sh ° Wed 3 C0rreIati0n with 
SS^rfioS? P f °.f f0ne C ° nCentration ' wh ' ch « « dominant factor in 
£ e ^'at,on ofrOCT2, was gradually restored during the recovery from hyperuri^ 
, <l IT ^ P,aSraa testosterone ^el and rOCTC protein express^ 

n the Mney was not significant. These results suggest that the regulation of organic ion 

foT^oVT!' r ° AT3 31,(1 ^ by h ^™- ,s reversible, and EgTc 
ion transport acUv.ty restores according to the expression levels of these transporters. 
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Regulation of cytochrome P450 1 Al in teleosts: sustained induction of 
CYP1A1 mRNA, protein, and catalytic activity by 2,3,7,8- 
tetrachlorodibenzofuran in the marine fish StenotomuVchrysops. 

Hatin ME . Stegeman JJ . 

Biology Department, Woods Hole Oceanographic Institution, Massachusetts 02543. 

Cytochrome P4501 Al (CYPI Al) is known to play important roles in the activation and 
detoxificaaon of carcinogens and other toxicants in vertebrate animals, including fish 
Although extensively studied in mammalian systems, the regulation of CYP 1 A forms in 
other vertebrates is less well understood. We examined the time course and dose- 
response relationships for induction of CYPI Al mRNA, protein, and catalytic activity by 
2^,7,8-tetrachIorodibenzofuran (TCDF) in the marine fish Stenotomus chrysops (scup) 
The time course of CYP 1 Al induction was determined following a single ip dose (10 
nmol/kg) of 2,3,7,8-TCDF. Hepatic ethoxyresorufin O-deethylase activity was increased 
after I day, reached a maximum by 8 days, and was still elevated 14 days after treatment 
The content of immunodetectable CYPlAl protein in liver was elevated on Day 1 and 
continued to increase through 14 days. CYPI Al protein content was also strongly 
induced m heart and gill beginning at 2 days after treatment and extending through Day 
14. Hepatic CYP 1 Al mRNA was strongly induced by 1 day after dosing and remained ' 
elevated through 14 days. The sustained induction of CYPI A 1 mRNA by 2 3 7 8-TCDF 
contrasts with the transient induction seen previously in fish treated with nonhabgenated 
inducers and most likely reflects differences in persistence of the inducers. Dose-response 
stud ieS ind.cated that induction of CYPI A 1 mRNA, protein, and catalytic activity 
occurred following doses of 2,3,7,8-TCDF as low as 0.4 nmol/kg (120 ng/kg) within the 
range of whole-body contents of this congener measured in fish from contaminated 
environments. The estimated dose producing half-maximal CYP I Al induction in scup 
was approximately 2-10 nmol/kg, suggesting that the sensitivity of these fish to induction 
may be as great as or.greater than that of rats. In contrast to previous results obtained with 
S^^'-tetrachlorobiphenyl (TCB) and beta-naphthoflavone, which appear to inhibit or 
inactivate CYPI A 1 in fish and other vertebrates, there was a good correlation among 
levels of CYPI Al mRNA, protein, and catalytic activity in individual fish following 
various doses of 2,3,7,8-TCDF. The difference in response to 2,3,7,8-TCDF versus 
3,3',4,4'-TCB may reflect differences in the inducing potencies of the two compounds 
relative to their similar potencies as inhibitors of CYPI A 1 catalytic activity. In additional 
studies to evaluate structure-activity relationships for CYPI Al induction by chlorinated 
dibe^ofurans ,n fish, scup were treated with 2,3,6,8-telrachlorodibenzofuran (2,3,6,8- 
1 CDF). At 10 or 50 nmoi/kg, 2,3,6,8-TCDF was inactive as an inducer of CYPI Al 
mRNA, protein, or catalytic aclivity.(ABSTRACT TRUNCATED AT 400 WORDS) 

PM1D: 8048062 [PubMed - indexed for MEDLINE] 
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Expression of the pS2 gene in breast tissues assessed by pS2-mRNA 
analysis and pS2-protein radioimmunoassay. 

Hahnel E, Robbuas P. Harvey J, Sterrett G , Hahnel R . 

Department of Pathology, University of Western Australia, Queen Elizabeth II Medical 
Centre, Nedlands. 

The expression of the pS2 gene in breast tissues was assessed by measuring pS2-protein 
using a radioimmunoassay, and by determining pS2-mRNA using Northern blotting. 
There was a good correlation between the two measurements, indicating that expression 
of the pS2 gene in breast tissues may be assessed by either method . Since 
radioimmunoassay is technically easier and more efficient than Northern blotting, 
radioimmunoassay will be the method of choice in routine applications. 
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Brief commun ication 

Expression of the pS2 gene in breast tissues assessed by pS2-mRNA analysis 
and pS2-protein radioihimunpassay 



Erika HahnejT, Peter Robbins, Jennet Harvey, Gregory Sterrett and Roland Hahnel ' 

Department of Pathology, University of Western Australia, Queen Elizabeth II Medical Centre Nedlands 

6009, Western Australia . * 



Key words: breast tissue, pS2-mRNA, pS2 protein, radioimmunoassay 

Summary ... . . 

The expression of the pS2 gene in breast tissues was assessed by measuring pS2-prorein using a radioimmu- 
noassay, and by determining pS2-mRNA using Northern blouing.There was a good correlation between the 
two measurements, indicating that expression of the pS2 gene in breas! tissues may be assessed by either 
method. Since radioimmunoassay is technically easier and more efficient than Northern blotting, radioim- 
munoassay will be the method of choice in routine applications. 



Introduction 

Expression of the pS2 gene is controlled by estm- 
gen. This was fnsi described in the MCF-7 breast 
enmer cell line [)}. pS2 expression has since been 
reported to be useful as a prognostic indicator f2, 
3], although this was not confirmed in another se- 
ries^]. 

pS2 expression may be assessed in tissue ho- 
mogenates by analysis of pS2-mRNA [5], by ra- 
dioimmunoassay of the pS2-protein [2], or by in> 
munocyiocbemical detection of thepS2 protein in 
tissue sections [5], It was the aim of this study to 
establish the. correlation between pS2-mRN A and 
pS2-protein.by radioimmunoassay in a series of 
tissues obtained from mastectomy specimens per- 
formed for carcinoma of the breast. Primary breast 
carcinoma tissue, metastatic carcinoma within ax- 
illary nodes, and macroscopjcaJJy benign breast 
tissue were examined. 



Materials and methods 



Breast tissues 



Tissue specimens from mastectomies performed 
for carcinoma of the breast were examined. 32 pri- 
mary breast carcinomas. 10 axillary lymph nodes 
containing metastatic breast carcinoma, and 20 
samples of uninvolved breast tissue were analyzed 
for pS2 expression, 

. The primary breast carcinomas were histolog- 
ically classified using a conventional subclassifi- 
cation. The presence or absence of primary tumour 
was assessed. The presence of metastatic carcino- 
ma within lymph nodes studied was verified by 
histological examination of the node remnant after 
sampling. 

'Uninvolved* breast tissue was sampled from 
sites well removed from the primary breast tumour 
(usually in another quadrant of the breast), and was 
selected only if the tissue appeared macroscopical- 
ly unremarkable'. Tissue sampling occurred irrime- 
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diately upon arrival of the mastectomy specimen in 
. the laboratory, with minimal delays between re- 
moval and sampling. 

Tissues for pS2 analysis were snap frozen in liq- 
uid nitrogen iand stored at -70°C until prqcessed. ' 

Extraction of RNA and 'determination of 
pS2-mRNA • ., 

Details of the procedure have been described in our 
previous paper [6). Briefly, the deep-frozen tissue 
was homogenized in a micrp-dismembrator. The 
homogeneous powder, was extracted with guani- 
diniumisothiocyanatephenolchlorofbrmiso- 
amylalcohol, and RNA was precipitated with iso- 
propanol. The washed RNA pellet was dissolved in 
SDS and glyoxylated, and the RNA preparation 
loaded onto agarose gel After electrophoresis the 
gel was capillary blotted onto Zeta-probe mem- 
branes. Membranes were hybridized overnight 
Avith cDNA.probes pS2 and 36B4, which were la- 
beled with [crT) dCTP by nick translations, 
: Washed membranes were exposed to Kodak X- 
qmat AR film. Relative intensities of the mRNA 
bands were assesstd visually as not detectable, ve- 
ry weaki weak, medium, strong, and very strong, 
taking the intensities of the ubiquitous 36B4 bands 
into account. 



Rod ioimm unoassny of pS2- protein 

Deep frozen specimens were pulverized with a mi- 
crodismembrator. The tissue powder was suspend- 
ed in 10 volumes of pH7.5 phosphate buffer. The 
homogenate was centrifugedin a refrigerated cen- 
: trifuge at 4°C for 60 minutes at 2600xg. The su- 
pernatant was removed with a Pasteur pipette, 
carefully avoiding the fat layer on the top. The pro-' 
tein concentration in the supernatant was estimated 
by use of the Coomassie dye-binding method [7J. 
An aliquot of the supernatant was diluted to a pro- 
tein concentration between 1 and 2mg/ml before 
assay of the pS2- protein. In one case the protein 
concentration of the supernatant was well below 
lms/mf . 



The estimation of the pS2-protein was perform- 
ed using a solid phase, two-site radioimmunoas- 
say. The kits were bought from CIS Biointerna- 
tional, Gif-sur-Yvette, France (ELSA-PS2). In this 
method the molecules of pS2 are sandwiched be- 
tween two monoclonal antibodies; the first one is 
coated on the EJLSA solid phase, the second one is 
radiolabeled with J25-iodine. The radioacoMty 
bound to the ELSA is proportional to the concen- 
tration of pS2-protein. Details of the procedure are 
supplied with the kit [8], " 

Results and discussion 

32 primary breast carcinomas, metastatic breast 
carcinoma in 10 lymph nodes, and 20 samples of 
benign breast tissue from mastectomies were irn 
vestigated. Two of the carcinomas were of the infil- 
trating JobuJar type, two were ductal carcinomas in 
situ, one was a multicentric invasive ductal carci- 
noma, all others were invasive ductal carcinomas. . 

Examples of pS2 Northern blots have been 
shown in our previous paper [6] which demon- 
strate that undegraded pS2.mRNA can be isolated 
by the method used. 

The results of the pS2-protein and pS2-mRNA 
assays are shown on Kg. J. There was a good corre- 
lation between-tbe two types of results. When pS2- 
rnRNA could not be detected by Northern blot, 
pS2-protein results were usually below Ing/mg 
protein (22 of 30), or between J and 3.7ng/mg (6 of 
30). Two were exceptions (7.7 and 14.6); one of 
them could have been due to the very low protein 
content in the cytosol which would lead to a large 
pS2 value and an associated error. There was no 
explanation for the other high result. Very weak 
pS2-mRNA signals on Northern blots correspond- 
ed to pS2-protein values between 1.1 and 19.2 with 
an average of 6.6ng/mg protein (median 5.7); The 
mean arid median pS2-protein concentration in die 
tissue with weak pS2-mRNA signals were 14.3 and 
J 0.7 ng/m g protein, respectively. The average pS2- 
protein concentration increased to 32.7 (median 
3J.5)ng/mg protein for tissues assessed as medium 
pS2-rnRNA intensity, and to 43.3 (median 53.8) 
ng/mg protein for tissues with strong or very strong 
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pS2-mRNA signals. These values should be used 
as an approximate guide only, since the number of. 
samples in the various groups was fairly smalh 
One-way analysis of variance confirmed that the 
means, of the pS2-protein values in the groups 
made up according to their pS2-mRNA signal in- 
tensity, were significantly different (p 

If the pS2 gene is expressed, its expression is on 
average greater in breast carcinomas than in unin- 
volved breast tissue. If one takes pS2-protein val- 
ues above 4ng/mg protein as cut-off, the average 
pS2-protein in J4 breast cancers was 343 (median 
(35.2)* while it was only 18.) (median 13.8) in 12 
uninvolved breast tissue samples.. ]f the cut-off is 
taken at 10ng/mg protein, average pS2-protein in 
breast carcinoma is also about twice the level of . 
uninvolved tissue. There were not enough lymph 
node metastases which expressed the pS2 gene to 
allow a comparison with carcinbma or uninvolved 
breast tissue. , 

Recent preliminary results of pS2 by radioim- 
munoassay [9] are similar to ours for breast cancer 
but considerably lower than our results for normal 
breast tissue. 

The values of the pS2-protein measured obvi- 
ously depend on the protein used for calibration. 
We used the pS2-protein standards supplied with 
the CIS kit, which according to the supplier gave 
values from 0 to 740ng/mg protein in a series of 
205 breast cancer cytosols. Previously, a different 
standard had been used for presumably the same 
series of breast carcinomas [2], and a conversion 
factor to current standards is given as 2.8 [8].' 

It was noticed that the correlation between pS2- 
protein and pS2-mRNA was better in breast carci- 
noma specimens than in uninvolved breast tissue. 
This is unexplained, though it could be due to ihe - 
variable content of cell or tissue types in adjoining 
parts of a specimen, a variation more likely to oc- 
cur in our sampling of non- malignant breast tissue 
compared to sampling of carcinomas. A similar 
variability in breast carcinoma specimens will 
probably have a smaller influence on the pS2 re- 
sults, since the malignant cells - if they do express 
the pS2 gene - contain more pS2-proiein than nor- 
mal breast. 
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The role of the epidermal growth factor receptor in sustaining neutrophil 
inflammation in severe asthma. 
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BACKGROUND: The extent of epithelial injury in asthma is reflected by expression of 
the epidermal growth factor receptor (EGFR), which is increased in proportion to disease 
severity and is corticosteroid refractory. Although the EGFR is involved in epithelial 
, growth and differentiation, it is unknown whether it also contributes to the inflammatory 
response in asthma. OBJECTIVES: Because severe asthma is characterized by 
neutrophilic inflammation, we investigated the relationship between EGFR activation and 
production of IL-8 and macrophage inhibitory protein- 1 alpha (MlP-1 alpha) using in 
vitro culture models and examined the association between epidielial expression of IL-8 
and EGFR in bronchial biopsies from asthmatic subjects. METHODS: H292 or primary 
bronchial epithelial cells were exposed to EGF or H202 to achieve ligand-dependent and 
ligand-independent EGFR activation; IL-8 mRNA was measured by real-time PCR and 
IL-8 and MIP-1 alpha protein measured by enzyme-linked immunosorbent assay 
(ELISA). Epithelial IL-8 and EGFR expression in bronchial biopsies from asthmatic 
subjects was examined by immunohistochemistry and quantified by image analysis; 
RESULTS: Using H292 cells, EGF and H202 increased IL-8 gene expression and 
release and this was completely suppressed by the EGFR-selective tyrosine kinase 
inhibitor, AGI478, but only partially by dexamethasone. MlP-lalpha release was not 
stimulated by EGF, whereas H202 caused a L8-foId increase and this was insensitive to 
AG1478. EGF also significantly stimulated IL-8 release from asthmatic or normal 
primary epithelial cell cultures established from bronchial brushings. In bronchial 
biopsies, epithelial IL-8, MlP-lalpha, EGFR and submucosal neutrophils Were all 
significantly increased in severe compared to mild disease and there was a strong 

- correlation between EGFR and IL-8 expression (r - 0.70, P < 0.001). CONCLUSIONS: 
These results suggest that in severe asthma, epithelial damage has the potential to . 

. contribute to neutrophilic inflammation through enhanced production of IL-8 via EGFR- 
dependent mechanisms. 
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Abstract 

Lung cancer is the most common cause of cancer death 
in developed countries. The prognosis is poor, with less than 
15% of patients surviving 5 years after diagnosis. The poor 
prognosis is attributable to lack of efficient diagnostic meth- 
ods for early detection and lack of successful treatment for 
metastatic disease. Most patients (>75%) present with stage 
)J1 or IV disease anil are rarely curable with current ther- 
apies. Within the last decade, rapid advances in molecular 
biology, pathology, bronchology, and radiology have pro- 
vided a rational basis for improving outcome. These ad- 
vancements have led to a better documentation of morpho- 
logical changes in the bronchial epithelium before development 
of clinical evident invasive carcinomas. This has changed our 
concept of lung carcinogenesis and emphasized the multistep 
carcinogenesis approach on several levels. Combined with 
the technical developments in bronchoscopic tec hniques, 
e.g., Jaser-induced fluorescence endoscope (LIFE) bronchos- 
copy, we now have improved methods to localize preinvasive 
and early-invasive bronchial lesions. "With the LIFE bron- 
choscope, a new morphologiral entity (angiogenic squamous 
dysplasia) has been recognized, which might be an impor- 
tant biomarker and target for anliangiogenic c bcrnoprrven- 
live agents. To reduce the mortality of lung cancer, these 
new technologies have been taken into the clinic in different 
scientific settings. The use of low- dose spiral computed to- 
mography in the screening of a high-risk population has 
demonstrated the possibility of diagnosing small peripheral 
tumors that are not seen on conventional X-ray. A shilt in 
the therapeutic paradigm from targeting advanced clinically 
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manifest Jung cancer toward asymptomatic preinvasive anil 
early- invasive cancer is occurring. The present article re- 
views the recent advances in the diagnosis of preinvasive and 
early-invasive cancer to identify biomarkers for early detec- 
tion of lung cancer and for c he mopr event ion studies. 

Introduction 

Lung cancer is the most common cause of cancer deaths in 
the countries of North America and other developed countries, 
accounting for 29% of all cancer deaths and more deaths than 
from prostate, breast, and colorectal cancer combined in the 
United States (1). Lung cancer will be diagnosed in —170,000 
new patients in the United States in the year 2000, and < 15% of 
them will survive 5 years after diagnosis ( 1). The prognosis for 
the patients with lung cancer is strongly correlated to the stage 
of the disease at the lime of diagnosis. Whereas patients with 
clinical stage 1A disease have a 5-year survival of about 60%, 
the clinical stage 11-1 V disease 5-year survival rate ranges from 
40% to less than 5% (2). Over two-thirds of the patients have 
regional Jympb-node involvement or distant disease at the lime 
of presentation (3). The poor prognosis is largely attributable to 
the lack of effective early detection methods and the inability to 
cure metastatic disease. The unsatisfactory cure rates supports 
efforts aimed at eoily identification and intervention in lung 
cancer. 

Historically., the only diagnostic tests available for the 
detection of lung cancer in its early stages were chest radiogra- 
phy and sputum cytology. The efficacy of these tests as mass 
screening tools was evaluated in controlled trials sponsored by 
the NO 3 3nd conducted at Johns Hopkins University, Memorial 
Sloan- Kettering Cancer Center, and the Mayo Clinic during the 
1970s (4-6). The principal goal of these studies was to deter* 
mine whether a j eduction in lung cancer mortality could be 
achieved by adding sputum cytology testing to annual screening 
by chest radiography. Results from these trials showed that both 
tests could detect presymptomatic, eaily-stage carcinoma, par- 
ticularly of squamous cell type. Fesectabiliry and survival rates 
were found to be generally higher in the study groups than in the 
conuol groups. However, improvements in rcscc lability and 
survival did not lead ro a reduction in overall lung cancer 
morialiiy. ihc must criiical end point. A subsequent shidy of 
6340 Czechoslovak tan male smokers also found no reduction in 
hint' cancer mortality after dual screening bv chest radiography 
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and spuhmi cyroJogy (7). The negative results from these 
■ screening studies lead the NCI and other health policy and 
research groups to conclude that mass screening programs in- 
volving periodic sputum cytolpgical evaluation and chest radio- 
graphs could not be justified. However, controversies in the 
methodology and interpretation of the data from these studies 
have later been extensively discussed (8, 9). One additional 
study of annual chest X-ray screening is currently being con- 
ducted by the NCI; The Prostate-, Lung-, Colorectal-, and Ovar- 
ian (PLCO) screening trial. This trial includes individuals 55-74 
. years old, but they are not selected for this trial on the basis of 
high risk for lung cancer smoking history with >20 

pack-years). 

The failure of clinical trials to demonstrate the efficacy of 
sputum cytology and chest radiography as mass screening tools 
has resulted in a search for better diagnostic approaches for 
early lung cancer detection that take advantage of recent devel- 
opments in molecular biology, gene technology, and radiology 
(10). Furthermore, as has been the case for mammography 
screening for breast cancer, it has also been important lo identify 
risk groups for lung cancer; 

Although, much is known about predisposing factors, nat- 
ural history, and the outcome based on histology and stage, our 
understanding remains very incomplete in many areas. What are . 
the early premalignant changes molecularly, biochemically, and 
morphologically? Which changes are reversible and which are 
not? What research tools are available to provide answers to 
these questions? The identification of preinvasive lesions allows 
for developing promising methods for early intervention fl J). 
The therapeutic paradigm and focus arc today shifting from 
targeting only clinically verified lung cancer as previously to- 
ward targeting, the premalignant and early- malignant lesions. 
Furthermore/ the prospect of lung cancer screening has today 
become more meaningful as a consequence of recent develop- 
ments in biology and radiology and belter possibilities to define 
high-risk populations most suitable for lung cancer screening 
(12). ^ 

The present article will focus on the clinical perspectives of 
out biological knowledge of premalignant and early- malignant 
lesions and the potential of the recent technological advance- 
mem for early diagnosis of lung cancer. 

Pathology of Preinvasive and Early Jnvasive 
Bronchial Lesions 

Most of the cffoits to classify lime cancer have been 
directed toward invasive carcinoma (13). However, better un- 
derstanding of the pathogenesis of lung cancer aroused renewed 
interest in morphological abnormalities thai f:ill short of inva- 
sive carcinoma but may indicate initiation of carcinogenesis. 
These morphological abnormalities are referred to as preinva- 
sive lesions and are shown in Fig. 1. The last edition of the 
WHO classification of lung rumors included the classification of 
preinvasive lesions as a separate section. Numerous recent stud- 
ies have indicated ihat lung cancer is not the result of a Midden 
transforming event in (he bronchial epithelium bur a muliisfrp 
process in which gradually accruing sequential gene lie and 
ccllulai changes result in the formal ion of an invasive 
malignant) hmioi. Mucosal changes in the large airwavj. thai 



may precede or accompany invasive squamous carcinoma in- 
clude hyperplasia, metaplasia, dysplasia, and CIS (14); Hyper- 
plasia of the bronchial epithelium and squamous .metaplasia 
have generally been considered reversible, and not premalignant 
. in the sense of squamous dysplasia and CIS (15). 

Squamous metaplasia is a common finding, especially as a 
jesponse to cigarette smoking. Peters et at ( ! 6) studied bron- 
choscope biopsiesTrom six sites in 106 heavy cigarette smok- 
ers; Squamous metaplasia was noted at one or more biopsy sites 
in approximately two- thirds of the group, and one- fourth 
showed squamous metaplasia in three or more biopsy sites. The 
incidence of squamous metaplasia increased with smoking his- 
tory and was highest in individuals who had smoked more than 
two packs of cigarettes a day. Auerbach et at (17) noted similar 
findings in autopsy tissues: basal cell hyperplasia and squamous 
metaplasia are increased in smokers in proportion to smoking 
history. Hyperplasia and metaplasia are believed to be reactive 
changes in the bronchial epithelium, as opposed to true preneo- 
plastic changes (J 7, 18). The reasons for this include; (a) they 
are frequently found in association with chronic inflammation, 
and may be induced by mechanical trauma; (b) they spontane- 
ously regress after smoking cessation; (c) in chronic smokers, 
the molecular changes present in these lesions are similar to 
those present in histologically normal epithelium; and (d) there 
are no reports linking their presence to increased risk for devel- 
oping Jung cancer. Jn contrast, moderate T to- severe dysplasia and 
CIS lesions seldom regress after smoking cessation (19). 

Dysplasia and CIS are changes that frequently precede 
squamous cell carcinoma of the lung. Saccomanno et at (20) 
studied more than 50,000 samples from 6,000 men. many of 
whom had worked in the uranium mining industry. Both smok- 
ing and uranium mining (radon exposure) were found to be 
associated with increased incidence of dysplasia, CIS, and in- 
vasive cancer. The studies of Saccomanno et of. established that 
increasing degrees of sputum atypia may be recognized an 
average of 4-5 years before the development of frank long 
carcinoma. 

Another question is: which grades of spnrum atypia pio- 
gress. to cancer? From -the Johns Hopkins cohort of the NCI 
chest ,X- ray/sputum screening trial, we know that among indi- 
viduals with moderate atypia on sputum screening, —10% de- 
veloped known cancer up to 9 years later. Among individuals 
with severe atypia on the sputum screening. >40% developed 
known cancer, during the same time period (2J). Although there 
are data in the literature showing the relationship between 
sputum arypia and subsequent invasive cancer, there is Still very 
little 'information about the histological piogression in the bron- 
chial mucosa in the high risk populations. In a recent publica- 
tion, tunc patients with CIS were followed with amoftuorcs- 
icnce bronchoscopy at regular intervals, and 5 (56%) had 
piogiession to invasive cancer despite endobronchial therapy 
(??). I he number i>f invasive cancers might even have been 
higher if neat men t bad not been not given. Ongoing snrdics of 
high- risk subjects (c.,g.. the Colorado sputum cohort smdy) 
inc hrding serial follow-up bronchoscopies will provide evidence 
i elated to the frequency of development of invasive lung cancer 
:r rt lelaie? to smoking his tor)-, airflow ohs million, and sputum 
r.rypia 

Since the previous WHO- classification was published in 
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Fig.' I A, squamous metaplasia. The 
cells arc widely dispersed, with a reg- 
ular maruiation from the basal region 
to ihe top. There is keraiinization, and 
Ihe' ouclei/cyjoplasmic ratio is low. B, 
moderate dysplasia with ASD. Hyper- 
celhilarify of the epithelium with in- 
complete matuiation and micropapil- 
lary invasion of capillaries are seen. 
The miclci/cytoplasmic ratio is high. 
C, severe dysplasia. There is marked 
pleomorphism of ihe cells with irregu- 
larity and prominent nucleoli. 
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1981, two nonsquamous lesions have been added to tbe WHO 
classification of premalignan! lesions: atypical. alveolar hyper- 
plasia and diffuse idiopathic neuroendocrine cell hyperplasia 
(13). Both of these lesions are diagnosed rarely. The former 
consists of lesjons <5 mm in diameter and composed of a 
peripheral epithelial cell proliferation with minimal etiological 
atypia or stromal response and resembles bronchioloalveolar 
carcinoma; The lesion has been seen in hirig specimens resected 
for lung cancer, but no prospective significance, of this lesion 
has been reported. However, this morphological lesion may play 
a role for the pathogenesis of peripheral rung adenocarcinomas 
(23, 24). The resolution of spiral CT (currently about 3 mm) 
approaches tbe diameter of these lesions, and il is anticipated 
that atypical alveolar hyperplasia will be increasingly encoun- 
tered in subjects undergoing this procedure (25). Diffuse idio- 
pathic neuroendocrine cell hyperplasia consists of a patchy 
increase in the number of well- differentia ted neuroendocrine 
cells in tbe bronchioles. This process may result in the formation 
of small carcinoid tumors, and for this reason i! is considered 
. "preinvasive." To date, small cell carcinomas have not been 
associated with this lesion (13). 

Recently, the use of fluorescence bronchoscopy (see be- 
low) has increased the recognition of dysplastic Jesions in ; the 
large airways and a new morphological entity, ASD, was iden- 
tified (26). Dysplasia of bronchial epithelium in "rnicr ©papillo- 
matosis" and the possible link between angiogenesisand prein- 
. vasive bronchial epithelial dysplasia were recognized a s early as 
J 983 by Muller and Muller (27), who also described the ultra- 
structure of these lesions. It has been suggested that ihis angio- 
genesis, which is recognized as capillary loops projecting into 
the dysplastic bronchial lining, is responsible for the reduced 
fluorescence seen in dysplastic lesions by LIFE bronchoscopes 
(Figs. 1 and 3; Ref. 26). Future prospective studies will show 
whether ihis morphological entity is correlated with a progres- 
sion to lung cancer so as to be a target for the use of antrangio- 
genic agents for chemoprc vent ion. 

In general, there are several questions/problems relating lo 
prcmalignant lesions, which will be addressed in future studies: 
{a) The morphological criteria for premalignan t and early - 
malignant changes, both on sputum cyiology and in bronchia) 
biopsies, have to be validated for intra- and interobserver repro- 
ducibility. 

(6) Uniform and reproducible morphological/cyiological 
criteria have lo be published more extensively, and a training set 
of slides should be available. By rhe use of Internet technology, 
this could be more easily facilitated (28). 

)f) 7hc correlation of spurum atypia and hist olor.ica) 
■changes in rhc bronchi in high-risk population is ijoi well 
defined. 

id) The natural course of preinvasive changes in the bron- 
chi from ihe high nsk subjects needs lo be clarified through 
longitudinal, piospcciive studies with reference to histological 
changes in the bronchi. Ongoing longitudinal studies with flu- 
oresrence bronchoscopy and multiple biopsies with histology 
anil orber biomarkeis will define rhe abiliry of these markers to 
assess for risk. 

If) What is ibe pniholoey/biology of ihe small, of (en pc 
ripherallv locaicd. tumors (3 nun in diameter), which are more 



often diagnosed with newer radiological techniques {e.g. t low- 
dose spiral CT)? 

(/) Optimization of the tissue procurement and processing 
. techniques are important Distinction of reactive from neoplastic 
processes is usually straightforward, but diagnostic difficulties 
may arise in tbe case of (a) inadequate or poorly prepared 
histological materia) to evaluate and (b) the presence of cy to- 
logical atypia in epithelium stimulated by inflammation, viral 
infection, radiation, or chemotherapy. 

(g) DNA array analyses of gene expression: will it be 
useful? How to collect proper mRNA? Can mRNA extracted 
from microdissected cells obtained at bronchoscopy be globally 
amplified and still remain representative of RNA present in sim? 

Biology of Lung Carcinogenesis and Potential Early 
Detection Markers 

Lung Cancer is the end- stage of multiple- step carcinogen- 
esis, in most cases driven by genetic and epigenetic damage 
caused by chronic exposure to tobacco carcinogens. The genetic 
instability in human cancers appears to exist at two levels: at the 
chromosomal level, including large scale losses and gains; and 
at the nucleotide level including single or several base changes 
(29). Lung caoce'rs harbor many numerical chromosomal abnor- 
malities (aneuploidy) and structural cytogenetic abnormalities 
including deletions and noweciprocal translocations (30). At 
least three classes of cellular genes are involved: proto-onco- 
genes, TSGs, and DNA repair genes. Oncogenic activation often 
occurs via point mutations, gene amplification, or chromosomal 
rearrangement, whereas TSGs are classically inactivated by the 
loss of one parental allele combined with a point or small 
mutation or aberrant meihylalion of a target TSG in the remain- 
ing allele. Additionally, dysjegulated gene expression (either 
increased or decreased expression) can occur by other, as yet 
unknown, mechanisms (30). Present studies have not yet con- 
firmed a prominent role for abnormalities of DNA repair genes 
in lung cancer. 

Preneoplastic cells contain several molecular genetic ab- 
normalities identical lo some of the abnormalities found in overt 
lung cancer cells (Fig. 2). These include allele loss at several 
loci [3p, 9p. 8p, and I7p), myc and ras up-regulation, cyclin Dl 
overexpression, p53 mutations, and increased immunoreacliv- 
ity, bcl-2 overexpression and DNA aneuploidy (31-35). AJIc.lo- 
typing of precisely microdissected.. preneoplastic foci of cells 
suggests that the earliest changes in the bronchial epithelium is 
allele loss al chromosome regions 3p ; then 9p, 8p, I7p. 5q. and 
then ras mutations (36-39).Thc biological meaning of LOH is 
only vaguely understood. Recent evidence suggests that LOH 
may be a consequence of mitotic recombination, thai there is 
only infrequent physical loss of genetic loci,, and that LOH 
piobably precedes chromosomal duplication (40). Allelic loss 
would thus be significant primarily in the presence of mutation 
m the retained allele, and gene dosage would not be expected to 
cxen a phenoiypic effect in LOH. Some reports have indicated 
that nij activation occurs at early carcinoma stages (34). His- 
tologically normal bronchial epithelium adjacent to cancers has 
:i)so been shown to have err lain genetic losses. Atypical ade- 
nomatous hyperplasia, the potential precursor lesion ol adeno- 
carcinomas, ufien have Ki-/w mutations I'll) 



Clinical Cancer Research 



Early 



Intermediate 



Late 




Normal Epithelium Hyperplasia 
3p LOH/Sroall Telomeric Deletions 



Dysplasia 



CIS Invasive Carcinoma 

3p i-OH/Contiguous Deletions 



Microsatellite Alterations 



9p21 LOH 



Telomerase Dysregulation 



Telomerase Upregulation > 



MYC Over-expression 



8p21-23 LOH 



Neoangiogenesis 



Loss of F hit immunostaining 



P53LOH 



TP53 Mutations 



Aneuploidy 



MothyJation 



5q21 APC MCC LOH 
K-ras Mutation 



- 80% 
-50% 
-70% 
-80% 
-60% 
-80% 

- 40% 
-40% 

- 70% 
80% 
100% 
30% 
20% 



Fig. 7 Sequential changes during Jung cancer pathogenesis. Although multiple genetic markers are abnormal in lung cancers, their appearance during 
the lengthy pteneoplaslie process varies. The liming of ihe appearance of these changes has. been investigated in bronchial preneoplasia, because 
sequential sampling of (he peripheral lung is technically difficult. Several alterations have been described in histologically normal bronchial epithelium 
of smokers. Other changes have been detected in slightly abnormal cpichelium (hyperplasia, metaplasia) which we do not consider to be true 
premalignant lesions. These changes are regarded as early changes. Molecular changes delected frequently in dysplasia are regarded as intermediate 
in liming, whereas those usually detected at the CIS or invasive stages are regarded as late changes. Jt should be stressed that although there is a usual 
order, exceptions regaiding the liming of onset may occur. Some changes are progressive, such as chromosome 3p deletions. Thus small discietc 
chances are present early, ptogressively become more extensive during pathogenesis, and frequently involve all or almost all of the ann in CIS 
samples. Although allelic loss at the TP53 locus may precede the onset of mutations, data. on this sequence are scant. Dysregulation of the RNA 
component of irlomeiase (with its appearance in noobasal cells) is an early event, whereas up-reguJation of the gene is a telatively laie event. 



Molecular change.*; have been found noi only in the Junes 
of patients with long cancer, but also in (be lungs of current and 
former smokers without lung cancer (IS, 42, 43). These obser- 
vations are consistent with the multistep model of carcinogen- 
esis unci "field cancerizntion" process, vheieby the whole icjjion 
is repeatedly exposed to carcinogenic damage (tobacco smoke) 
and is at risk for developing multiple; separate. ctonalJy unre- 
lated foci of neoplasia. The widespread aneuploidy that occujs 
miotic bout i he respiratory 1 iree of smokers supports this theory 
(•14). However, the presence of the same somatic p.Sj point 
mutation at widclv dispctscd preneoplastic lesions in a smoker 
without invasive lung cancel indicates J hat expansion of a single 
procreoiioi clone may spread throughout the inspiratory ttec 
MS). These rnolrcular a Iter n lions might thus be important 
targcis for use in the cailv detection of lung cancer and lot use 
as surrogate biomatkers jn the follow- up of cbemopicvcntion 



studies. Detection of these mutant cells should be possible wilh 
the different molecular techniques in accessible specimens. The 
prospects of diagnosing these biological abnormalities in mul- 
tiple types of clinical specimens are discussed below. 

Specimens for Clinical Testing; Sputum 

Since the 1930s, cytological examination of spumm has 
been used for the diagnosis of lung cancer (-16). Cytological 
examination of sputa, especially multiple samples, is helpful fot 
the detection of central tumors arising fiom the larger bronchi 
[eg., squamous cell- and small cell carcinomas). Ex folia ted 
cells from peripheral tumors, such :ts adenocarcinomas, arising 
from the smaller airways (small bronchi, bionchioles, and alve- 
oli), especially those less than 2 cm in dianiriei. c:m be detected 
only occasionally in sputum samples. 'Ibis has become id 
peaier importance because the changes in eigaiciie exposition 



Review: Advances in Early Deleclion of Lung Cancer 



(filters and decreased nicotine content) have created an increase 
in adenocarcinomas and a decrease in squamous carcinomas 
(47-49). The sensitivity of sputum cytology for early lung 
cancer is only in the 20%-30% range from screening studies, 
but by adhering to proper specimen collection, and processing 
and interpreting criteria, the yield can be substantially improved 
(50, 51). The data on (he reliability of the sputum are conflicting 
(52-54). Browman et al (52) reported interobserver agreement 
of 68% for exact and 82% for within - 1- category. HoJliday et 
oh (54) reported low agreement within observers (27-60%) and 
across observers (13-50%). Within - J - category intraobserver 
agreement underwent a two- or 3- fold increase in agreement, 
" which was also the case for interobserver agreement. The var- 
iation in intra- and interobserver agreement seems to depend on 
experience among the cytoiechnicians/cytopathologists and the 
composition of categories studied. A higher degree of agreement 
is obtained for higher grades of dysplasia (54). Risse et at. (55) 
showed that the ability to deteel premalignanl conditions is 
dependent on the number and rype of cells present in ibe deeper 
airways, suggesting a mode of improvement that is unrelated to 
observer reliability. MacDbugaJI et a}. (56) concluded that spu- 
tum cytology was loo insensitive and insufficiently accurate to. 
be included in the routine work-up of any patieni suspected of 
having lung cancer. To improve the reliability of sputum cytol- 
ogy examinations a simplification of the diagnostic, categories 
from .6 (normal; squamous metaplasia; mild, moderate, and 
severe arypia; and carcinoma) to 2-3 categories have been 
proposed (54). Future clinicopathological studies will be re- 
quired to validate this concept. 

To improve the sensitivity of spurum examination as a 
population-screening tool for the detection of early lung cancer, 
several approaches are currently under development. 

Immonoslaining. Annual sputum specimens obtained 
from individuals screened at Johns Hopkins were obtained, and 
the patients were monitored for 8 years (57). Because the 
clinical outcome of these patients was known, archival sputum 
specimens were screened for the presence of biomarkers that 
could indicate the presence of lung tumois in an early, preinva- 
sive stage. In an attempt to distinguish the pattern of marker 
expression Tockman et ol. (58) studied two monoclonal anti- 
bodies. Positive staining predicted subsequent lung cancer ap- 
proximately 2 years' befoic clinical tecoenitron of the disease, 
with a sensitivity of 91% and a specificity of 88% (58). One of 
these antibodies (703 D4) had a higher sensitivity and was later 
identified as recognizing hnRNP A 2/ 131 (59). The ; io!e of 
hnRNP A2/BJ oveiexpiession for detecting prf clinical June 
cancer has been studied in a large high iisk population including 
6000 Chinese tin miners who were hea-w smokers nnd who hod 
an extraordinary rate of lung cancer i (?(»). 7 he results f rom, this 
study indicated that detection of hnRNP A 2/13 1 over expression 
in sputum epithelium cells was 2- to 3- fold more sensitive for 
detection of lung cancer than smndaid chest X-ray and sputum 
cytology methods. The method was particubily effective in 
identifying early disease (60). The sensitivity was 7*1% *vr.ti/j 
21% for cytology and 42% for chest X-iay. llowevei. the 
biomaiker bad a lower specificity i'70"-«i compaicd «"tth cytol- 
ogy (100%) and chest radiograph t.90%). An ongoing clinical 
trial is evaluating the per lot ma nee of the A 2/13 1 -protein as a 
biomarkcr for ihc eajly detection of SJM.C. The patients at risk 



for SPJLC have the highest incidence of lung cancer (2-5%) 
among asymptomatic populations (6 J). In this trial, 13 SPLCs 
were identified by A2/B1, and the sensitivity and specificity 
were 77-82% and 65-81%, respectively. Among the cases 
. identified as positive by immunocylochcmrsrry and image cy- 
tometry, 67% developed SPLC within ) year (62). Whereas the 
previous immunocytochemistry studies on material from the 
older screening material from the NCI-supported screening 
studies were made on sputum cells cytologically classified with 
moderately or gravely atypical metaplastic appearance, the latter 
studies have been done on cytologically "normal appearing" 
cells. More recently Sueoka etal. (63) reported the confirmation 
of the value of overexpression of hnRNP A2/BI to detect 
preclinical lung cancer in Japan. Efforts to improve the sensi- 
tivity of hnRNP markers are ongoing (64). . 

PGR Techniques. PCR techniques have been used for 
the evaluation of molecular biomarkers for early lung cancer 
detection. In a pilot study with selected patients from the Johns 
Hopkins Lung Project (JHLP), 8 (53%) of 15 patients with 
adenocarcinoma or large cell carcinoma were detected by mu- 
tations in spurum cells from 1 to J 3 months before clinical 
diagnosis (65). However, the method seemed to be less sensitive 
than the protein marker described above, and the identification 
of specific gene abnormalities is further limited by the need to 
know the specific mutation sequence with which to probe the 
sputum specimens. Currently, this approach is not practical for 
screening undiagnosed individuals. Future advances in gene 
chip technology may permit testing for all possible mutations of 
common oncogenes and TSGs in clinical specimens of asymp- 
tomatic individuals (62). 

Microsatellite markers are small repeating DNA sequences 
found in the noncoding regions of a gene. PCR amplification of 
these repeal sequences provides a rapid method for assessment 
of LOH and facilitates the mapping of suppressor genes (66, 
67). Microsatellite alterations 3re extension or deletions of these 
repeated elements. Detection of microsatellite alterations in 
histological or cytological specimens may facilitate Ibe detec- 
tion of clonal preneoplastic oi neoplastic cell populations. AN 
though the detection of miciosatellite alterations does not indi- . 
cale the specific genetic change in the rumor, detection of clonal 
cell populations might serve as a cancer screening marker (65). 
Identical alterations have been found in lung cancels and cor- 
responding spurum samples demonstrating minimal atypia (68). 
The pI6 gene is located on the shori arm of chromosome ■ 
9(9p21) and is frequently mutated or inactivated in tumors and 
cell lines derived from lung cancer (69. 70). Bclinsky etal. (71) 
measured hypcnnethylatjon of the CpG islands in the sputum of 
lung cancer patients and demonstrated a high correlation wilh 
early stages of non-small cell lung cancer, which indicated that 
pi 6 CpG hypermethylation could be useful in the prediction of 
future lung cancer. However, prospective studies are needed to 
evaluate the role of pi 6 bypcnneihybtion as a marker for early 
lung cancer detection. Multiple other genes are inactivaied by 
hyperrneihylafion in lung cancer (72), and the detection of 
hypermethylation may be useful for risk assessment and early 
diagnosis. 

C»>mpuiri-DssisK-d Imago Analysis. Computer- assisted 
image analysis was initial! v use it to detect malignancy- associ- 
ated changes (eg., subvisnal or nonobvions changes in the 
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distribution of DNA in the nuclei of histologics Jfy normal cells 
in the vicinity of preinvasive or invasive cancer; 73). In a 
retrospective analysis of sputum cytology slides, malignancy- 
associated changes alone correctly identified 74% of the sub- 
jects who later developed squamous ceJI carcinoma (74). The 
technique has been improved, and recent data showed sensitiv- 
ities of 75% for stage 0/1 Jung cancer and 85% for adenocarci- 
nomas with a specificity of 90%- (75)1 This quantitative micros- 
copy technique allows the examining of thousands of cells per 
slide within a relative short time. Similar techniques have been 
approved in the United States for cervical cancer screening, and 
: might, in the future, play a role for lung cancer screening. 
However, no prospective clinical studies has evaluated this 
technology in a larger lung cancer screening setting; 

High Throughput Technology. With future advances in 
^gene chip, technology, it might become feasible to probe for 
expression of multiple genes in sputum specimens of asymp- 
. tomatic individuals. However, this requires a large amount of 
undegraded RNA from respiratory tract cells. With the high 
throughput technology, a higher sensitivity might be achieved 
by using multiple markers at the cost of achieving a lower 
specificity, which would be undesirable for a screening study. 

In conclusion, we need to reevaluate the role of sputum 
cytology for screening and early detection of lung cancer be- 
cause of advances in biomarkers and technology. Ongoing stud- 
ies with standard and biomarker analysis in high-risk groups 
might change the previous negative attitude and provide a new 
perspeclive on sputum cytology as a mass screening tool when 
applied in a high-risk population. Adding different molecular 
diagnostic tests gives the possibility for early diagnosis far in 
advance of clinical presentation. However, validation of ihe 
tests in larger prospective studies is necessary, and the individ- 
ual tests have to be compared with each other to define the role 
of early diagnosis in the overall management of high-risk sub- 
jects. Furthermore, health economic issues have to be consid- 
ered. 

Specimens far Clinical Testing: BAL 

BAL involves the infusion and rcaspiraiion of a sterile 
■ saline solution in distal segments of the lung via a fibcropiic 
bronchoscope. Ahrendi ei a/.. (76) examined a series of 50 
resected non:SCLC tumor patterns and compared the rumor and 
BAL with regard to molecular markers including p53 mutations, 
K-ras mutation, the meibylntion status of the CpG island of the 
p)6 gene, and microsaiellite alteration (Tables I and 2). With 
the possible exception of the test for microsatelliie alteration, all 
of the tests had relatively high sensitivity and could delect 
mutant cells in the presence of a large e.nccss of normal cells. 
The frequencies o.f ihc.w changes in the rumors ranged from 
27% f for K-ras mutations) to r-6% ffoi p53 mutations). As 
expected, p53 mutations wcie more frequent in central (piedom- 
inantly squamous cell) tumors, and K-ras mutations were more 
frequent in peripheral ("predominantly adenocarcinoma) tumors 
The specificity was hi oh I nearly 100%) because, with the 
ccp.tion of minos ate Dire alterations, the same genetic change in 
BAL sample as in rumois was always found, but the sensitivity 
was low. and in only V^i- of tumors that contained molecular 
lesions were the same abnormalities detected in conesponding 
Ft A I. fluids Specifically, the te:ts were least helpful in (he 



group of patients in whom improved diagnostic abilities are 
most needed, those with small, peripherally, located tumors (77). 
Unfortunately, the investigators were not able lo compare the 
molecular tests with routine cytopathological analysis of the 
BAL specimens. The sensitivity of the molecular tests in BAL 
specimens has to be improved, and we need to know the results 
from subjects at increased risk (current and former smokers 
without rung cancer and survivors of previous cancer of Ihe 
upper respiratory tract) and subjects with chronic lung diseases 
as well as results from healthy never smokers. 

A European group has previously shown that generic al- 
terations detected in DNA from bronchial lavage of individuals 
with lung cancer were also found in individuals with no evi- 
dence of malignant disease (78), which raises the question about 
the specificity of such molecular damage in neoplastic condi- 
tions. To improve the sensitivity and specificity of detecting 
allelic imbalance in lung tumors, high throughput PCR : based 
roicrosatejlile assays have been established (79). Jn a recent 
study by Fielding et al (80), the up- regulation of hnRNP A2/B 1 
was found to be a promising marker in BAL for the detection of 
premalignanl and malignant bronchial lesions with a diagnostic 
sensitivity of 96% and a specificity of 82%. 

Jl is too early yet to make conclusions as to whether BAL 
examinations will add lo other pathological/molecular biologi- 
cal clinical studies. To obtain diagnostic material for BAL 
bronchoscopy is required, and we do not have any data that 
compare BAL examinations with biopsies. Thus, we do not 
know whether BAL is a valuable adjunct to the biopsies taken 
under the same bronchoscopy procedure. 

Specimens for Clinical Testing: Peripheral Blood 

For many years scientists have searched' for a lung cancer- 
specific rumor marker that could be delected in peripheral blood. 
Optimism was raised in the "early" immunocytochemistry era 
by 'be use of monoclonal antibodies raised against more-or-less 
specific epithelial epitopes. In the search for epithelial cells in 
peripheral blood and bone marrow, monoclonal antibodies 
against cytokeralrn have been used. However, these reactions 
are clearly not cancer- specific, and some, antibodies have been 
shown io cross -react with normal blood or bone marrow ele- 
ments (81, 82). Another explanation could be that celts from the 
macrophage/monocyte system may contain proteins derived 
from the primary rumor that have undergone .necrosis and 
apoptosis and that these processed proteins are recognized by 
the antibodies (82). On the basis of "traditional'* immnnocylo- 
chemistry : no markers have been "able to detect premalignanl or 
early- malignant disorders based on a peripheral blood sample. 
Houwci. with the development of DNA technologies, new 
possibilities have been raised, and. with ihe use of PGR tech- 
niques, some promising reports have been published. 

Nanogram quantities of DNA circulating in blood arc pres- 
ent in healthy individuals (S3. 34). Tumor DNA is also released 
into the plasma component in increased quantities Sf->). 
Thus, ihe plasma and scrum vf cancer patients is enrichcit in 
DNA, an aveiaue four times the amount of free DNA as com 
pared with normal controls (87). In a study by Chen ei ni. tSS'i. 
a Comparison of microsaiellite alterations in rumor and plasma 
DNA w;, s done in SC.LC patients, and 93% of the patients with 
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Specimen 



Table J Tissues and other resources f or ihe study of molecular markers 

Comments 



Ref. 



Tumor tissue 
Sputum 

Surrogate organ 

Bronchial brush/wash 
Bronchial tissues 

BAL fluids 
Blood components 

Tissue for molecular staging 
Tumor cell lines 

Cultures of nonmalignanl (issues 



Nonmalignanl tissue fiom patients 
and from cancer- free relatives 



Numerous 



65, 68,71,74 



140 



Ml, 142, 143 



42, 43, 45, 144, 145 



76, 78, 146, 147, J 48 



72, 92, 149 



154. 155 



156, 1 57, I5S 



Mixture of cell types, may require microdissection (139). Extensively used 
for most. studies. Alcohol-fixed fine-needle aspirates may be used for 
mutational arid other studies. 

Respiratory cells usually in small minority. Most samples fixed in. 
'Saccomanno's fixative. Several studies have been performed on these 
specimens many years later. 

Predominantly squamous epithelial cells. Buccal smears, brushing* of 
tongue or tonsil may be explored as potential surrogate organs resulting 
from the field effect of tobacco damage of the entire upper .. 
aerodigestive tr3C». This concept needs to be confirmed. 

Predominantly respiratory cells. Fresh, frozen, or alcohol- fixed samples arc 
suitable for multiple studies including FISH.* 

Usually from bronchial biopsies, but may be obtained from surgical 
resection specimens. Formalin fixation and paraffin embedding required 
for histological diagnosis, although EASI preps may permit • 
identification and isolation of subpopulations. Paraffin sections may be 
used for genoryping polymorphisms, for allelotyping, and for m situ 
hybridization. 

BAL fluids are useful for examining the peripheral airway cells, which 3re 
the precursor cells of most adenocarcinomas. Numerous mononuclear 
cells present. Enrichment of epithelial cells desirable. 

Analysis of circulating tumor cells and genetic material shed by dying 
tumor cells into the plasma component may yield useful biological and 
diagnostic information. Gene mutations and presence of epithelial cell 
markers have been used to delect circulaiing rumor cells. Gene 
mutations, allelic loss, microsatellite alterations, and aberrant 
melbylation have been used to identify lumor cell DNA released into 
the fluid compartment. 

Although little data exists for lung cancers, regional lymph nodes, sentinel 
lymph nodes, and surgical resection margins have been used in other 
rumor types for molecular staging. 

Provide an unlimited self-repiicating source of high-qualiry molecular, 
reagents and have been used for numerous studies. Cell lines may or 
may not reflect the properties of the lumois from which ihey were 
derived (26), although they probably represent cellular subpopulations 

• (27). Aggiessive metastatic tumors are more likely to be successfully 
cultured (28) resulting in skewed data. 

Epithelial cultures may be useful for studying molecular changes during 
multistage pathogenesis. Temporary as well as a few immortalized 
cultures from nonmalignant epithelial cells have been established. 
B-Iymphoblasioid cultures are useful for linkage analysis, for genetic 
suspect ibiliry studies, and for alle lory ping corresponding lumois. 

Tissues such as buccal smears, tumor-free lymph nodes, and peripheral 
blood mononuclear cells are useful as contiots for linkage analysis, for 
gene lie susceptibility studies, and foi alleloryping corresponding tumors. 



"FISH, flnoicsccnce in iitu hybridization; EASI. epithelial aggregate separation and isolation. 



microsaiellite alterations in tumor DNA also had modifications 
in the plasma DNA. However, some patients had I OH only in 
the tumor DNA. Because most of the microsatellite alterations 
were similar in rumor DNA and plasma DNA. they concluded 
that some of the DNA circulating in the blood comes liom the 
rumor. Thus, modi ftc at ions of circulating DNA can be used as 
;;n early detection marker. Detection of aberrant DNA me thy I - 
,-iiion in semm DNA in patients with non-SGX has been 
lepo'ned [17'). Although the number of patients was small and 
the hypermcibylaied DNA was found in all stapes, it opens up 
for the possibility to be used as an early luru,- cancer detrition 
marker, further more, pi 5 and fas erne mutations have been 



detected in the plasma and serum of patients with colorectal 
conceis (89-91).. pancreatic carcinomas (W. 93). and hemato- 
logical malignancies (94). 

In. conclusion, the limited direct accessibility of lung car- 
cinomas has led to efforts to identify rum or- associated soluble 
markers in serum or plasma. Many of the currently recognized 
soluble markers were first identified as "tumor" 1 markers but, 
when evaluated in nonneoplastic tissue, have often been found 
in normal cells as well as in lumois. Hot eoily detection of lung 
cancer, we need more clinical data evaluating these new molec- 
ular biological markets from multiple sites, especially in high- 
risk croups. 
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. TobU2 Molecular approaches for lung cancer investigation 



Specimen 



Ref. 



Gene mutations 



Alleloryping 



Gene expression al RNA 
and protein level 



159, 160, 161 



18, 158 



\4S, 162, !63 ; 164.. 165, 166 



Molecular cytogenetics 



. Comparative genomic 
hybridization 

Morphometric studies 



40. 167.. 163, 169. J 70 

171, 172 

74, 173; 174 



Comment 



Widely used technique; especially for p53 and ro* genes;. Often used 
lo determine the role of a newly discovered gene in the 
pathogenesis of rung cancer. May be of diagnostic and prognostic 
significance. Multiple methodologies available, ... 

Useful as a partial substitute for mutational analysis and for 
determining the chromosomal locations of putative rumor 
suppressor genes. Widely used lo study multistage pathogenesis. 
Readily performed on formalin- fixed. and roicrodissected tissues. 
Increasing use of genotyping using automatic sequencers. 

Northern blotting and reverse Iranscription-PCR are widely, used to 
investigate gene expression. Western Wotting often used for. 
detection of protein expression, /n situ hybridization for message 
expression can be performed on paraifm-ernbedded tissues and, 
thus, can be used to investigate multistage pathogenesis; 
Microarray techniques offer promise of examining all or most of. 
the genome but currently require relatively large amount of high-, 
quality RNA from purified cell populations. Sage technique useful 
for investigation and identification 'of expressed genes. Similarly, 
advances in proteomics will permit simultaneous detection of 
multiple proteins. Numerous immunorostocheraical studies of 
. oncogene expression have been used to study multistage 
pathogenesis. Of particular interest, hnRNP expression on 
exfoliated epithelial cells in sputum samples may predict for 
development of cancer. 

In situ hybridization srudies of fixed materials or using smears has 
provided considerable information about numerical and structural 
changes. 

Useful for detection of gene amplifications. Less sensitive for the 
detection of regions of allelic loss. 

May be applied to paraffin-embedded tissues. Useful for determining 
aneuploidy and for. measuring a number of nuclear and 
cytoplasmic paiameteis. ' - 



Specimens for Clinicul Testing: Bronchoscopy 

WLB is the most commonly used diagnostic too) for ob- 
taining a definite histological diagnosis of lung cancer. Bron- 
choscopy has major diagnostic limitations for premalignant le- 
sions. Because these lesions ate only 3 few cells thick (0.2- J 
mm) and have a surface diameter of only a few millimeters, they 
rarely are observed as visual abnormalities. WooJner (95) re- 
ported that squamous' eel) CIS was visible to experienced bron- 
choscopists in only 29% of cases. To address this limitation, 
fluorescence bronchoscopy was developed, Early srudies of 
fluorescence bronchoscopy entailed the use of fluorescent drugs 
(hematoporphyrm dyes) that were preferentially retained in ma- 
lignant tissue (96). Although, si tidier, evaluating this approach 
did. in fact, show that early invasive and m .u"m canceis could be 
localized, the detection of dysplasia irmaimd problematic (97- 
100). Furthermoie. the development of phoiodynamic diagnos- 
tic systems was hampered by problems including skin photo- 
sensitizing and inter teieme with tissue autofluoiescence. To 
overcome these problems, a new laser photodynnmic diagnostic 
system was developed H0J). This system detected rumor- 
specific drug fluorescence rM toO nrn w :i velrngth : which is far 
from normal tissue aniHluoie^ene.- iM>0--.>$0 mvt), and inter- 
ference by auiollnoicscoice liom norm?.! tissue should then 
have been eliminated, but it renamed a significant problem 

{my 



Another appmach was developed by Palcic et al. (103), 
who noticed the lack of auto fluorescence in the tumor lesions by 
using blue light (442 nm) rather than white light to illuminate 
the bronchial surfoce. They amplified Ihe difference in auiofluo- 
rescencc between normal, picmalignant, and rumor tissue for 
clinical use (103, 101): Using a high-quality-charge coupled 
device and special algorithm, the LIFE was developed, taking 
advantage of the principle that dysplaslic and malignant tissues 
reduce auiofluoicscent signals compared with normal tissue 

Several studies have been performed comparing the diag- 
nostic specific ir>' and sensitivity of UFE btonchoscopy versus 
WLB in diagnosing preinvasive and early- invasive lesions 
(1 05- 109; Table 3). Most of the studies reported a higher 
diagnostic sensitivity of LIFE bronchoscopy in the detection 
picmalignant and early- malignant lesions at the cost of lower 
specificity [i.r., more false-positive results). In most of these 
Studies, lesions with moderate dysplasia or worse were the laigci 
of the study and rated as "positive." The prevalence of pmn- 
vasive nnd early lung cancct varies widely from one study to 
another, horn 20.?% (.105) to 65.8% (102). The explanation 
might be beyond the risk piofite of genetic variations oi differ- 
ent levels of e>:perience among the endoscopists as well as the 
pathologists involved. Furthermore, iheic seems to be a training 
effect in using the I.IH; bronchoscope., which has been demon 






straicd by Vcnmans a ol (107). In ihirii study, (he diagnostic 
sensitivity increased from 67 to 80% when comparing the first 
and the second half of the srudy. The use of the LIFE device in 
conjunction with WL 13 improved (he detection laic of preneo- 
plastic lesions and CIS significantly (Table 3). Kurie et <;/. (106) 
looked for more subtle tissue transformation, bul iheii studv 
included few patients' with moderate dysplasia or worse. No 
improvement in (he evaluation of metaplasia inde> was oh- 
served by the use of UFE bronchoscopy. Thus, differences in 
the study population might explain the different conclusion. 
There 3ie siill no clinical studies with sufficient lonp-tcrm data 
showing that moderate dysplasia is ibe mosi tc levant clinical 
predictor of eventual malic nancy. Limitations in making con- 
clusions from the existing studies are also the potential mcih- 
odologica] bias related lo the order in which ihc different bron- 
choscopy procedures arc .lone and whether the same examiner 
has performed both procedures. 7 a address these issues,, n 



prospective randomized study between LIFE bronchoscopy 3nd 
WLB was done a! the University of Colorado Cancer Center. 
7he srudy design included a randomization with ieg.tr d to the 
order of procedure as well as the order of the individual bron- 
choscopist (109). Ibe order of (he procedure and of the indi- 
vidual br one hose opist did not affect the results The studv also 
demonstrated a significantly hi » her sensitivity in detecting pie- 
malipnani lesions visualized by rhe LIFE, bur yi ihc cost of a 
lower specificity (J07). The reason for the low diagnostic spec 
ifrciiy foxmti with the LIFE bronchoscopy in the different studies 
mil? hi be aiiriburable to ibe visualization of moie abnormal foci 
with the LIFE bronchoscope, with the consequence ihat a l:u»ci 
numbei of biopsies were taken and. thus-, iheie *,\ as :; Inrhci risl: 
of more false-positive results. The use of IJU : r-imn bnscopv 
h:is led to the identification of n new moipholi-eii a! entity, the 
.-A SO. which is described above. In a recent morphologic :d Mndv 
rmtModysplasiic chances weie trequentlv found m pirncophstir 
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Fig. 4 Seventy-one-year-old man .with a spicular nodule m upper left 
Jobe demonstrated on low- dose helical CT (picture), but nol visible on 
chesl X- radiography. CT- guided biopsy showed adenocarcinoma. 



and early- malignant lesions in the bronchi (26). The morpho- 
logical entity has been confirmed in preneoplasias among smok- 
ers, and the perspectives of this finding have been extensively 
discussed (1 10). The prognostic significance of ihis morpholog- 
ical entity is currently studied in ongoing long- term follow-up 
studies. Future studies have to evaluate the role of ASD as a 
biomarker for early lesions and whether it can be used as a 
marker for treatment effect or therapeutic targe I for cbemopre- 
vention. 

The LIFE bronchoscope may play an important role in the 
screening and follow-up of subjects ai high risk of developing 
lung cancer. At this stage, however, it is unknown whether the 
LIFE bronchoscope will lead to a reduction in lung cancer 
mortality. There arc also no data on cost-effectiveness and 
cost- bene fit analyses available foT this new diagnostic proce- 
dure. The use of ihe LIFE bronchoscope may also in rhe future 
be extended to other indications, eg., patients staged as having 
resectable lung cancer on one side. Whether LIFE bronchoscopy 
of the contralateral lung will disclose abnormalities, which 
would change the therapeutic decision, is not yd reported.. 

Recent Advances in Radiology 

The previous NCI-sponsored screening trials failed to dem- 
onstrate any reduction in the lung cancer mortality by sputum 
cytology and yearly chest radiography 3S mass screening tools 
lor lung cancer screening. Limitations of design and execution 
of the studies, however, have been discussed extensively (8. 
111. 112). An extended follow-up (median. 20.5 years) of the 
Mayo Lung Project was recently published (113). There was 
still no difference in lung cancer mortality between the inter- 
vention arm and the control arm H.-J iv/mt 3.9 deaths per 1000 
per son- years). However, the median survival for patients with 
rcseried early- stage d^ease was 16.0 yeais in'thr intervention 
arm versus 5.0 years in the usual- care arm {P < (Hiy,. The }:ittei 
findings have raised the question ns ro whet bo some small 
lesions with limited clinical relevance* rnnv have been identified 
in the intervention arm. and the question of "over diagnosis" was 
discussed in accompanying editorials (1 It) 



Mass screening for lung cancer has been performed in 
Japan for many years and has been performed in over 500,000 . 
. people in abou! 80% of the Jocal communities (1)5). Sobue et 
oL (J 16) observed that annual clinic-based cbesl X-ray screen- 
ing for rung cancer in Japan showed reduced Jung cancer mor- 
tality by about one-fourth among individuals who. underwent 
screening once a year. In this screening program, Ihe relative 
odds ratio of dying from hmg cancer within 12 months was 
0.535 and in the J2-24-month period was 0:638 017). How- 
ever, many studies have focused on Ihe pitfalls in the detection 
of abnormalities by radiography (] J 8-1 22). The limit of chest 
: radiographic sensitivity for nbduJe detection is roughly I cm in 
diameter, by which time the rumor has over I0 9 cells and may 
already have violated bronchia) epithelium and vascular epithe- 
lium. CT has been shown. to be more effective in the detection 
pf peripheral rung lesions compared with plain radiography or 
conventional tomography of the whole lung (123, )24). 

Spiral CT scan is a relatively new technology with the 
• ability to continuously acquire data resulting in a shorter scan- 
ning time, a lower radiation exposure, and improved diagnostic 
accuracy compared with those of plain radiography (125-127). 
Spiral CT allows Ihe whole chest to be imaged in one or two 
breath-holds, reducing motion artifacts and eliminating respira- 
tory misregistration or missing nodules. Although there is 
greater radiation exposure with CT than with cbest radiography, 
low-dose techniques (lower mA of 30-50 compared with 200 
for conventional CT) have achieved calculated exposure doses 
that are 17% that of conventional CT and 10 times that of cbest 
radiographs. Further reduction in radiation dose while maintain- 
ing diagnostic accuracy is a topic of current research. Further- 
more, for Ihe baseline screening, low- dose spiral-CT-scan i.v. 
contrast is not administered. Nodules as small as 1-5 mm can be 
shown with modern spiral CT technology (25, 128). The obvi- 
ous advantages with this new technology led some groups in 
Japan and in the United States to look to Jow-dose spiral CT as 
a tool for screening (Refs. 129-131: Tables 4 and 5). 

' In a Japanese report, spiral CT scans and cbesl radiographs 
were done twice a year in 1369 individuals (129). Peripheral 
lung cancer was dctecled in 15 (0.3%) of 3*157 examinations, 
and, among the 15 lung cancer cases detected, the results of 
chest X-iny were negative in 11(73%), 3nd the tumors weie 
detected only by low- dose spiral CT. The de led ion rales of 
low-dose spiral CT and chest X-iay were 0.43% (15 of 3457 
examinations) and 0.12% (4 of 3-157 examinations), respec- 
tively. Furthermore, 1 4 (93%) of the 15 hmg cancers were stage 
J disease. The histology showed thai J I of the 15 lung cancer 
cases were adenocarcinoma, and *1 had squamous cell carci- 
noma. The effective exposure dose with spiral CT scan in that 
study was calculated to abour one- sixth (hat of conventional CT. 

The ELCAP in New York was designed to determine: (n) 
the frequency with which nodules were detected: (/?) the fre- 
quency with which detected nodules represent malignant dis- 
ease: and |c) the frequency with which malignant nodules ate 
curable 1 1 3 J). In the ELCAP study. 21 lung cancers were found 
among 1000 subjects screened. Among the 71 patients with 
cancer. S5% had stage I disease ( I able M. 

Another population- based study on low- dost* CT screening 
has been published by Seme rt nl i J 30"). using a mobile low- 
dose spiral CT scanner. 7 lie ttrrrcrien tutv wa> 0.-*S% (/.(-.. 4-y 
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Table 3 Bronchoscopy versus WLB in diagnosing premaligrianl and early-malignant lesions 



Sensitivity * Specificity ' Predictive vahics 













Relative 








Relative 


ppv* 


NPV 






PPV NPV 




No. of 


L1FE+ 






sensitivity 


LIFE+ 






specificity 


LIFE+ 


LJFE+ 


PPV 


NPV 


Author 


biopsies 


WLB 


LIFE WLB L1FE+WLB 


WLB 


LIFE WLB LIFE+WLB 


WLB 


WLB 


LIFE LIFE WLB WLB 


Lam et al (105) 


700 


0.67 


NR 


0.25 


6.3(2.7)* 


0.66 


NR 


0.90 


NR 


0.33 


0.89 


NR 


NR 


0J9 0.83 


Kurie et ol k (106) 


234 


NR 


0J8 


NR 


NR 


NR 


036 


NR 


NR 


NR 


NR 


0J6 


0.81 


NR NR 


Vcnroans et al (107) 


J 39 


NR 


0.89 


0.78 


i:43 


NR 


0.6) 


0.88 


NR 


0.20 


NR 


0.14 


0.99 


031 0.98 


Veromlen et al (108) 


172 


0.93 


NR 


0.25 


3.75 


0.21 


NR 


0.87 


NR 


0.13 


0.96 


NR 


NR 


0.19 0.90 


Kennedy et al (109) . 


394 


0.79 


0.72 


0.18 


4.4 


03 


0.43 


0.78 


0.38 


0.21 


0.85 


0.25 


0.87 


.0.17 0.80 



* PPV, positive predictive value; NPV, negative predictive value; NR, not reported 

* Based on reference pathologist. 

* If invasive cajcinoma is included. 



Table 4 Results from three population-based screening studies with low-dose spiral CT (LPCT) 



Authors 


No. of individuals 
studied 


True 
positive « 


Fake 
positive" % 


Predictive 
value % 


Detection rate % 
LDCT . X-ray 


Pack-yr 


Age incl. 

V ' 


,Kane(tof/o/. (129) 
Sone etal (130) 
Henschke et of. (131) 


1369 . 
■3967 
1000 


15 
19 
27 


15.6 
5.0 
. 20.1 


6.6 
8.8 
11.6 


0.43 
0.46-0.5 
: 2.7 


0.12 
0.70 


>20 
>30 fc 
>1(T 


>50 
40-74 
>60 



" Defined as individuals wiib "lest- positive,,'* in whom further workup gave no suspicion of malignancy. 

* The study also included a group of nonsmokers. 

* Average = 45 (not reponed in the other studies). 



Table 5 Histology, stage, ami size of primary lung cancer detected by low-dose spiral CT 



Author 


No. of cancers/ 
No. screened 


Histology % 






TNM % 




Size (mm) 




Adeno" Squam. 


Other 


1 


11 111 )V 


Average 


Range 


<10 11-20 


>21 


Kaneko.t-r al (12?) 


15/1369 0.1%) 


73 " 17. 




93 


7 


12 


8- 1 8 






Sone ei at. (130) 


19/5483(0.3%) 


63 5 


32 


84 


16 


17 


6-47 


4 14 




Henschke et al (131) 


27/1000(2.7%) 


67 . 3 . 


30 


85 


4 1) 






15 8 


4 



"Adeno, adenocarcinoma; Squam.. squamous cell carcinoma; TNM, tumor- node -metastasis. 



cases per J 000 examinations). Surprisingly, there was no dif- 
ference in the detection rate among smokers (0.52%) versus 
nonsmokcis (0.46%). The results from the three population- 
based studies are summarized in Tables 4 and 5. The conclusion 
from these studies is ihat 85% of the lung cancers detected by 
low-dose CT were in .stage I. offering improved possibility for 
cmative treatment and better prognosis in genera!. However, the 
issue of "false- positive" scans has to be taken into consideration. 
Thus far. up to ?0% of the participants with nodules on the scan 
had no malignancy during the follow- up period. The possibility 
that the cancers found repiesent incidental cancers as in the 
Mayo Lung Project must also be considered (114). The results 
fiom these studies confirm the expectation that low- dose CT 
incicases the de tec iron of small nomnlcifjed nodules and, that 
lung cancer di an t-arliri and more curable stage are detected. 
The mobile CI sciecninp snidy by Sone ei nl. (130) showed that 
low-dose CT increased the likelihood of detection of malignant 
disease 1(1 limes :;5 compared wjih indioiriaphy. The overall rate 
of malignant di/.e:t:-e wj? lower m the Japanese studies {129. 
I?0.) compared with iht 1:1. CAP study iKrf. 131; detection rales 
0.43- 0.4 8% u r nw ?.7%). this could be because the Japanese 
studies screened rmhviduals from ihf general population ages 



40-74, whereas ELCAP screened people at high risk, ages a 60. 
with a tobacco history of at least 10 pack-years. Thus, as 
expected, the risk of the population to be screened affects the 
rate of cancer detection. 

Questions remaining to be answered include: (o) what are 
the diagnostic sensitivity and specificity of this procedure: and 
(/>) does screening reduce lung cancer mortality? The spiral CT 
has not been as sensitive for small central cancers as it is for 
small peripheral cancers (129 : 131)! Minute nodules of lung 
cancer that are near the threshold of. delectabiliry may be over- 
looked at spiral CT screening (132). A prospective study of the 
diagnostic sensitivity of spiral CT has recently shown that the 
diagnostic sensitivity exceeded the sensitivity of convenironal 
CT in previous reports (25). However, there were limitations in 
the detection of intrapnlnionary nodules smaller than 6 mm and 
of pleural lesions. Compared with surgery (thoracotomy wjih 
palpation of deflated lung, resection, and histology), the sensi- 
livjfy of spiral CT was 60% lor inirapulmonary nodules of < 6 
mm and 9.*>% lor nodules of S6 mm and was 100% foi neo- 
plastic lesions ^ty mm. Furthermore, a marked difference in (he 
sensitivities of two independent observers was found for nod- 
ules smaller than 6 mm, whereas agreement was much better (or 
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6- L ID-mm nodules (25). Given these promising preliminary clin- 
ical results, further research is needed to determine the optimal 
technique for spiral CT screening, which includes collimation, 
reconstruction interval, pitch, and viewing methods. Decreasing 
the slice thickness to 3 mm, monitoring the viewing of exami- 
nations, and computer-aided diagnosis have been used to im- 
prove the diagnostic capability of spiral CT in the detection of 
pulmonary nodules (133-136). 

Future large scale randomized studies have to confirm 
whether in fact spiral CT screening will lead to a reduction in 
lung cancer mortality. )n a randomized study, the following 
questions arise: (a) what is the optimal high-risk group to study 
and what should be the control arm? (b) what should be the end 
points (goals) of the studies? The ultimate goal is to reduce the 
rung cancer mortality. However, although this is a long- term 
goal, intermediate end points from such studies should be eval- 
. uated. The change to more curable stages at diagnosis for the 
lung cancer patients is one such immediate goal; (c) what is the 
optimal workup and the morbidity of this program? {d) what is 
the cost, of such a screening program? and (e) what is the 
false-positive rate of the screening findings? Incorporation of 
smoking cessation programs should be included in the future 
design of screening studies because it has been shown that 
screening with low- dose CT in participants who are still smok- 
ing provides substantial motivation for smoking cessation (137). 

The studies with spiral CT-scan have demonstrated the 
superior diagnostic ability in the detection of small peripherally 
located tumors, most of the malignant ones of adenocarcinoma 
type of histology. The diagnostic sensitivity of spiral CT for 
more centrally located rumors (mostly squamous cell carci- 
noma) is significantly lower than for the peripherally located 
ones. Through these spiral CT studies, we will learn about the 
biology, paihology, and clinical course of these small tumors ; 
which might be different from what we know about clinically 
more evident rumors detected routinely in pievious studies. 

Because lung cancer is so common, the introduction of any 
new screening technique in this area has lo be underpinned bv 
careful definition of the cost implications and must be justified 
by compelling evidence. The cost-effectiveness of the spiral 
CT approach should be assessed by evaluating ihc rate of 
over-diagnosing nomnalignani. relatively common abnormali- 
ties and comparing CT imagine to other diagnostic technologies. 

PET with FDG has recently emerged as a practical and 
useful imaging modality in the preoperative staging of patients 
with lung cancer. However, whereas CT is most frequently used 
to provide additional anatomical and morphological information 
about lesions, the FDG PET imaging provides physiological and 
metabolic information thai characterizes lesions that aie inde- 
terminate by CT. FDG PET imagine talies advantage of the 
increased accumulation of FDG in transformed cells and is 
sensitive lor the detection of cancer in patients who 

have indeterminate lesions on CT 1 133). The specificity (~ S."*%> 
of PHI iniaeint; is slightly less than it.<* sensitivity because some 
inflammatory pioerr-scs avidly arcumulaic FDG. The hiph net 1 
alive ptedii-tivr v:due of PHT suggests 'hat lesions considered 
negative on the >nulv are benign, biopsy is not needed. :<nd 
radiographic folio'-*.-- up is recommended. Several studies have 
documented the increased accuracy vf PET conipatrd with CT 
in ihc evaluation ol the hilar and mediastinal lymph node stnm;. 



in patients with lung cancer (J 38). However, the PET resolution 
is sufficient only for nodules 3:6 cm and will not be helpfb) in 
delecting the very small nodules. Compared with low-dose 
spiral CT, the FDG PET scan is more expensive and lime 
consuming. The role of PET scan in early diagnosis of lung 
cancer in an asymptomatic high-risk population is not yet eval- 
uated. However, future studies have to include PET evaluation 
lo define its role in a population screening setting. 

Conclusion 

Recent advances in molecular biology and pathology have 
led to a better understanding and documentation of morpholog- 
ical changes in the bronchia) epithelium before development of 
clinical evident lung carcinomas. Combined with technical de- 
velopments in radiological and bronchoscopic techniques, these 
procedures offer great promise in diagnosing lung cancer, far in 
advance of clinical presentation. Any of these individual proce- 
dures could be incorporated into the routine management of 
individuals at risk for developing primary or secondary lung 
cancer, and for several of these methods, clinical studies are 
under way. Preliminary reported data ore very promising for the 
early detection of lung cancer. Future studies must incorporate 
the different methods in a mullidisciplinary scientific setting to 
evaluate the role of the individual method in the overall man- 
agement for individuals at high risk for developing lung cancer. 
Several of these tests might diagnose the disease at the stage of 
clonal expansion before invasive carcinoma has developed. A 
management and intervention strategy appropriate lo that stage 
of disease have to be developed. Preliminary snidies of chemo- 
prevention agents are reported, and new agents based on other 
biological mechanisms are under development and ready for 
clinical trials. It is now time to plan clinical trials that evaluate 
both diagnostic and therapeutic approaches to access their im- 
pact on the incidence of clinical lung cancer. 
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moderate dysplasia) and 3 were sessile (1 with moderate and 2 
with focalfy severe dysplasia)]. Samples from 16 colorectal ade- 
nocarcinomas (1 Dukes' stage A, 6 Dukes* stage B, 8 Dukes*. stage 
C and 1 Dukes* stage p\ 1 villous adenoma and 1 .malignant colon 
lymphoma were prospectively collected during 1999-2000. These 
prospectively collected, tissue specimens were dissected so that 
samples contained bom normal mucosa and tumor tissue and were 
obtained within 30 min following surgical bowel resection. The 
specimens were immediately fixed in 4% neutral buffered formalin 
for 20-24 hr and then paraffin embedded The 14 archival samples 
- had also been formalin fixed and paraffin embedded. ' 

Generation of nonoverlapping WMP-1 cDNA fragments by PCR 
The full lepgth TTMP-1 cDNA (GenBank NM_003254) cioned 
in pSP64 vector 29 was used as template to generate 2 nonoverlap- 
ping PGR fragments for in vitro transcription, and named f 1 04 (bp 
56-378) and fl06 (bp 398-680). First, the whole insert (-780 bp) 



was cut, out by digestion with Hind 111 and JfamHl and punned 
after agarose gel electrophoresis using the Qiaex H gel extraction 
kit (Qiagen, Crawley, pmted Kingdom). Jo generate nonoverlap- 
ping antisense probes and the correspondin'g sense probe, 2 PCR 
fragments were generated using, upstream* primers flanked by a 
linker sequence containing an EcoRl restriction enzyme site (un- 
derlined nucleotides) and a T3 polymerase binding sequence 
(boldface) 5 f -(ga^aattcattaaccctcactaaagggaga)-3', and down- 
stream "primers flanked by a linker sequence containing a Bam}Jl 
restriction enzyme site and a T7 polymerase binding sequence 
5'-(ggaJcctaatacgactcaciatagggag) : 3\9. The TTMP-1 specific up~ 
. stream primers were 5'- acccaccatggccccctttg -3' for f 104 and 5'- 
(linker) - gcaggatggactcrJgcaca' -3' for fl 06. and the downstream 
primers were 5'- linker- actcctcgctgcggttgtgg -3' for fl04 and 5' - 
(linker) - tatcigggaccgcagggact -3 '.for fl 06. PCR using the 2 fl 04 
primers of the 2 f 106 primers was done as previously described 30 

The PCR products were purified by column chromatography 
using S-200HR microspin columns (Amersham Pharmacia Bio- 
tech, Ina, Piscatway, NJ), and their size tested by agarose gel 
electrophoresis. Both migrated as —300 bp fragments in accor- 
dance with the predicted size (322 and 282 bp, respectively). An 
ABI. PRISM 310 genetic analyzer was employed for DNA se- 
quencing analysis and was performed according to the" manufac- 
turer's instructions (Per kin Elmer, Applied Biosystems, Foster. 
City, CA) using the primers specified above. The DNA sequences 
obtained were confirmed by comparison with the specific TTMP-1 
cDNA nucleic acid sequence (GenBank NM_003254). 

Plasmids containing human MMP-2 cDNA (pCol7201 , bp 647- 
1284) and human MMP-9 cDNA (pO>19202, bp 1751-2326) have 
been described elsewhere. 31 

Ih vitro transcription 

Antisense and sense riboprqbes were labeled with j5 S DTP 
(NEN, Boston, MA) by in vitro transcription using T7 and T3 
RNA polymerases. (Roche, Basel, Switzerland). The DNA tem- 
plate was digested with DNase (Promega, Madison, WI). Nonin- 
corporated 33 S UTP and DNA was removed by column chroma- 
tography using S-200HR microspin columns (Amersham 
Pharmacia Biotech, Inc., Piscatway, NJ). The 35 S activity was 
adjusted for every probe by dilution to 500,000 cpm/uJ- 

In situ hybridization 

In situ hybridization was performed essentially as described 
previously. 32 In brief, 3 tun paraffin sections were deparaf finized 
in xylene, hydrated wjih graded eihanol and boiled in a microwave 
oven for 10-12 min in 10 mM citrate buffer, 6.0. After 
additional 29 min di room, temperature, the sections were dehy- 
drated with graded ethanol and die 33 S labeled probes (2X J 0 6 cpm 
in 20 uf hybridization mixture 3 ' per slide) incubated overnight ai 
55°C in a humidified chamber. Sections were washed in Hellen- 
dahJ chambers with SSC buffers containing 0.]% SDS and ]0mM 
DTT at 150 rpm at 55°C using a Biihler inenbation shaker (Jo- 
hanna Oho GmbH, Hechingen. Germany) for 10 min in 2XSSC, 



10 min in 0.5 X SSC, and 10 min in 0.2XSSC Sections were then 
RNase A treated for 10 min to remove n on specifically bound 
riboprobe. Subsequent wash* was performed in 0.2XSSC as spec- 
ified above. Sections were dehydrated and soaked into an, autora- 
diographic emulsion (Df ord), exposed for 5r7 days if not otherwise 
stated and finally developed. Sections were counterstained- with 
: haematoxylin and eosin. 

Jmmunoperoxidase staining 

Irarouriorristcchemistry was performed essentially. as described 
previously.^ 2 Five micrometer para/fin sections were depara/ 7 
finized with xylene and hydrated through etbanol/watex dilutions. 
Tissue pretreatmenl was performed with protease-K (5 p-g^rnl) 
digestion for 20 miri. Sections were blocked for endogenous per- 
oxidase activity by treatment with 19b hydrogen peroxide for 
1 5 min. The sections were washed in 50 mM Tris J 50 mM NaCJ, 
pH 7.6, containing 0.5% Triton lX-100 (TBS-T). Incubation with 
■7-anB^odSe^^as^one oye.rnight at 4°C. Sheep polyclonal antibodies 



(pAb) against TTMP-1 and nonimmune goal IgG were used at a 
final concentration of 4 J) u-g/ml. Two monoclonal antibodies 
(MAb) against TTMP-1, 3:1 NM4 {clone rTTX6A, NeoMarkersv Fre- 
moDt, CA) and CalB2 (clone 147^£>11, CalBiobexi Oncogene 
Res. Products, Cambridge, MA), and a.JylAb against trinitro- 
phenyl (TOP)** were all incubated at 1.0 u-g/ml (all 3 MAbs are 
IgGl). CaB2 MAb recognizes both free TTMP-1 and TTMP-1 
in complex with MMPs. 33 NM4 MAb only recognizes free 
7TMP-J- 33 According to- the manufacturer's descriptions, both 
MAbs are raised using recombinant human TTMP-1. The. sheep 
. polyclonal antibodies were raised by immunization with TTMP-1 
purified from human dermal fibroblasts. The IgG was obtained by 
triple precipitation using ammonium-sulfate and characterized by 
immunodiffusion and rocket immupoeJectropboresis. 35 In addi- 
tion, we have shown that the pAb recognize both free and MMP- 
complexed TTMP-1. 36 Furthermore, the specificity of the antibod- 
ies was analyzed by Western blotting analysis against recombinant 
human TTMP-1 expressed in NSO mouse myeloma cells. Here, the 
antibody preparation recognizes a band of approximately 28 kDa 
in accordance with the molecular weight of TTMP-1. To certify 
that the pAb recognize TTMP-1 in colon tumors, the antibodies, 
were immobilized on a sepharose column. Total protein extracted' 
from 3 colon adenocarcinomas was passed through the cohimn 5 
times and the bound and subsequently eluted protein analyzed in a 
Western blot using a TTMP-1 monoclonal antibody (MAC15). A 
single band of approximately 28 kDa was revealed in accordance 
with the molecular weight of TIMP-1 (results not shown). In 
irnmunohistochemistry, the sheep pAb were detected with biotin- 
ylated Tabbit-anti-goat IgG, which cross-react with sheep IgG 
(1:100. code E466, DakoCy tomation} followed by horseradish 
peroxidase in complex with streptavidin (code JC377, DakoCyto- 
mation). The MAbs were detected with the Envision- mouse re- 
agent (EnVision reagent, K4003, DakoCytoroation), followed by 
tyraroine amplification, using biotinyl tyraroine substrate as-spec- 
ified by the* manufacturer (Nen, Boston, MA). Sections were 
developed with NoyaRed substrate as specified by the manufac- 
turer (Vector Laboratories, Burlingname, CA) for 15 min. Finally, 
sections were counterstained in Mayers haeniatoxyliri, dehydrated 
in eihanol and mounted. 

Combined in siru hybridization qnd immunohistochemistry 

Double labeling by combining in situ hybridization and imrou- 
nohisiochemistry on paraffin sections has been described P re ^" 
ously. 32 In brief, using MAb against a-sn>aciin (clone 1A.) 
diluted 1:1000. against cytokerarin (clone AE1/AE3) diluted 
1:1000, or against CD68 (clone PGM1) diluted 1:200, sections 
were incu baled for 2 hours at room temperature and then detected 
with ant i-mouse-lgG/horse radish peroxidase-conjugated polymers 
(Envision-mouse reagent, DakoCytomation, Glostrup.. Denmark)- 
Seciions were developed with di ami nobeiizi dine (DAB) for 
7-10 min, and immediately dehydrated for in situ hybridization, 
which was performed as described above using the antisense 
piobes ol f 104. Seciions were counterstained with haemato*vhD- 
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Results 

^fy^ofTlMP-l probes end antibodies for in S im 
hybridization and immunohistochemistry 

JKstopalhoJogicaJ diagnosis of prospectively collected speci- 
™Z { ' OIa . , 5™ ,0re ^ ^™ revealed 16 colorectal adcS 
lTw d v, "°™ ^"o™ ' malignant lymphoma. *S- 
U Wed autisense and sense RNA probes were generated by in v»£ 

! Mttftatta on adjacent 

, secw^sfromSof the colorectal adenocarcinomas. The 2 mtisense 

S c^k 1? Com f' it,nen, mounding the invading 

P^i^d^"„fT an , C3tprcSSi ! >n - ™°^«ocherrristry was 
S«rf*» i 5 ? C adeDocarc 5»°™a S (inclnding the 5 men- 

TIMP1 anBbdd,es 0,1 sectioEs adjacert to 

IIMP-1 m s,m hybridized sections. The T1MP-1 mRNA and 

Snffi °¥ f^,™*"** *e maligna* lymphoma. The 
an^7TMP-l polyclonal antibodies did not react with other cell 
populabons m all of 8 adenocarcinomas andtetX^lS 

canon but showed a staimng pattern similar to ftat of the oolv- 

neiS'^ P M^ a,i0n aiig - m > "** * e «* except, 
n^r f* v™* 1 * SUuned "* apical surface of the 

normal and mahgnant epimelium: No sign/was ob^irfed wfth 

=S *~ a Mf (ofsamtTubclass as CamW 
dln:C,ed a S ains ' syntheric hapten trinitropbenyl fJNP). 
mRAM exprestton /rcrfcmj m cancer 

w„ » colorectal les,ons by m situ hybridization. TIMP-1 rnRNA 
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express.on was m all the cases of colon adenocarcinoma (inclodma 
Utose mentioned above) highly expressed in stromal fihSJ? 
cells located at the invasive front (Fig. 3a,<fL TTMFM mPW? 
^.al was also observed in fibroblas.-hke ceV ocated^ 
mntpr stroma towards the colonic lumen m 8 of IfS wh£ 

Kf tlfE JT "ff? m ** V*™** 1 6f ^arcmoH^ 
n?PMA ^ V colorectal adenocarcinomas, we observed TIMPI 
•"RNA signal in some fibroblast-lDce cells located aroundthl 
fiZl^ ° f 'ocated in the^mucoTta 

from , Ae cancer area. The normal colonic mucosa, including 
larmna propria .hat was present in all samples tested was .enerf^ 
neganve (Rg. 3V)..Only actively, weak I»iSa2 
was detected i„ strontai fiboblas.-lflce cells smround^one o? a 
very few normal crypts (Fig. 3c, j) m 3 0 „t of 6 cases 
extended exposure time (10 days vj. usually 5 davsl In rt^rT 
^^-^Jy a few TlMP-TSiS cKs" 
tuncu „ JU1 j^ ifflanuTjarfon (data hot shown). In the malisnam 
^mphoma of Ae colc^TIMP-l rnRNA expressing fibroWasftte 
cells were, different from the adenocarcinomas lc«tedrn a dTf 

^ was observed in any of the 18 cases in the canccTceDs 
smooth muscle cells or vascular cells. • - ' 




colon adenaarcinoma woe ^^,7^ i JaCem SecI,on5 from 3 
bbeled a,„i sej)se prXs^r T IMP ^ umTI? 13 ™'^ S " 

? nd a conespondS,. m™^' W ■« ^ 1,04 «» 

in bri S h.lir]d (") and as^ ^ . ' **• 5 " V " S '° >nS aDa ^'"onstraled 
whereas no specinc ' SJLTn J^ S ™" ik '» a.b and d). 



ChaTttaerization^JTIMP.lrnkNA expressing cells 
To test whether the TTMP- 1 rnRNA positive fibroblast-like cell, 

colorectal adeDOcarcinomas and the malignant lymphoma wen^ 

ceils (^MQJ ox CD68 (for detection of macrophages) and subse- 
quently mcobared w.th a TIMP-1 rnRNA XisenseTote L, 
normal co on Ussue, a-sm-actin is expressed by vas ula^cJn 
rnuscle cells, smooth muscle cells of lamina muscul Js mu^ae 
cln Zor SCUl '' mS 25 ^ U aS ^ Wofibrobla^h, 
S?l£rS,V B? ^?™K* "? r " Sed by '""or-associated fibro- 
wast-hke ceUs located throughout the tumor stroma, which are 
defined as myofibroblasts. No TIMP-1 rnRNA was de^d t »y 
^If 00 - 11 ^ Sra0O,h mus ^ ^Is, including those of thd 
)anU 2L r ™ scularis rancosae and the tunica muscularis 
I» addmon, no TIMP-1 mRNA was detected in the a-snJSn 
positive pencryptal myofibroblasts of Oie lamina propria in any of 
ot m ir 10DS - 71Mp - J mRNA signal was in conL. seTni 
ir,^ Z^t* associa,ca ™vofibroblasts located at .he 

more ZlX* ?™£ n CdnCerS - Jn 3 o{ *» adenocarcinomas, 
Se iuva^n? ° f m ^ P ; 1 "^A-P^i'-ve cells located close .o 
TO^r^Si^" CCBS WeiC ^actin-positive (Fig. 4). 
TMP-l mWMA posmve fibxoblast-like cells located mere distant 
from U* mvasive cancer cells, towards the submucosa, expressed 
v™£ a - sm - ac,ln 1" J adenocarcinoma and in Ae malignant ' 
Z^T* a PP™.ma.ely50W.heTJMPl mRNA posi.ive^cells. 
coSule r«r aC "^ ^"V 1 * TIMP - 1 mRNA expressing cells 
caTd aTL inv P0PU ^'' £>n ^"""o^ciated myofibroblasts lo- . 
conld t , ft ° m ° f 0,6 ,UmOT - No "MP- 1 mRNA signal 
could be ^entified m any of the CD68-positive cells (Fig. 4) 

Expression of T1MP- J and MMP-2 and 9 in colon cancer 

inv^vr 2 ;"" MMP ~ 9 2 WC ^ "« a ^nases expressed in the 
mv^ve cancer , )Ssue oX co)orec , aJ ^ 

S „TS^ 10 ^ «P Ie «ed by fibroblast-like cells 
lead^o P T f? ma - and MMP " 9 b > "«c,o phag es at die 
exnt?" = " C f V35,ve caDce '- 3S To directlv compare the 
expression pnuems of MMP-2 and MMP-9 wi.b that of TIMP-1 

zed wi,h r ° m AT ,eCa) ade »°c^ino m as were liybrid-- 

hC «P^»'on of HMP-1 niRNA was localized char- 

TJs ^Z ^'I " ,RJVA 1VaS ,nt>5 ' in,ense lhc «»' fa) 
(big. MMP-9 mRNA expressing cells were found at .be 





Figure 2 -Jn situ hybridization and inrmunohistochermstry for T1MP- J in human colon cancer. (4) Twoadjaeem sections from a JJjJJ*^^ 9 " . 
adenocarcinoma were incubated with polyclonal antibodies against T1MP-) foe) and a TIMP-1 mRNA antisense probe (b). The j Jiyu ^* 
immuDoreactivity (red-brown color, auows in a and c) and the TIMP-1 mRNA (silver grains, arrows in b) are identified in the same cells l^ ows 
in a,b\ Immimoperoxidase staining with the TIMP-1 pAb' reveals the TIMP-J-positive cells as fibroblast- like cells (arrows m c) Jooated >n the 
stroma (St)- No TTMP-J immunoreactivity is seen in cancer cells (Ca). a f b: bars *= 50 p-ro; c: bars =13 p-ro- (B) Four conS « u iiy c , ma£! 
sections were incubated wiih CalB2 MAb anti-TlMP-1 (a) ; ' NM4 MAb {b\ sheep ami TIMP-J pAb (c) or mouse anti TNP (.d). The 5 MAOS 
were delected wiih Envision reasem followed by TS amplification and the sheep .pAb wiih biotinylatcd rabbit anti-goal followed Dy 
HRP-conjugated streptavidin (see Material and methods). The 3 TIMP-1 antibodies react with the same cells (arrows). No irnmunweacuvny is 
seen when the sections are incubated with anti- TNP. 



invasive front like those expressing TIMP-1 mRNA but with a 
distinctly different distribution. Foci with high expression of 
TIMP-J mRNA were not accompanied with increased expression 
of MMP-9mRNA and vice- versa (Fig. 5b). Thus, TIMP-1 mRNA 
expression is not coregulmed with MMP-2 or MMP-9 mRNA 
expression. 

TIMP-] in adenomas and Oiikes' sraze A carcinomas 

TIMP-1 antigen can readily be measured ;n blood and we have 
previously reponed that levels of T1MP- 1 in bJood are significantly 



elevated in colorectal cancer patients compared to healthy donors 
and that high plasma TIMP-1 levels are associated with short 
survival of colorectal cancer patients. 2136 TIMP-1 has therefore 
been suggested to be a novel marker for detection of early stage 
colorectal cancer and for prognostic stratification- of colorectal 
cancer patients.- 1 39 These findings, together with the characteristic 
expression pattern of TIMP-1 ai^the invasive front of virtually all 
the colon cancers and the absence or minute TJMP-1 expression m 
normal and benign colon jmucosa. prompted the evaluation of 
TIMP-1 expression as a marker for early invasive colon cancer. 
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area with several CD68-nosiiive Lr^T DO / C ^ 68 ,m "»'n«ociivjiy is seen in ihe TJMP I n,RM a ? , Ce,b are ^"idered as, 

odditional 8 oSte" s^Ta rn n" * e "P'" 3 ' 0 " a » 

?.*«• A c«S ^^^^T a " 8 
invasive from (fi. 6) In I ™ v V mR NA signal at ,he 
T1MP- 1 mRNA etm, ■ ^ C ■ den ° n,a ft**™™!**! type) 

■bis Sample cleaMv ^ ° f addi,io " 3 ' fow. 

in >be same « a • M Ihfl X*. 0 ' t,ySp,aS,ic e ' )i,be, ' u '" 

-v"e) :v: ™ P *r,™r mRNA "r^ atfe "°™ 

located around small 1. ■ ^ "Pressing fibroblast were 
oreo (dara noTsh"^"'" »»«*»'«> ^ ■>* ™,or 



Ihe invJveTol. nf ^ «P'«5.on ,n myofibroblasts located at 
in ol r o f 7 al^f wbere * M P'«*i°" »™ delected 

interfaceTn'of Z ^ 'f nm3 "' < »' 31 epitbelial-s.romal 
.me. face m 2 of ,he cases and wi.l, arleries in the snbmucosa in I 



Discussion 
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Figure 5 - /n jim hybridization for 7TMP- 1 , MMPr2 and MMP-9 in 
human colon cancer. {A) Adjacent sections were incubated with probes 
specific for TTMP-1 mRNA fac) and MMP-2 mRNA .(6.4). respec- 
tively, and is shown in brightfield {a.b) and dark field illu mi nation 
(c.tJ). Tbe TTMP-J mRNA signal increases towards the subrpucosa 
(Sm) whereas the MMP-2 mRNA signal decreases and is most intense 
in the centra) areas (Ca). (#) Adjacent sections were incubated with 
probes specific for TTMP-1 mRNA (a,c) and MMP-9 mRNA (fr.d), 
respectively, and is here shown in brightfield io t b) and darkfield 
illumination (c.d). Both the TIMP-1 mRNA si^hal and the MMP-9 
mRNA signal are most intense at the invasive front towaids the 
submocosa (Sm), but their expression pa items are quite different, wiih 
MMP-9 showing o )c most restricted expression. Bars = 100 rxm. 



cDNA fra°ments and specific pAb and MAbs against human 
TfMP-1. The TIMP-1 mRNA signal in all colorectal adenocarci- 
nomas investigated was seen in fibroblast- like cells located in the 
tumor periphery. An identical hybridization partem was observed 
with the 2 antisense TIMP-1 probejs and application of comple- 
mentary sense probes on neighboring tissue sections as negative 
controls did not result -in any hybiidizaijon signal; therefore, we 
conclude that the hybridization signal generated with the antisense 
probes repiesems the genuine TTMP-1 mRNA. TTMP-J immnno- 
rcnciivity was also distinctly located in fibrobtast-Jikc stromal cells 
> n _.'hc_inmor periphery, and these cells were identified to be (he 
same cells as the T1MP-I mRNA expressing cells. A pieparation 



of sheep pAb against human TTMPV 1 35 and 2 weJI-characterized 
MAbs stained the very same cells in tbe tumor stroma.. Weak 
staining of the apical surface of seine norrnal and - malignant 
epithelial cells was observed with tbe pAb in some tif the samples. 
• No staining was obtained when (he-anti-TEMP?! antibodies were 
substituted with nonimmuiie goat sernm or anti-TNP MAb incuv 
bated at the same concentration*. These ramunornstochemical . 
\ findings strongly suggest that the- JIMP- i antigen detected in the 
fibroblast-likc cells represents the genuine TIMP-1 protein. 

In our study, we found TTMP-1 mRNA expression in stromal 
fibroblast-Kke cells located in the tumorperiphery in ail colorectal 
adenocarcinomas tested, whereas no expression was detected in . 
the. cancer cells in any -of the. cases tested.* This finding is in 
agreement with studies by Zeng and colleagues > ,2 - 2 ° but is partly in 
disagreement with findings by Newell and 1 colleagues. 27 In addi- 
tion to TTMP-1 mRNA signal in fibroblast-fifce' cells in the tumor * 
j-penjto^^ detected a weak TIMP-1 

•mRNA signal in both benign and mafignaltepimdial c^s. 27 This 
observation "was, however,, based on t}ie use* of probes: from a 
. single TTMP-1 cDNA subclone and no additional controls; to verify 
. the expression pattern. The difference between' our results and - 
those of Newell and colleagues may be explained by methodolog- 
ical differences, since the procedure employed, by Newell and 
colleagues was in several steps different from the one used in the 
present study* e : g., Newell and -colleagues used 3 p T labeled probes, 
whereas we used 35 s -0abeled probes. It is in this context notewor- 
thy that in order to look for a low expression level of TIMP-1 
mRNA, we performed in situ hybridization experiments with pro- 
longed exposure time (10 days vj. usually 5 days) with both* our 
TTMP-1 antisense probes and both TIMP-1 sense probes, but with 
this challenge we did not detect any TTMP-1 mRNA in any 
. epithelial cells. It cannot be excluded though that the TIMP-1 
mRNA is expressed in epithelial cells below the detection limit of 
our rn situ hybridization procedure. 

An interesting observation in our study was the characteristic 
intense TTMP-1 mRNA and protein expression in the rumor pe- 
riphery of all colon adenocarcinomas, while little or no expression 
was seen in the center of the cardnonias. Only in the colorectal 
lymphoma did we find TTMP-1 mRNA and protein expression in 
'fibioblast-like cells located throughout the tumor tissue. Tbe 
TTMP-1 expression pattern in the colon adenocarcinomas is in 
contrast to the expression pattern reported by Hewitt and col- 
leagues, 19 who found that the TTMP-T staining in most colorectal 
adenocarcinomas was equally intense in fibroblasts throughout the 
tumors and that some of the cases even showed decreased TTMP-1 
signal intensity towards the tumor periphery. This difference may 
be explained by possible cross-reactivity of the polyclonal anti- 
bodies employed by Hewitt and colleagues or that Hewitt and 
colleagues employed cryostai sections, while we analyzed paraffin 
sections. . - . 

The Tl.MP-T expressing cells had a fibrbblast-like morphology 
and using combined in situ hybridization for TTMP-1 mRNA and 
inununohistochemistry for ct-sm-aclin, we found that many, gen- 
. erally more than 50%, of the TTMP-1 mRNA positive cells coex- 
pressed a-sm-aclin. According to the cellular morphology of the 
TTMP-T expressing cells and their localization in the invasive- 
front, we could conclude mat the cells were myofibroblasts and not 
smooth muscle cells. 

The myofibroblast is a cell type present in the normal colon 
mucosa, that originally was described as a pericryplal fibro- 
blast 37 *' 0 and Taler was identified with- antibodies against a-sm- 
actin. 37 In the lamina propria, (he myofibroblasts form a continu- 
ous cell layer just below the intestinal epithelium. The pericryptal 
myofibroblasts are phenotypicaUy different from me neighboring 
quiescent interstitial fibroblasts that do not express markers of 
smooth muscle cells/* During early steps of colonic tumori genesis 
the number of myofibroblasts is significantly increased. 111 The 
T) MP- 1 e x pressin g myofibroblasts may be generated after activa- 
tion of the pericrypM myofibroblasts and/or the quiescent inter- 
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sbiial fibroblasts. Adegboyega and colleagues*' hypothesized that 

n,pS^r Cd 7? fibrob5asts «rigi n ate from the quiescent 
interstitial fibroblasts of mc lamina pjt;rpriaj jalhcr ^ ^ 

cryptal myofibroblast or smooth muscle cells, which may help to 

S£X« ft WC >i ^ S ° m - e ° f tbe T,MP " 1 mRNA expressmg 
fibroblast-bke cells a-sm-actm-positive and some ^sm-actin-neo! 
aove. ° 

Several MMPs including MMP-2, MMP-1I and MMP-14 are 
expressed by fibroblast-like cells in human colon cancer 
some of which may indeed be myofibroblasts. The role of the (mvo-) 
fibroblasts m colon cancer progression isnot known. Since TTMJM in 
human colon cancer appears only to be expressed by fibroblast- like 
cells most of which are myofibroblasts; and since high TIMP-] levels 
measured m blood or romor extracts from colon cancer patienls are 

T"^^™* Wi0> 3 *** F«>S«** aM, ™» it could be argued 
that the TMP-J expressing myofibroblasts play a tumor-promotin° 
role^ lirariurKjbslc^bernical localization studies of proteins involved 
in UK activauon and regulation of the efficient serine protease plas- 
minogen, mcludmg urokinase plasminogen activator (uPA) and its 
specific inhibtor PAM show that both are mainly expressed by 
myofibroblasts m human breast cancer^ Hhh leveJs of „ pA 
PAJ-ljue strongly correlated with poor prognosis in breast can- 
cer, supponmg ibe assumption that tbe myofibroblast express a 
promoung role : ,„ cancer invasion. We recently reponed ihat mc 
predommao, PAJ-1 expressing cell in human colorectal cancer also is 
the myofibroblasts and earlier studies indicated tho. elevated levels 
of PAJ-J m colon cancer patients are associated wiib poor prosno- 
sis.- Together these findings indicate thai myofibroblasts are strongly 
contnbunn- to the expression of proteins involved in the re«,.bi,W 
Md™^si^r tm degr3di " ? plCAeX1SCS ' h3! faciKta,c ca "«r invasion 
A panicuhrly interesting firKjin o of the pjesem sll|(J , 

fi^.? c A <*™°n*s 'he TiMP- J hiRNA was expressed in 
fibiob a^t- like cells along the invasive from. In the 2 benign lesions 
m which .he TIMP- 1 mRNA wns see,, in the adenoma" are! *e 



TIMP-1 mRNA positive cells were confined to a single focus with 
locally increased inflammation related to the dysplastic epithelium 
Evident disruption of the dysplastic epitheJium was observed in the 
adenoma with most intense IIMP-l mRNA signal. Intestinal in- 
flammation may be caused by disruption of the mucous epithelium 
that leads to focal leakage of mucinous colon material into the 
Jamma propria. Increased intestinal permeability is a common 
deficiency in Crohn's disease and interestingly T3MP-1 mRNA 
was found in the intestinal granulation tissue of Crohn's disease 50 
and is expressed by myofibroblasts isolated from Crohn's dis- 
ease 5' Induction of TJMP-1 in myofibroblasts in a benisn or 
preinvasive tumor may also be a response to locally increased 
MMP activity or a response to the presence of a specific MMP in 
the local microenvironment. MMP-2 and MMP-9 mRNA expres- 
sion, however, did not appear to be copulated wiib TjMP-1 
mRNA expression in the colorectal adenocarcinomas. Specific 
■MMPs may indeed be involved in the transition of noninvasive to 
invasive disease; in studies of preinvasive lesions (ductal carcino- 
mas m situ) of the human breast we recently reported that MMP-13 
is specifically expressed in myofibroblasts associated with micro- 
invasive events.** Future studies may clarify whether TJMP-1 
expression in colorectal adenomas is conelated with expression of 
specific MMPs T cytokines and/or growth factors, such as TGF-pi 
and TGF -0251, and whether T1MP- Lean be used as a histopnibo- 
logical marker for malignancy in colorectal tumors. 
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SISS^T" ac,iV3,or by *- f ° fecti »" ° lb "°"" 

Baaa^mfceiHy, Lfa HS. Hn VS ri.,- ««, , ,. Y rh Tlil 

™S '!™"" rh "S ic fever «"» *»8*> syndrome (DHF/DSS) a™ sve« 

US1 ^ b0th Ptimaiy iS ° ,ated endothel ^ tonan umWIcT^ veins 
cells, and a human rmcrovascular endothelial cell line. DV infection simificantlv S^S 

? ^ ^ ^ ?AM ° f hUman ^othdhl cells: fa^^SSSS 
endothelial cells was induced by DV as demonstrated byRT-PCR^r^tib^dva^in^rr 
Sror bited DV ^uced tPA pJuchon oL^SS ^ 

but not Dr patients.. These results suggest that IL-6 can regulate DV-induced tPA 

SSSS^SSSSSTJ^ ^ may p,ay important ^ ,es in 

aevetopment of DHF/DSS. Copynglit 2003 Wiley-Liss, Inc. 
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Ne« oncogene expression in ovarian tumors: a quantitative study. 

Department of Pathology, Tufts University School of Medicine, Massachusetts. 

2T« ^ k '™olnstochem,stry in 57 primary and metastatic ovarian 
neoplasms, two paraoyanan leiomyosarcomas, and eight normal ovaries SomeUof 

-.k rVr t? - ° 6% ° £the ovanan tumors that overexposed neu were of 
ep lt hehar typa Ep.thelial ovarian rumors had significantly higher amounts^me neu 
oncogene product as determined by capture ELISA than eitheTgerm S. Z Z>Z 

S^anTdi^ 

ELfSA .ndf,^ ?1 "Tf ° f ° eU ° nCOgene P roduct as measur «* by capture 
ELISA, endometrioid tumors had the highest, and poorly differentiated carcmom as not 
otherw,se speafied had the lowest (p less tlian 0.025). ELISA values^ST 
rS reSS, ° n ' "5 imm ^ ohis ^™cal staining intensity did not correlate with stage ■ 
at dxagnosrs or archtectural or nuclear grade in ovarian rumors. We conclude thTcaptu^ 
ELISA ,s .a simple, effect.ve way to measure the neu oncogene protein pmduTand S 
£S£ H ^fro 11 b6tWeen ELISA ,CVels "* ™nfunohLocheSl 2 a "fng 

PMID: 1353878 [PiibMed- indexed for MEDLINE] 
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Real-timequantitativeRT^PCRofcycHnDl mRNA in mantle cell 
lymphoma:. comparison with FISH and immunohistochemistry. 

HuiP, SoweJS, Crouch J, Nimmakavalu M. Qumsivel, MB TaUiniG Flvnn SD 
Smith BR . ' — ^ ' 

Department of Laboratoiy Medicine, Yale University School of Medicine 333 Cedar 
Street, P.O. Box 208035, New Haven, CT 06520-803 5, USA. ' . 

Presence of the balanced translocation 1(1 1 ; I4)(ql 3;q32) and the consequent 
overexpression of cyclin Dl found in mantle cell lymphoma (MCL) has been shown to be 
of important diagnostic value. Although many molecular and immunohistochemical 
approaches have been applied to analyze cyclin Dl status, correlative studies to compare 
Afferent methyls for the diagnosis of MCL are lacking. In this study, we examined 39 
archived paraffin specimens from patients diagnosed with a variety of 
lymphoprohferative diseases including nine cases meeting morphologic and 
immunophenotypic criteria for MCL by: (1) real-time quantitative RT-PCR to evaluate 
eye m Dl mRNA expression; (2) dual fluorescence in situ hybridization (FISH) to 
evaluate the t(l l;14) translocation in interphase nuclei; and (3) tissue array 
.minunohistochemistry to evaluate the cyplin Dlprotein level. Among the nine cases of 
possible MCL, seven cases showed overexpression of cyclin Dl mRNA (cyclin Dl 
positive MCL) and two cases showed no cyclin Dl mRNA increase (cyclin Dl negative 
MOL-hke ) In s.x of seven cyclin Dl positive cases, the t(l 1;14) translocation was 
demonstrated by FISH analysis; in one case FISH was unsuccessful. Six of the seven 
cyclin D 1 mRNA overexposing cases showed increased cyclin Dl protein on tissue 
array mimunohistochemistry; one was technically suboptimal. Among the two cyclin Dl 
negauve MCL-hke cases, FISH confirmed the absence of the t(l 1;14) translocation in 
both cases. All other lymphoprohferative diseases studied were found to have low or no 
cychn Dl mRNA expression and were easily distinguishable from the cyclin Dl 
overexposing MCLs by all three techniques. In addition, to confirming the need to 
assess cychn Dl status, as well as, morphology and immunophenotyping to establish the 
diagnosis of MCL, this study demonstrates good correlation and comparability between 
measure of cychn Dl mRNA, the 1 1 ;14 translocation and cyclin Dl protein. 
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448: CancerRes. 1994 Jan l5;54(2):539-46. . Re.atedArtic.es. Links 

Elevation of topoisomerase I messenger RNA, protein, and catalytic 
activity ,„ human tumors: demonstration of tumor-type specificity and 
implications for cancer chemotherapy. "'cuyand 

HusajnX MgjjjerJL, SeigjerHF, Besterman JM -- 

^^ 3 i > '^^ I u Se "•' iaS ' JCe " '^ eflt '^ e< ^ as 311 intracellular target of camptothecin aolant 
alkalo.dw.th anhcancer activity. Various lines of evidence suggest o££S*S of 
cells tQ thus drug ls directly related to the topoisomerase Icontent totonwTS^ 
of topoxsomerase I have been shown to be elevated in colorectal ^ Tmp^ed to 
normal colon mucosa. The aim of our study was to determine whether (^HopokomLe 

in S ^ fj M.H " ^ W ^ 6,eVated CnZ ^ e is ^XSHSve 
m these tumors, and (c) the increase m topoisomerase I levels in colorectal tumon, is a 

^ ZTl or ^tioaTopoisomerase I levels were ZSSrf in 

crude extracts from colorectal, prostate, and kidney rumors and their matchX^! 
counterparts ; by Western blotting and by direct determination of catal^ a^vn^nd 

tumors showed r 5-35-fold mcreases m topoisomerase I levels, compared to their normal 
hT" , . 0356 of r state <™> the increase was 2-lb-fold, compareTSrh 
berugn hyperplast.c prostate tissue from the same patients. However no difference Was 

counterparts. The catalytic acrmty of topoisomerase I was determined by a quantitative 
32P-rransfer assay m crude homogenates, without isolating nuclei Colorectal 2d 
prostate tenors exhibited 1 1-40- and 4-26-fold increases, f^B^^L 

Smn ^ £T Cr ' kldne , y tUm ° ni did " 0t Sh0W ™y iteration in catafec a ^vuy 
compared o the.r normal matched samples. Thus, for all three tumor types there was a 
good correlation between enzyme levels and catalytic activity. Final y^ToSllmors 
7oZT^ f J^y^^ levels. A 2-33-fpld inlase JmSS £ Is vT 
found in colorectal tumors, compared to normal colon mucosa. These results surest that 
alterat.ons m topo.somerase I expression in humans are tumor type specific anS the 
mcrease m topo.somerase I levels results from either increased Snscript on of the 
topoisomerase I gene or increased mRNA stability, "puonortne 

PMID: 8275492 [PubMed - indexed for MEDLINE] 
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Immunohistochemistry and quantita ^ R^CR " 0Utp0m ? P^ents. METHODS: 
in 43 archival specimens of patle S S Sadvllv rS^' 0 ^ eXpreSsion of P Ki ^ 

were no. treated with neo-adjuvant iprRKULTS ^JT^ C , 0,0^eCta, Carcinoma ' who 
labeling index of 3 1 .3% (ranee 10 3 66 4»/VTw determined a median pKi-67 (MIB-1 ) 

range 0.01-0.69); indices 3tew* *d nolirreA" 1 ^ 1 ^ ,evel <* 0.1 7 fi9 (DeltaCTT): " ' 
associated witha significa^Uyfa:^ 

correlated to prognostic outcome. A multivariate LkS *f f b ^ hng md,Ces were not 
indicated that rumor stage (UfCC) anZSTmRNA 7 'T 1 3nd bio,o e ical fa <*>* 
prognostic factors. CO&LUSI^^S^^? , ^J^ ^ inde P en *»« 
Prognostic^ 
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Modulation of glucagon receptor expression and response in transfected 
human embryonic kidney cells. Hucciea 

Ike K ami T, CypessAIVr Bouscarei B 

University MedicaI ^ 

Tfce modulation of glucagon receptor (GR) expression and biological response was 

the GR with^fferent densit.es. The GR mRNA expression level in these clones L 
upregu lated by cellular cAMP accumulation and presented a good co^aZZoth 
fte protem express.cn level and the maximum number of glucagon binding s^ 
However the determination of glucagon-induced cAMP accumulation iSe cel. lines 

Inc^t iS7^T Cn : r°?T 0r eXpreSSi ° n did " 0t lead to 3 P r ~> 
increase m cAMP formation. Under these conditions, the maximum cAMP production 

reS, y ^ ^ h0t s ^ ifi -ntly different among selected" cfoneT 

regardless of me receptor expression level. High receptor-expressing clones showS'the 
greatest suscept.bd.ty for agonist-induced desens.tization Spared w.th clones wl 

ll^k^rG^er-^ • reSUUS ° f * e PreSent SlUdy SUggCSt that the GR 

recnnt non-GR-spec.fic desensit.zat.on mechanism(s). Furthermore, the partial inhibition 

or alterahon of the overall cAMP synthesis pathway at the receptor level may be a 
PMrD: 1 1 546678 [PubMed - indexed for MEDLfNE] 



459:IntJpevBio!.l993Sep;37(3):417-23. RetatedArtic.es. Links 

Developmental regulation of acidic fibroblast growth factor (aFGF) 
expression in bovine retina. 

Jacquemin E , JooetL, OjjyerL, BugraK, LaurentM, CourtoisY, JeannxJC. 

Unite de Recherches Gerontologiques, U: 1 1 8 INSERM, Paris, France. 

Acidic fibroblast growth factor (aFGF) is a signalling molecule implicated in a wide 
vanetyofbiological processes such as cell growth, differentiation and survival It has 
been purified from bovine retinal The present study was carried out to detect which cells 
m the bovine retina expressed aFGF at the different stages of embryonic and post-natal 
development. The specific aFGF mRNA and protein were detected by in situ 
hybnd,zadon employing riboprobes and immunocytochemistry using affinity purified 
polyclonal human recombinant aFGF antibodies respectively. No signal was detected bv 
either technique until 4-5 months and then there was progressive expression of aFGF 
with terminal morphogenesis.of the retina. By 8-9 months of embryonic development 
..nucle, of the 3 neuronal layers (ganglion cell layer, inner and outer nuclear layers) were 
all uniformly and intensely labeled A slight labeling of the pigmented epithelium of the 
retina was also visible throughout development and maturation. These results showed a 
good correlation between message and protein expression in these cell types. In contrast 
glial cells in the nerve fiber layer and vascular endothelial cells displayed a nuclear 
immunostaining for the protein in (he absence of message. These data suggest that aFGF 
plays a role in the late steps of retinal differentiation by autocrine and paracrine 
mechaiusms. 
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The P 21(Cipl) protein, a cyclin inhibitor, regulates the levels and the 
intracellular localization of CDC25A in mice regenerating livers. 

f "Trf ' ggg^i, PantojaC, CanelaN, Casanovas Q. S*r ra „n ivr 

AgeH N. Bachs O . — : = ' 

DepartaentofCell Biology and Pathology, Paculty.of Medicine, Instihit d'lnvestigacions 
Bromed.ques August Pi Sunyer (IDIBAPS), University of Barcelona, Barcelona, SpahT 

DMA H fr ° m / 2 / ( ? , " / : ) mi r SUbjeCte<1 10 P artiaI hepatectomy (PH) progress into 
DNA synthesis faster than those from wild-type mice. These cells also show a premature 
lection of cychn E/c^ . 

mechanisms whereby cells lacking p21(Cip l) showed a premature induction of this 
activity .Whereas the ^levels of CDK2, cyclin E, and p27(Kipl) were similar in both wild- 
type and P 21(Cipl-/-) mice, those of the activator CDC25A were much higher in 
o! £- P , 1 ,1 qyieScent md regenerating livers than in wild-type animals. Moreover • 
pZl (Cipl-/-) cells also showed a premature translocation of CDC25A from cytoplasm 
m o^ e V U , C ! eUS - eCt ° piC ex P ression of p21(Cipl) into mice embryo fibroblasts from 
p21(Gipl-/-) mice decreased the levels of CBC25A and delayed its nuclear translocation 
The levels of CDC25A messenger RNA in p2l(Cipl-/-) cells were higher than in wild- ' 
lype cellsj suggesting that tins increase might be responsible, at least in part, for the high 

o, >A ? A Pr0fem dlCSe Cdls - Thus ' Ae «s»»fe re P<>rted here indicate that 
pil(Cipl) regulates the levels and the intracellular localization of CDC25A We also 
found a good correlation between CDC25A nuclear translocation and cyclin E/CDBC2 
activation. In conclusion, premature translocation of CDC25A to the nucleus might be 
involved m the advanced induction of cyclin E/CDK2 activity andDNA replication in 
cells from animals lacking p21(Cipl). 
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Alteration of frizzled expression in renal cell carcinoma. 

quMiUteUve PCR. We observed that the mRNA level of Fzd5 was markedly increased in 
8of 1 1 renal carcmoma samples whilst Fzd8 mRNA was increased in 7 oH 3 
c^norna samples. Western blot analysis of crude membrane factions revealS mat 
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Abstract 

To evaluate the involvement of frizzled receptors iFzds) 
in oncogenesis, we investigated rnRNA expression lev- 
els of several human F2ds in more than 30 differ- 
ent human tumor samples and their corresponding 
(matched) normal tissue samples,. using real-time quan- 
titative PCR. We observed that the mRNA level of Fzd5 
was markedly increased in 8 of 11 renal carcinoma sam- 
ples whilst Fzd8 mRNA was increased in 7 of 1 1 renal 
carcinoma samples. Western blot analysis of crude 
membrane fractions revealed that F2d5 protein expres- 
sion in the matched tumor/normal Kidney samples corre- 
lated with the observed mRNA Jevel. Wnt/p-catenin sig- 
naling pathway activation was confirmed by the in- 
creased expression of a set of target genes. Using a kid- 
ney tumor lissue array, F7d5 protein expression was 
investigated in a broader panel of kidney tumor samples. 
Fzd5 membrane staining. was detected in 30% of clear 
cell carcinomas, and there was a strong correlation with 
nuclear cyclin Dl staining in the samples. Our data sug- 
gested that altered expression of certain members of the 
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Fid family, and their downstream targets, could provide 
alternative mechanisms leading to activation of the Wnt 
signaling pathway in renal carcinogenesis, Fzd family 
members may have a role as a biomarker. 

~~ • Copyright © 2004 S. X v prr AG. B*sel 

Introduction 

The Wnt signaling pathway is evolutionary conserved 
and controls' many events during embryonic develop- 
ment. Members of the Wnt gene family of secreted glyco- 
proteins are involved in embryonic induction, generation 
of cell polarity, cell proliferation and the determination of 
cell fate [J, 2J. Recently, il has become evident that the 
Wnt pathway is also deregulated in a range of tumors [3]. . 

The Wnt signaling pathway is activated when Wnt pro- 
teins bind to a celJ surface receptor complex consisting of 
a member of the fri22)ed receptor (Fzd) fa miry and either 
Jow-density-Iipoprotein receptor-related protein (LRP)5 
or LRP6 [4, 5]. A detailed characterization of the Fzds. 
and the immediate downstream events after Wnt binding 
has been hampered by the lack of pure biologically active 
Wnis. * 

Downstream of the receptor complex, three pathways 
may be in ilia ted, depending on l he composition of the 
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ligand and receptor complex The 'Wrtt/p-catenin path- 
way*, the *Wnt/Ca 2 + pathway' or the *Wnt polarity path- 
way' 16). The Writ/p-catcnin pathway has been linked to 
carcinogenesis. Genetic alterations in components of this 
pathway (adenomatous polyposis coli, APC, axin and 0- 
. catenin) can result ijp the accumulation of non-phosphory- 
lated P-ca ten in 1 3, 7] and this can promote carcinogenesis. 
Conversely, neither the Wnt/Ca 2+ pathway nor the Wm 
polarity pathway involves the activation of P-catenin [for 
review, see ref. 1,6]. 

Mutations in one of the three regulatory genes (APC, 
P-catenin and axih), overexpression of Wnts and Fzds or 
the expression of a const ituti very active Fzd have been 
Jinked to Wnt/p-catenin pathway activation in various 
tumors f8, 9], 

To evaluate the involvement of Fzds in oncogenesis, 
we investigated mRNA expression levels of several hu- 
man F2ds.(Fzd2, 3, 5, 6, 7, 8 and 9) in more than 30 differ- 
ent human tumor samples using real-time quantitative 
PCR. Each sample was compared with its corresponding 
(matched) normal tissue sample. The most striking obser- 
vation was the dramatically increased Fzd5 and Fzd8 
mRNA expression seen in the renal carcinoma samples. 
This was confirmed at the protein level using Western 
blotting. Xidney tumor tissue arrays confirmed Fzd5 
membrane staining in 30% of clear cell carcinomas, with 
nuclear cyclin Dl showing a strong correlation with the 
Fzd5 membrane labeling. Fzd8 protein expression analy- 
sis was not performed due to ihe lack of suitable reagents. 
Tbese data suggest that Fzd5 may have a role in renal ceil 
carcinogenesis due to its frequent overexpression obr 
served in these tumor samples. Potential future applica- 
tions could include uses in tumor targeting or as a poten- 
tial biomarker. 



Materials and Methods 

Tissite Samples 

Frozen tumor l issue samples with corresponding normal tissue 
from the same patient were derived either from human biopsv.or 
autopsy material (Department of Pathology, University or Antwerp, 
kindly provided by Prof. E. Van Marck). Tissue specimens were, 
snap-frozen in liquid nitrogen and kept at -S0*C until use. Frozen 
actions or kidney tumor and normal tissue samples we/e stained 
with hcmaloaylm-eosin to support the pathologist's observations and 
to confirm the type of kidney tumor. Paraffin-embedded tissue slides 
or renal carcinoma, lung carcinoma, breast and colon carcinoma 
were obtained, afrer encryprion. from ihe Department of Paihologv 
(M»dde)heim Hospital. Antwerp. Belgium). The CL1 human kidnei 
cancer (Super BioChips Laboratories) tissue arrav used in ihis sludv 
contained 59 tissue samples consisting of 9 normal kidney tissues. 



30 clear cell renal carcinoma samples and another 20 renal celt tumor 
types tchroroophil, chromophobe, papillary type, collecting duct car- 
cinoma and samples with mixed types). 

RNA Isolation and Reverse Transcription 
Total RNA was extracted from tissue specimens using UHraspcc 
Reagent {Bioiecx. USA) according to the manufacturers instruc- 
i ions. AH total RNA was routinely treated with DNase (DNA-free kit, 
Ambion, USA). I ug of total RNA was used to synthesize cDNA 
using oiigo-dT primers (Superscript; Inviirogen, Merelbeke. Bel- 
gium)! Reverse transcription was performed at 42 * C for 60 min, fol- 
lowed by 70 * C for 1 0 rn in. 

Real-Time PCR . 

Real-time PCR was performed on eilher an AM Prism 7700 or 
7900 Sequence detection system (Perk in-Dmer Applied Biosysterro, 
Foster City, Calif., USA) using the 5' nuclease assay (Taqman™).- * 
Primer and probe sequences were designed using Primer Express (PE 
Applied Biosyslems) and are shown in table I. Quantitative values, 
were obtained from the threshold cycle number (Ct) al which the 
increase in the signal associated with exponential growth of PCR 
products is delected using PE Biosystcms analysis software, accord- 
ing to the manufacturer's instructions. . v 

We have used ihe 2' A ^ mC ihod to analyze the relative changes 
in gene expression of the different genes between tumor and corre- 
sponding normal tissue samples. We used ihe mitochondrial ATP 
synthase 6 (ATPsy6) as the endogenous RNA control [10; Jznsscns et 
af r in prep.}, and each sample was normalized lo its ATPsy6 content. 
The relative expression of the target gene was also normalized to the 
corresponding normal tissue sample (calibrator). Results, expressed . 
as the amount of target sample relative to the ATPsy6 gene and the 
calibrator, were determined as' follows, N - 2-' a ° MP> P k - AOraiib»i©ri 
where ihe AO values of the sample and calibrator were determined 
by subtracting the average Ct value of the sample and the calibrator . 
from the average Cl value of the ATPsy6 gene. Amplification was 
done essentially as described previously J I0JL Briefly, 50 uJ or reac- 
tion mixture containing 1 ul of cDNA template were amplified as 
follows; incubation at 50*C for 2 min, denaturaiion at 95'C for 
10 min, and 50 cycles al 95'Cfor 15 sand 60*C for r min. 

Membrane Preparation. Get Electrophoresis andjmmvnoblotiing 
Tissue samples were weighed, suspended at a 40 times dilution 
1^ AO volumes/original wei weight of tissue (v/w)J in 50 mM Tnv 
HCI buffer. pH 7.4, and homogenized with an Ullra-Turrax homoge- 
nizes After cent rifuga lion for JO.min,24,0O0^at 4*C, the pellet was 
washed three times by resuspension in the Tris-HCI bufTer followed . 
by centrifugal ion. The final membrane pellets were stored at - 80* C 
in the Tris-HCI bulfer at a concentration of 0.5-1 mg/mi. The Brad- 
ford protein assay (Pierce, Aalst, Belgium) was used Tor protein deter- 
mination. Proteins (50 ug) were separated. by 89b SDS-PAGE and 
transferred to nitrocellulose membranes. After primary and second- 
ary antibody incubation, the amigen-ahtibody.peroxidase complex 
was detected by chemiluminescence (Pierce, Aalst, Belgium) accord- 
ing to the manufacturer's instructions. 

Immunoh isi och em is try 

Immunohistochemisiry was performed on lO-pAAthick cryosec- 
tions of unfixed tumor tissue and on 6-U;V/-lhick para ffin 'sections 
from renal tumor tissue fi^ed by formalin or by an alcohol- based fixa- 
tive. Adjacent tissue blocks from renal tumors were processed with 
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Table 1, Real-time PCR primer and probe sequences 



Target cDNA 


\ Primer/probe sequences* 


Fragment 
position* 


Accession 
No.* 


FZD2 


{a)5*-alcecgigeccggc-3' 

{b)5*-gtamgatC3tglagatcgtgaagic-3' 

(c) S'-FAM-tacacgccgcgcatglcgc-TAM RA-3' 


1,548-1,613 


AB017364 

* 


- FZD3 


(a) $'-icacgcc3£tecatggg-3' 

(bj 5'-ttgtcaccltcaatUtaltcatcg-3' 

(c)3'-FAM-catccccggaactciaaccatcatccilU-TAMRA*3' 


J,473^I547 


- AB039723 


. rzps 


(a) 5'-igcca3ggicacticcgttt-3' 

(b) 5'*tctccaaglcgccgcg-3' 

(c) y-FAM-cenc3lggi£tlgl tgccccc-TAM RA-3>' 


2,143-2,204 


HSU43318 


FZD6 


(a) 5'-ciagcacccctaggtfaagagaa-3' 

(b) 5'-cccagagagtclgg3gatgg3t-3' 

(c) 5'-FAM-t^ggtgaacclgcctcgccag-TAMRA-3' 


2,094-2, J 70 


AF072873 • 


FZD7 


(a) 5'-cctgtggaaaggcataaeigig-3' 

(b) 5'-3accaacgggaaaccicaga-3' 

(c) 5*-FAM-aagcaaciJttataggca3agcagtgcaa-TAMRA-3' 


2,687-2,762' * 


AB01736> 


F2D8 


(a) 5'-lgigglcgg»gclcigci|-3* 

(b) 5'-cgctcca!gtcgaiaaggaag-3' 

(c) 3'-FAM-ccacct tcgccaccgtc£cca-TAMRA-3' 


853-919 


* AB043703 


FZD9 


(a) 5'-ccccgggagcl3Cggac-3' : 

(b) 5 -lagtcatgt£caag3cc3cgg-3* . 

(c) 5vFAM-tggcacgcaclgccaclataa£gcl-TAMRA-3'. - 


1,696-1,763 


HSU82169 


ATPsy6 


(») 5'-£gtgtaggtgtgcc»gtgg|-3' 

(b) 5 -gggcgcagtgaHataggctl-3' 

(c) 5'-FAM-33gtgggciagggcat til taatcitagagcg-TAM RA-3' 


. 580-503 


AF36827I 


c-myc 


(a) 5*-aec&ccagcagcgaciclga-3' 

(b) 5'-lceagcag3aegtgaiccagact-3' 

(c) 3 : -FAM-accutlgccaggagceigcctcl-TAMRA-3' 


U9?-1,413 


HSMYCI 


CyciinDI 


(a) 5'-gaacctggccgcoalgac-3'- 

(b) 5'-cgcctctggcaitilgga-3* 

(c) 5'-FAM-ccgcacgailtcaUgaacactt-TAMRA-3' 


4,148-4,211. 


AF5 1 1 593 


PPAR5 


(a) S'-agcatcctcaecggcaaaO' 

(b) 5'-gtctcgatgtcgiggatcaca-3' 

(c) 5'-FAM-ccagcc3cacggcgcccl-TAMRA-3' 


932-990 . 


NM-006238 . 


3 (a) =. Sense primer; (b) = anlisense primer, (c) = probe. 

b Fragment positions are given according to the EMBL/GenBank accession No. of cloned sequence. 
* EMBJUGenBank accession No. of cloned sequence. • - 



formalin and with the alcohol-based fixative. Paraffin and cryosec- 
tions were mounted on poly-i.-lysine or 3-aminopropyltrietboxys>- 
lane-gelalin-coated slides. Tht 59 tissue samples on the CL1 human 
renal cancer tissue array slides were, all fixed with 'formalin, and 
embedded in paralTin. and ihe sections were mounted on silane- 
coatcd slides (SuperBioChips Labora lories). In addition io renal car- 
cinoma tissue, sections from 10 formalin- fixed pa raflm-em bedded 
lung carcinomas were stained for F2d5. Colon and breast lumors 



were used as positive controls for p-c a ten in and cyclin Dl immuno- 
staining. 

. The following primary antibodies were used: Fid5 (Upstate Bio- 
technology), p-catenin {Zymed), cycJin Dl (Zymed).' E^cadherin 
(Novoccslra) and cytokera tin 8 (Biogenex). Cryoseclions were fixed 
in 4% paraformaldehyde for 5 min; acetone for 5 min at -20*C and 
70% elhanol for 5 min. Endogenous peroxidase, activity was 
quenched using 3% H;(X Paraffin sect ions of formalin- and alcohol- 
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Table 2. Fid mRNA expression in tumor samples 



Sample 3 


Tissue 


Tumor type 


x-fold expression increase* 1 










FZD2 


F2D3 


FZD5 


F2D6 


F2D7 


fzds. 


F2D9 


133702 


kidney ' 


* adenocarcinoma 


0.17 


0.5 


X72 


2.13 


0.06 


1.32 




137770 


kidney 


renal cell carcinoma 


L23 


3J6 


836 


2.61 


1.3 


83 i * 


5.44 


138844 


kidney 


renal cell carcinoma 


0.31 


0,11 


6.84 


1.18 


0.23 


3JS 


2.8 


137146 


kidney" 


renal cell carcinoma 


0.47 


0.45 


XI6 


2.23 


0.73 


4.42 


1.37 


137564 


kidney 


. renal cell carcinoma 


. 3.43 


2.95 


9.6 


1.57 


764 


3.52 




■133408 • 


kidney 


renal cell carcinoma 


■2X97 


0.98 


6.39 


2.48 


505 


16.72 




139J88 


kidney 


renal cell carcinoma - 


3.7 


0.56 


0.66 


1.33- 


0.37 


4JI 


J 33 


13569? - 


kidney 


renal cell carcinoma 


2.6 * 


0.36 


■4,83 . 


6.9 


0.34 


2.54 


2.31 


139064 . 


kidney 


renal ceJI carcinoma 


5.16 


1.82 


1.25; 


2.36 


1.8 


2.19 


2.85 • 


134585 


kidney 


renal cell carcinoma 


1.47 


0.72 


1.38 


0.47 


0.09. 


OA 




140279 


kidney 


jena) cell carcinoma 


7.33 


?8J7 


3.93 


605 


6.41 


4.65 


7.91 ; 



137252 


ovary 


carcinosarcoma - 


0.4 


3J9 


0.49 


0.44 


0.92 


034 


6.53 


138256 


ovary 


papillary carcinoma 


0.7 


5.17 


1.22 


2.19 


0:4 


O.U 




J 46472 


ovary 


serous papilJary carcinoma 


- 0.39 


339 . 


2.56 


1.34 


3.94 


.0.67 


03? 


145845 


colon 


adenocarcinoma 


3.45. 


- 3.36 


0.58 


1.22: 


1.67 


134 


1.99 


J46I45 


colon 


adenocarcinoma . 


5.46 


6.74 


4.42 


637 


3.36 


. 6.08 


0.15 


146630 


colon 


adenocarcinoma 


4.01 


4.73 


0.3 


1.16 


0.54 


0.55 


0.12- 


146633 


colon 


adenocajohoma 


J. 87 


1.07 


1.62 


1.47 


. 0.51 


2.07. 




147055 


colon 


adenocarcinoma . 


0.66 


1.4! 


1.0J 


i.33 


0.24 


0.7 9 . 


4.87 


142253 


lung 


adenocarcinoma. 


0.67 


5.17 


0.43 


' 1.87 


0J3 


0.59 


0.41 


143036 


lung 


adenocarcinoma 


0.67 


1.24 


2.63 


1.13 


i.ll 


1.04 


9.19 


. 138938 


lung. 


adenocarcinoma 


0.93 


0.88 


1.07 ' 


1.3 


1.73 


038 


0.76 


133563 


Jung 


adenocarcinoma 


2.76 


1.23 


0.31 


134 


0.99 


0.78 


4.97* 


144387 


lung 


•-. adenocarcinoma 


12.85 


0.43 


0.49 


0.25 . 


2.73 


0.54 




137304 


lung 


acina ry adenocarcinoma 


0.54 - 


9-15 


1.65 


1.25 


0.63 


3.41 


2.64 


144546 


lung 


epithelial carcinoma 


0.1 


0.67 


0.21 


8.1) 


. 0.27 


0.97 


132 


137621 


lung 


epithelial carcinoma 


1.52 


2.19 


.0.37 


2.26 


0.52 


0.45 


1.06 


145552 - 


lung 


epithelial carcinoma 


1.09 




036 


0.91 


M7 


0.13 


03 


143987 


lest is 


embryonal carcinoma 


0.24 


0.6 


19.43 


1.33 


23.12 


2.43 


0.33 


137332 


stomach 


leiomyoma 


19.32 


X9J 


0.45 


1.53 


18.98 


32.82 . 


16.7 


139026 


stroma 


gastrointestinal carcinoma 


66.1 


3-51 


0.03 


6.67 


8.44 


2.28 


20.82 


136049 


rectum 


adenocarcinoma . 


1.71 


032 


0.4 


0.54 


0.29 


o.88 : 


038 


140794 


gal) bladder 


adenosquamous carcinoma 


0.19 




0.93 


3.02 


0.2 







■ Sample identification numbers were given by the pathologist. 

b Results are expressed as x-lbld increase or the gene in the tumor tissue sample compared to its matched normal tissue sample after 
normalizing both samples on the basis of their. ATPsy6 content. A cutolT of 3-fold was used to define differential expression. Significant 
i> 3-fold) increases in the expression level of the Fzd receptor are shown in italics. - = Expression of the target gene undetectable in one or both 
samples (tumor and/or normal). 



fixed tissue were processed with a irypsin-citrateTmicrowave pre- 
treatment or with an £DT A- microwave pre treat men! I o unmask epi- 
topes, respectively. Sections were then sequentially processed with 
primary antibodies, brpiinyhned secondary antibodies and st repl av- 
id in- bioiin-pero.x ids se (Fid5. JE-cadbe r'm and cytokeratin 8). For p- 
catenin. polyclonal rabbit antibody with the En Vision detection sys- 
tem (DAKO) was used. The slides were further developed using 3- 
amino-9-elhy!r3rbazoie. counttrstained with hemalaun and mount- 



ed with glycerin gelatin. Stained sections were observed with an 
Axioplan 2 microscope equipped with an Axiocam digital camera. 
Staining intensity for p-catenin was scored as no staining (value 0), 
weak and fragmentary staining of cell membranes (value 1). moder- 
ate membrane staining of less than 50% of the tumor cells (vaIuc-2). 
moderate membrane staining of more than 50% of tumor cells (value 
3) and strong membrane staining of more than 75% of tumor cells 
(value 4). The cycJin Dl staining w as quantified as a percentage of 
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Fig. 1. Fid5 protein .expression in matched 
tumor/normal kidney samples. T = Tumor 
sample; N - maiched normal sample.- Sam- 
ple identification numbers are given by the 
pathologist. 



133702 137770 138844 137146 1375W 133408 

T N T N T.NT-N T N T. N 



mm** 



135699 • 1390S4 134585 140279 
T N T N T N T N 



FidS 



- Aciin . 




FjdS 



cyclin Dl-imrmmoreactive nuclei in tumor cells in three fields (area: 
1 8,64 1 pro 3 ) of each tumor sample; The total number of tumor nuclei 
ranged from 51 to 164. The correlation between Fzd5 and R-catcmT) 
staining arid between Fzd5 and cyclin Dl staining was evaluated by 
the Mann-Whitney U IcsL . 



ResuHs 

Fzd mRNA Expression in Matched Human 

Tumor /Normal Tissue Samples 

Fzd expression in tumor tissue was compared with Fzd 
expression in matched norma) tissue samples and normal- 
ized to the expression of the housekeeping gene mitochon- 
drial ATPsy6 (labJe 2). A 3-fold increase was considered 
significant. 

In the kidney tumor samples, in which 10 of II sam- 
ples were clear cell carcinomas, Fzd5 was upregulaled in 8 
of the 1 1 samples. A similar observation was made for 
F2d8 and Fzd2, which were upregulaled in 7 and 5 renal 
tumor samples, respectively. None of the other Fzds 
showed consistent upregulation. 

Both Fzd2 and Fzd3 were upregulaled in 3 of 5 colon 
adenocarcinoma samples. No other Fzd expression was 
significantly different compared to the normal colon tis- 
sue sample. Fzd3 showed an increased expression in all 3 
ovarian carcinoma samples. Fzd expression was not al- 
tered in any of the lung tumor samples. The Fzd expres- 



sion level was observed to be relatively low in these lung 
tissues compared to the other tissues investigated. 

Western Blot and ) mmunohistochemistry Analysis on 
Renat Carcinomas 
Western blotting was used to evaluate Fzd5 protein 
expression in the renal tissue samples used for mRNA 
expression analysis. Membrane fractions of the renal car- 
cinoma and corresponding normal tissue samples were 
prepared. As previously shown (table 2), Fzd5:mRNA 
upregulation was delected in 8 of the 1 1 maiched tumor/ 
normal samples. Increased expression of Fzd5 protein 
was seen in membrane fractions from 9 of 11 samples 
(fig. I). Jn most cases, concomitant increases in Fzd 5 
mRNA and protein levels were observed. 

Hemaloxylin-eosin staining of the cryosect ion ed' tu- 
mors confirmed the presence of clear cell carcinoma. 
Fzd5 immunostaining in clear cell carcinoma (fig. 2a, b) 
was observed to be localized to cell membranes and to 
nuclei. Cylokeratin 8 (fig. 2c) and E-cadherin (fig. 2d) 
were also delected. E-cadherin labeling of cell membranes 
in clear cell carcinoma was less intense and patchy com- 
pared to epithelial cells of normal renal tissue. P-Catenin 
staining was confined to the cell membrane. 0-Catenin 
levels in the clear cell carcinoma membranes were highly 
variable. Nuclear 0-catenin staining was not observed in 
any of the samples. Epithelial cells in normal renal, tissue 
showed intense membrane staining and some cytoplasmic 
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Staining. Jo addition, weak p-catenin staining of endothe- 
lial cells was observed. A high number of cyciin Dl- 

. immunoreactive nuclei was observed in clear cell carcino- 

. ma (fig. 21). 

On the CL1 human kidney cancer tissue array, 30% 
(n = 9) of the clear eel) carcinoma tumor samples (n = 30) 
showed Fzd5 immunoreactivity (fig. 3a). Membrane-as- 
sociated p-catenin staining was observed in 33% of the 
FzdS-pdsili.ve tumor samples and 57% of Fzd5-negative 
clear renal cell carcinoma samples (table 3; fig. 3c, d). 
- Again, nuclear P-catenin staining was never observed. 
Statistical analysis did not reveal a difference in . the 
expression of p-catenin between Fzd^positive and F2d5- 
negative tumor samples (fig. 4a). 

Nuclear cyciin D J was observed in : 89% of the Fzd5- 
positive clear ceil carcinoma samples (table 3; fig. 3e). 
Only 38% of the Fzd5-negative clear cell carcinoma sam- 
ples contained nuclear cyciin Dl. Statistical analysis 
showed a significantly higher cyciin Dl expression in 
FzdS-posilive compared loFzd5-negative tumor samples 
(fig. 4b). 



Expression of PPAR5 was investigated because it repre- 
sents a direct target of the P-catenin pathway with T ceil 
factor binding sites in its promoter. Expression of c-myc 
was found to be upreguiated in 7 of 1 1, whilst cyciin Dl 
was upreguiated in 10 of 1 1 kidney tumor samples (ta- 
ble 4). PPAR5 was upreguiated in 9 cases. All three select- 
ed target genes showed a marked upregulation in . the 
majority of renal tumors, which suggested that the Wnt/ 
P-catenin pathway was activated in these samples. 



Discussion 

Fzd family member ove repression has been postu- 
lated to play key roles in different tumor types such as 
esophageal carcinoma [11 J, gastric cancer f 1 2] and bead 
and neck squamous, cell carcinoma jl3J, The current 
study evaluated the potential implication of F2ds as 
tumor-associated antigens in different tumor types. We 
screened a number of matched normal/tumor tissue sam- 
ples for the expression of a variety of Fzds using real-time 



c-myc, Cyciin D J and Peroxisome 
Proliferator-Actiyaled Receptor 5 Repression in Renal 
Carcinomas 

Wnt/p-catenin. pathway activation in the kidney tissue 
samples was investigated looking at the expression of a 
number of target genes, which have previously been 
shown to be upreguiated when the pathway is active. 
Gene expression, of c-myc, cyciin Dl and peroxisome 
probTerator-activated receptor 8 (PPAR5) was analyzed. 
Increased expression of both c-myc and cyciin Dl genes 
have been implicated in cell proliferation, and carcino- 
genesis, and they represent two of the more important and 
closely studied target genes of the Wnt signaling pathway. 



quantitative PGR. 



Hg. 2. Distribution of FzdS fa, b), eytoJceratin 8 (c), E-cadherin (d) 
and P-otcnin (*) immuhoreacliviiy »„ paraffin sections from a rcnol 
tumor processed by an alcohol fixative. From the same tumor/a for- 
malin-fixed block was used Tor cyciin D I iromuoostaining Fz65 
immunosiainingshowj distinct immunoreaciiviiy in celt membranes 
and in nuclei of clear cell renal carcinoma. Clear.cells are immunore- 
active for cyiokeratin 8. fJ-Caienin and E-cadherin staining of mem- 
branes is rather weak, and not uniform, in clear cell renal carcinoma. 
Nuclear P-catenin immunoreactivity was not observed. In dear cell 
icnal carcinoma; many nuclei showed cyciin Dl immunoreactivity. 
The inset in f shows a detailed view of the cyciin Dl labeling of nuclei 
inxlear cell renal carcmoma. 



Resujts obtained revealed that both Fzd5 and Fzd8 
mRNA were overexpressed in the majority of renal carci- 
noma samples when compared to the matched normal 
kidney samples. Fzd2 and Fzd3 were upreguiated in 3 of 5. 
coloxi adenocarcinoma samples. Fzd3 was also upregu- 
iated in the ovarian tumor tissue samples compared to the 
matched normal tissue samples. None or the other Fzds 
evaluated showed a specific differential expression pat- 
tern in any of the samples studied. Fzd5 and Fzd 8 show 
69.1 % similarity and belong to the same subgroup of Fzds 
1 14]. The significantly higher expression of Fzd5 and. 
Fzd8 in the renal tumor samples, as compared to the nor- 
mal renal samples, suggests a higher probability that this - 
subgroup may be implicated in the progression of renal 
cancer. Therefore, we decided to further examine the pos- ' 
sible role of Fzd5 in renal carcinoma. 

We observed, using Western blotting, that protein lev- 
els were mostly consistent with mRNA levels in the tumor 
samples. Jn order to be able to determine the Fzd5 expres- 
sion in a broader range of kidney tissues, we utilized a 
tissue array. Fzd5 membrane staining was detected in 9 of 
30 (30%) clear cell carcinomas, and importantly, mem- 
brane staining was not detected in the matched 9 normal 
kidney tissue samples. 

Since the Wnt signaling pathway appears to play . an 
important role in embryonic development, in panicular 
embryonic kidney induction [15, 16), activation of this 
pathway in the adult kidney due to mutation or overex-. 
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Table 3. Correlation between Fzd5 and 
Poienin or cyclin DJ expression 



Fzd5,% 



£Catem"ri.+ 
p^Catehm- 
Cyclin Dl + 
Cyclin DJ - 



33 
.67 
89 
II 



57 
43 
38 
62 



T*ble A. Wnt/0-cafcnin large! gene mRNA expression 



pressjon of one of the components oflhe pathway could 
be a deterrmning factor in the devdopment of renal can- 
cers Therefore, several studies have JooJced into the possU 
bJe function the Wnt/^catenin pathway plays in renal 
caranogenesis. APC gene mutations have been demon- 
strated not to be involved in renal carcinoma [17, 18J J n 
addition, p-catenin mutations are rare events in renal car. 
cinoma [19, 2DJ. Nevertheless, cytoplasmic accumulation 
orp-catemn has.been reported in a number of renal cell 
caranomas [19], and thus the Wnt signaling pathway 



m 1 urn or samples 



Sample Tissue 



Tumor type 



x-fold expression increase* 



. 133702 
137770 
138844 
137146 
137564 
133408 
139188 
J 33699 
139064 
J34385 
140279 



omyc cyclin DI PPAR5 



kidney adenocarcinoma 
kidney renal cell carcinoma 
kidney renal cejj carcinoma 
kidney renal cell carcinoma 

kidney renal cell carcinoma 

kidney renal cell carcinoma 

kidney renal cell carcinoma 

kidney renal eclj carcinoma 

kidney renal cell carcinoma 

kidney renal cell carcinoma 

. kjdney renal eeJI carcinoma 



0.54 
J3.9 . 
2.39 
7.62 
33.82 
7.8 
222 
12.18 
22:}l 
1.79 
61.68 



4.52 


0,52 


28.91 


5.53 


31.49 


7.48 


15.38 


315 > 


19.65 


8.65 


8.92 


4.86 


9.92 


11.67 


22.73 


\ 5.53 . 


5.04. 


6.33 


1.14 


1.67 


54.95 


14.62 



1- 



US 



ti. 



s 



F«J5» 



Sample .dem.ficano,, number «e,e bv , he p a , ho) „ is , 

. R « uJ '"« "Pressed as j-fold.Wease of the^^ .• 

to its malched normal .issue sample after nnnlr ■ u 1 ,umo,I,ssue sam P fc compared 

ATT-syfi con.eot. A cu.ofT of Mold ^sed^T* ° n ,be baS, ' J ° nh ™ 
>*™)-eas^ 



b GO 



* SO 



40 

30 



* p < 0,01 



rids* 



r»ds- 



F» 9 . 3. fzdS 0-caremn and cyclin Dl immunosiainine of the CI I rm.i * 
. ^ays. The lefi column of images represents serial Ztln I f "rcmoma t,ssue 

show an overview of each lumor secrL on h^* fmm U, ^ 0rsa ^ insets 

»ivec| C3 rccll rena, carcinoma 5^i»^ «S The Fzd5-immunor cac- 

hing for cyclin Di W d tcets dist XKX* rT ^"^■'tenin (c). I w 
^e right column of images is taken from ser K ? T " C,C3 ' CciI rCnal C3 "inoma. 

dwinei membrane 0-ca.enio staining (d) ^ " D ' W bU ' d0e5sbcw 



F'*9. 4. Box plot charts (thick black line = 
median) illustrating the relationship be- 
tween Fzd5 immunosiaining and 0-caienirj 
(») and cyclin DI (b) expression in clear cell 
renal carcinoma. No significant correlation 
was observed between the fkratenin scores of 
FzdS-positive and -negative clear ceil renaj 
carc.noma. Nuclear cyclin. Dl staining m 
clear cell renal carcinoma showed a signifi- 
cant. di/Terence between FzdS-positive and 
F2d5-negative tumor samples. 
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might act as an inducer of tumorigenesis in the kidney. 
. This view is supported by the observation that aberrant 
activity of the Wnt signaling pathway has been reported in 
renal-cancer-derived cell lines. Zang et a). [2 1 ] observed a 
higher expression level of WntSa and Fzd5 mRNA in the 
renal cancer cell line GRC-1 than in the normal renal cell 
line Expression of p-cateniri was also higher in 

GRC-1 than in HK-2. 

To determine the status of the canonical Wnt signaling 
pathway m our renal carcinoma sanrpJes, we have quanti- 

• lated the mRNA levels of three important target genes of 
. T cell factor/Tymphoid enhancer factor activation by 

P-catenin. The mRNA levels of these three target genes 
(c-jnyc, cyclin D 1 and PPAR 5) correlated largely with the 

• expression, of Fzd5 in these samples, suggesting that the ' 
canonical pathway is activated. On the kidney tissue 
array, cyclin Dl protein expression showed a highly sig- 
nificant correlation with the Fzd5 expression in the tumor 
samples (table 3). Cyclin Dl protein is frequently. overex- 
pressed in various tumors, but in only a. proportion of the 
cases is it due to amplification of the cyclin Dl gene [22]. 

.There/ore, other mechanisms such as upregulatiori of gene 
. transcription may play a substantial role in the overex- 
pression of cyclin Dl [23-:26]. Our data, showing in- 
creased cyclin Dl expression in renal carcinoma samples, 
are consistent with the results of Stassar et a J. [27], They 
studied genes that are associated with human renal carci- 
noma by suppression subtractive hybridization and re- 
ported 14 differentially expressed genes, including cyclin 
Dl. Although we would have expected an increased nu- 
clear f^catenin staining, nuclear accumulation of 0-cate- 



nin was not observed in any of the tumors or on the tissue 
array. This result is consistent with the data presented for 
renal cell carcinomas by Kim eta!. {19]. They did not 
detect nuclear p-ca ten in staining in the 52 renal cell carci- 
nomas examined. The lack of nuclear 0-catenin staining 
has also been reported by others in tumors that might 
have arisen from Wnl/0-catenin pathway activation [28- 
31]. 

While expression of both Wnt5a and Fzd5 does induce 
duplication of the Xenopus head, exogenous expression of 
Fzd5 in a Xenopus model does not induce duplication of 
the head [32]. Fzd5 does not activate the p-catenin signal- 
ing pathway on its own, as the presence of its endogenous 
Jigand is also required. Our results.suggest that Fzd5 may . 
have a role in renal cell carcinogenesis due to its frequent 
overexpression observed in these tumor samples, and we 
hypothesize that if Fzd5 is overexpressed, it has a rather 
limited effect, on p-catenin signalings However, in the 
presence of its endogenous still unknown ligand, it acti- 
vates the canonical Wnt signaling pathway. The elucida- 
tion of this ligand and its binding characteristics is still 
under investigation. Ultimately, knowledge of the specific 
expression patterns of both Wnt and Fzd members could 
lead to directed tumor targeting or could be used as a 
tumor marker. 
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Immunohistochemical analysis of NY-ESO-1 antigen expression in normal 
and malignant human tissues. 

Jungbluth AA, ChenYT, Stockert E , Busam KJ , Kolb D. Ivej^enK £e^nK 
Williamson B , Altorki N. OjdjLJ. 

Ludwig Institute for Cancer Research, Memorial Sloan-Kettering Cancer Center, New 
York, NY, USA.jungblua@mskcc.org 

NY-ESO-1, a member of the CT (cancer/testis) family of antigens, is expressed in normal 
" testis and in a range of human tumor types. Knowledge of NY-ESO-I expression has 
depended on RT-PCR detection of mRNA-and there is a need for detecting NY-ESO-1 at 
the protein level. In the present study, a method for the immunochemical detection of 
NY-ESO-lin paraffin-embedded tissues has been developed and used to define the 
expression pattern of NY-ESO-1 in normal tissues and in a panel of human tumors. No 
normal tissue other than testis showed NY-ESO-1 reactivity, and expression in testis was. 
restricted to germ cells particularly spermatogonia. In human tumors, the frequency of 
NY-ESO-l antigen expression corresponds with past analysis of NY-ESO-1 mRNA 
expression e.g., 20-30% of lung cancers, bladder cancers and melanoma, and no 
expression in colon and renal cancer, Co-typing of NY-ESO-1 antigen and mRNA 
expression in a large panel of lung cancers showed a good correlation. There is great 
variability in NY-ESO-1 expression in individual tumors, ranging from an infrequent 
homogeneous pattern of staining to highly heterogeneous antigen expression. Copyright 
2001 Wiley-Liss, Inc. 
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Multidrug resistance phosphoglyco protein (ABCB1) in the mouse 
placenta: fetal protection. 

KalabisGM , Kostaki A , Andrews Mil, Petronoulos S. Gihh W, Matthews SG . 

Department of Physiology, University of Toronto, Ontario, Canada. 

The multidrug resistance phosphoglycoprofein ATP-binding cassette subfamily B 
(ABCB1) actively extrudes a range of structurally and functionally diverse xenobiotics as 
well as glucocorticoids.. ABCB 1 is present in many cancer cell types as well as in normal 
tissues. Although it has been localized within the mouse placenta, virtually nothing is 
known about its regulation. In the mouse, two genes, Abcbla and Abcblb, encode 
ABCB 1. We hypothesized that there are changes in placental Abcbla and Abcblb gene 
expression and ABCB1 protein levels during pregnancy. Using in situ hybridization we 
demonstrated that Abcblb mRNA is the predominant placental isofotm and that there are 
profound gestational changes in the expression of both Abcbla and Abcblb mRNA. 
Placentas from pregnant mice were analyzed between Embryonic Days (E) 9 5 and 19 
(term approximately 19.5d). Abcblb mRNA was detected in invading trophoblast cells 
by E9.5, peaked within the placental labyrinth at E12.5, and then progressively decreased 
toward term (P < 0,0001). Abcbla mRNA, although lower than that of Abcblb at 
mtdgestation, paralleled changes in Abcblb mRNA. Changes in Abcbl mRNA were 
reflected by a significant decrease in ABCB 1 protein (P < 0.05). A strong correlation 
existed between placental Abcblb mRNA and maternal progesterone concentrations 
indicating a potential role of progesterone in regulation of placental Abcblb mRNA.' In 
conclusion, there are dramatic decreases in Abcbla and Abcblb mRNA and in ABCB1 at 
the maternal-fetal interface over the second half of gestation, suggesting that the fetus 
may become increasingly susceptible to the influences of xenobiotics and natural steroids 
in the maternal circulation. 
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Expression of human telomerase reverse transcriptase gene and protein 
and of estrogen and progesterone receptors, in breast tumors: Preliminary 
data from nco-adjuvant chemotherapy. 



gggffl^ fasmaiN, HasjumotoM, Nakapuir^ Ql^g T , Kurabavashi R 
NaojciH, HonmaN, Ogawa T, Kaminislii M . TakHh^ 1 ~' 

Division ofBreast and Endocrine Surgery, Department of Surgery, Graduate School of 
Mediae, The Umvcrs.ty of Tokyo, Bunkyo-ku, Tokyo 1 13-8655, Japan, kanmori- 

ais@umin.ac.jp. 

Human telomerase reverse transcriptase (hTERT), the catalytic subunit of telomerase is 
very closely associated with telomerase activity. Telomerase has been implicated in ' 
cellular immortalization and carcinogenesis, in situ detection of hTERT will aid in 
determining the localization of telomerase-positive cells. The aim of this study was to 
detect express.on of hTERT mRNA, hTERT protein, estrogen receptor (ER) and 
progesterone receptor (PR) in paraffin-embedded breast tissue samples and to investigate 
the relationship between hTERT expression and various clinicopamological parameters 
in breast tumortgenesis. We used in situ hybridization (ISH) to examine hTERT gene 
expression, and immunohistochemistry (IHC) to examine expression of hTERT protein 
ER and PR, m breast tissues including 64 adenocarcinomas, 2 phyllode rumors and their 
adjacent normal breast tissues. hTERT gene expression was detected by ISH in 56 (88%) 
carcinomas but in neither of the 2 phyllode tumors. hTERT protein expression was 
detected by IHC m 52 (81%) carcinomas, but in neither of the 2 phyllode tumors 

Moreover, ER and PR were expressed in 42 (66%) and 42 (66%) carcinomas 
respectively, and in neither of the 2 phyllode rumors. In 4 cases of breast carcinoma that 
strongly expressed hTERT gene and protein before treatment, neo-adjuvant 
chemotherapy led to disappearance of gene and protein expression in all cases There was 
a strong correlation between detection of hTERT gene expression by ISH and of hTERT 
protein by ICH in tissue specimens from breast tumors. These results suggest that 
detection of hTERT protein by ICH can be used to distinguish breast cancers as a 
potential diagnostic and therapeutic marker. 
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Abstract. Human telomerase reverse transcriptase (hTERT). 
the catalytic subunit of telomerase. is very closely associated 
with telomerase activity. Telomerase has been implicated in 
cellular immortalization and carcinogenesis. In situ detection of 
hTERT will aid in determining the localization of telomerase- 
positive cells. The aim of this study was io detect expression of 
hTERT mRNA. hTERT protein, estrogen receptor (ER) and 
progesterone, receptor (PR) in paraffin-embeoded breast tissue 
samples and lo investigate the relationship between hTERT 
expression and various clinicopaihologic a! parameters in breast 
rurnorigenesis. We used in situ hybridization (15H) to examine 
hTERT gene expression, and imrnunohistochemistry (1HQ to 
exaroine'expression of hTERT proiein. ER and PR, in breast 
tissues including 64 adenocarcinomas. 2 phyllode tumors and 
their adjacent normal breast tissues. hTERT sene expression 
was detected by 1SH in 56 (88%) carcinomas, but in neither 
of the 2 phyllode tumors. hTERT protein expression was 
detected by IHC in 52 (81%) carcinomas, but in neither of 
the 2 phyllode rumors. Moreover. ER and PR were expressed 
in 42 (66%) and 42 (66%) carcinomas, respectively, and in 
neither of the 2 phyllode rumors. In 4 cases of breast carcinoma 
thai strongly expressed hTERT gene and proiein before 
treatment, neo- adjuvant cbemoihcrapy led to disappearance of 
gene and proiein expression in all cases. There was a strong 
correlation between deteclion of hTERT gene expression by 
1SH and of hTERT protein by 1CH in tissue specimens from 
breast lumors. These results suggest that detection of hTERT 
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protein by 1CH can be used to distinguish breast cancers as a 
potenlial diagnostic and therapeutic marker. 

Introduction 

Breast cancer is the most frequent malignancy in women, 
affecting up to one in every eight females worldwide. The 
most important cbnicopathological prognostic parameter so 
far identified is the absence or presence of lymph node 
metastasis, but the identification of further parameters for 
both lymph node-positive and -negative patients would 
facilitate an individually based risk-directed therapy (1). A 
promising emerging molecular marker is telomerase. a 
ribonuclcoprorein enzyme complex. whicl* when actuated 
or upregulaied allows tumor cells lo escape from cellular 
senescence and to proliferate indefinitely (2). The human 
telomere is a simple repeat sequence of six bases (TTAGGG) 
that is located el the ends of each chromosome (3). Telomeres 
are believed to protect against degeneration, reconstruction, 
fusion, and loss (4) and to promote the homologous pairing 
of chromosomes (5). The end-to-end chromosome fusions 
observed in some lumors may result from the loss of telomeres 
and may be partly responsible for the genetic instabihty 
associated with tumorigenesis. Telomerase catalyzes the 
synthesis of telomere DNA and facilitates cell immortalization 
through ihe stabilization of chromosomal structure (6-8). 
Alihough the expression of the human RNA component of 
reJomerase (hTERC) is widespread, the restricted expression 
pattern of the mRNA of hTERT, the human telomerase 
catalytic subunit eene, is correlated with telomerase activtty 
(8-13). As telomerase activity seems to be the key player m 
tumor cell immortality, it has importance as a target molecule 
for anti-cancer therapy. Telomerase aciiviiy has been shown 
to correlaie with poor clinical outcome in neuroblastomas 
and oihcr tumors (14). For breast cancer, howevei. lelomerase 
activity is a controversial prognostic marker; some stud>es 
surest thai telomerase activity, clinicopaihological parameters 
and disease outcome are linked, whereas oihers do not find 
this association (14-23). 
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We have succeeded in very clearly and sensitively demon- 
strating hTERT roRNA in thyroid, colorectal, parathyroid 
and lung tissues by use of an oligonucleotide probe ( 1 3 ,24-26). 
Strong correlation has been observed between hTERT mRNA 
and^or protein expression and leJornerase in a variety of 
malignant tumors. (13.14,24.25.27,28). In the present study, 
we used 1SH to examine expression of the hTERT gene, and 
1HC to examine expression of hTERT protein, ER and PR, in 
64 carcinomas and 2 phyllodes rumors of breast to determine 
whether hTERT protein can be used to differentiate breast 
cancers. Wc also analyzed hTERT mRNA and protein 
expression with special reference to clinical features and 
histological findings to investigate the potential role of 
hTERT mRNA expression analysis in predicting the biological 
characteristics of breast cancers. Since hTERT expression in 
breast tumors has not previously been analysed by 1SH or JHC, 
our investigation also examined various climcopathoJogical 
parameters, including age, hist opatbplogical type, tumor size, 
lymph node status, relapses, and the expression of ER and PR. 

* Materials and methods 

Tbsvc collection. Sixty-six samples were obtained during 66 
mastectomies: 64 breast carcinomas, 2 phyilodes tumors and 
66 specimens of the adjacent normal breast gland. In 4 cases, 
samples were obtained during core needle biopsies (CNB) 
before neo-adjuvant therapy, and again during mastectomies 
after neo-adjovant therapy; for these cases, all measurements 
and examinations were performed both before and after the 
neo-adjuvant therapy. The patients ranged in age from 32 to 
90 years. The patients with carcinomas ranged in age from 37 
to 90, mean 56. years and were all women. The women with 
phylJode tumors were aged 32 and .38 years, respectively. 
The surgical and CNB tissue samples were frozen rapidly 
wjib liquid nitrogen and stored at -80X until fixation. Then, 
ihey were fixed in 10% buffer formalin solution and 
embedded in paraffin. Surgical and CNB samples were 
collected from the patients after obtaining their informed 
consent, and the study protocol was approved by the Medical 
Department of the University of Tokyo Ethics Committee. 
The pathologic diagnoses were made by the surgical 
pathological specialists at our insiituie on the basis of 
examination of hematoxylin-eosin stained slides. A 
pathological review was performed for all breast tumors 
according to the BRE score. pT and pN staging were assigned 
according to the 1997 WHO classification (7th edition). 

MCF-7 human breast cancer cells, kindly provided by the 
Cell Resource Center for Biomedical Research. Institute of 
Development. Aging and Cancer. Tohoku University, were 
used as positive controls. The cells were incubated in RPMl- 
J640 medium with 25 mM HEPES buffer. L-glutamine. and 
)0% fciaf bovine serum (Cibco, Grand Island. NY) on a 
chamber-attached slideglass (Lab -Tek 3 Chamber Slide 1 ": 
Nalge Nunc International. NapervjJlc. IL) in a humidified 5% 
CO, atmosphere at 37*C The cells were then fixed with 10% 
buffered neutral formalin (Sigma Chemical Co.. St. Louts. 
MO). The cultured MCF-7 cell line that was used as a positive 
control was tested for telomerase with a PCR- based standard 
TRAP assay (6.! 3). These cells were also used to prepare cell 
blocks. Briefly, the cells were fixed in 1 09b buffeied neutral 



formalin, resuspended in molten agarose and then embedded in 
paraffin. Sections from these cell blocks were used as positive 
procedural controls in 1SH and IHC. The negative control in 
1SH was obtained by replacing the oligonucleotide probe with 
RNase. The negative control in IHC was obtained by replacing 
the primary antibody with Tris-buffercd saline (TBS). 

Oligonucleotide probe for JSH. The specificity of the oligo- 
nucleotide sequence was initially determined by a GenEMBL 
database search using the Genetics Computer Grotfp Sequence 
Analysis Program (GCG. Madison, WI) based on Dx fesiA 
algorithm (29); the sequence exhibited 100% homology 
with the hTERT gene sequence, A dCD* oligonucleotide 
was used to verify the integrity and.lack of degradation of 
the mRNA in each sample. All oligonucleotide probes 
were synthesized with a hapitn-Jabeled nucleotide, such a* 
digoxigeninrdUTP (Boelmnger-Mannheim), al the 3' end via 
direct coupling by using standard phosphoramidite chemistry 
(Research Genetics, HuntsvMe, AL) (30). The probe used for 
detection of hTERT by 1SH was generated from the original 
sequence for Homo sapiens telomerase reverse transcriptase 
(AF0 15950), 2766-2S00: 5*-GCCTCGTCTTCTACAGGGA 
AGTTCACCACTGTCTr-3' (13,24-26). 

1SH. 1SH was performed with the GenPoint nucleic acid hyper- 
detection system (Dako, Carpenteria, CA) .(31). FormaJw- 
fixed, paraffin-embedded tissue sections (5 urn thick) were 
deparaffinhed in xylene and a graded alcohol series. Tissue 
sections and CNB samples were then prclreated with target 
retrieval solution (Dako, SI 700) at 95*C and proteinase K 
(DBko, S30CW) at room temperature. Next* the tissues and 
CNB samples were fixed in 03% hydrogen peroxide followed 
by a methyl alcohol series al Toom temperature. Digoxigcnin- 
labeled ami-sense oligonucleotide in mRNA in situ hybri- 
dization solution (Dako, S3304) was pbced over the tissues 
and CNB samples. After hybridization at 37*C overnight, 
the slides were washed in stringent wash solution (Dako. 
GenPoint System Kit) at 4yC.The tissues and CNB samples 
were exposed to avidin blocking solution (Dako, X0590) 
at room temperature followed by biotin blocking solution 
(Dako, X0590) at room temperature. The tissues and samples 
were then incubated ai room temperature with a sheep 
monoclonal hapien-labcled anti-digoxigenin antibody (Dako. 
p5J04), and the slides were then fixed with biotinyl tyraminc 
(Dako, GenPoint System Kit) at room temperature. Finally, 
the Slides were incubated with HRP-conjugatcd strcptavidin 
(Dako, CenPoini System Kil) ai room temperature. Since 
33'diaminobenzidine letrahydrocbloride (DAB) was used as 
the substrate, a positive reaction was visible as a brown color 
under a light microscope. The sections were weakly counter- 
stained with 0.1% hematoxylin. 

JHC. IHC was performed by the avidin-biolio complex/ 
horseradish peroxidase method. Tissue sections were stained 
for hTERT with a commercially available monoclonal antibody 
(NCL-L-hTERT; Novocastra. Newcastle upon Tyne. UK). 
Sections were dewaxed in xylene. Antigen retrieval was done 
by incubating sections immersed in 00 1 M citrate buffer at 
pH 6.0 in o rnioowave oven at 99'C. The sections were 
allowed to cool down at room temperature. The sections 
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Figure I. Representative results of the .TRAP assay, If functional tetomcrase is 
present, ihc tniymc adds DNA lo the substrate in 6-base-pair (bp) mrnmtnls ( 
resulting m a ladder- tike disrribuiion of products. The 6- bp ladder signals art 
apparent for MCF-7 and breast cancers (case nos. 6. 14, 23. 44 and 59) and arc 
no* apparent for lysis buffer as iSe negative control. An ca tract of MCF-7 was 
used as a poaiovt control for the TRAP assay and as ah Inltmal Telomerase 
Assay Standard (JTAS) positive control for PCR ampiifkiiJbn, with rysis 
buffer as the negative control (Lysis). 



were then immersed in )% hydrogen peroxide (B 2 0 2 ) in 
methanol to. block endogenous peroxidase activity. Following 
that, the sections were washed in TBS (pH 7.6) before being 
incubated in normal rabbit serum for 20 min to block non- 
specific binding. After, draining off the excess serum, tbc 
sections were incubated with the primary antibody at room 
temperature. The sections were washed in TBS before being 
incubated with the secondary antibody (biotinylated rabbit 
anti-mouse, Dako). The sections were washed again with 
TBS and incubated with avidin biotin complex/horseradish 
peroxidase. After washing the sections with TBS. peroxidase 
activity was visualized under light microscopy by applying 
DAB chromogen (Dako). The sections were countersigned 
with hematoxylin; dehydrated in increasing grades of alcohol 
and finally mounted in dibutyl phthalate (DPX) mountanl. 

Homogeneous staining or a speckled/dotted pattern in the 
nucleus was considered positive staining, and absence of 
distinct nuclear staining was taken as negative staining. 
Grading of the percentage of stained cells (hTERT labeling 
index) was performed by previously published. criteria (J) as 
follows: Grade 1, negative staining; Grade 2, 1-10% positive 
staining nuclei; Grade 3, 11-50% positive nuclei; and Grade 4, 
>50% positive nuclei.. Immy nos rained slides for ER and 
PR were scored as previously described (32.33). Jn brief, 
each entire slide was evaluated by light microscopy. Frrst, a 
proportion score was assigned, which represents the estimated 
proportion of positive-staining tumor cells (0, none; ] , < J/100; 
2. 1/100 to 1/10; 3. 1/10 to 1/3; 4, 1/3 to 2/3; and 5, >2/3)* 
Next, art intensity score was assigned, which represents the 
average intensity of positive tumor cells (0. none; 1. weak. 
2, intermediate: and 3. strong). The proportion and iniensiry 
scores were then added to obtain a total score, which ranged 
from 0 to 8. Slides were sco;ed by pathologist who did not 
have knowledge of ligand-binding testis or patient outcome. 



Table I. Relationships between mRNA status (negative/ 
positive) by ISH and standard clinical, pathological* and 
biological factors in the 66 tumors. . 



Total 



No, of patients (%) 



population hTERT hTERT * 

(%) negative positive P-valuc* 



Total 


66 


10(15.2) 56(84.8) 




' Aec 






NS 




26 


6123 Y\ 20 1*76 9> 




>50 


40 






Hisiopatho- 








Jogical type „ 






NS 


Scirrhous 


32 


. 2 (6.4) 30(93.6) 




PapilJorubolar 


20 


2(10 0") ISfQOO) 




Solid, tubular 


6 


i(16J) 5(833) 




Mucinous 


2 


) (50.0) . 1 (50.0) 




Non-invasive 


4 


2(50.0) 2(50.0) 




PhyJlodes 


2 


2 (100) 0 (0) 




Tumor size 






NS 










T) (<2.0) 


20 


O t(Y\ 70 nnfft 

V \*J J jC\J \ J lAJ ) 




T2 (2.0-5.0) 


34 . 


6(17.6) 28(82.4) 




T3(>5.0) 


10 


2 (20.0) 8 (80.0) 




Lymph node 








status* 






NS 


pNO 


34 


6(17.6) 28(82.4) 




pN) - 


26 


2 (7.7) 24(92.3) 




pN2+pNM 


4 


0 (0) 4 (100) 




Relapse 






NS 


+ 


10 


0 (0) .10 (J00) 






56 


10(17.6) 46(82.4) 




ER expression 






NS 


+ (*2) " ' ' 


42 


6(143) 36(85.7) 




- (<2) 


24 


4(16.7) 20<833) 




PR expression 






NS 


+ <*2) 


42 


4 (9.5) 38 (90 5) 




- <<2) 


24 


6(25.0) 18(75H) 





V rest. NS. not significant. ^Information available for 64 patients. 



Statistical analysis. Differences in p-values were analyzed 
with the x 7 'est for independence, and Fisher's test was used 
for correla lions. In al) comparisons. p<0.05 was considered 
significant. 

Results 



Representative results of the TRAP assay a^e shown in Fig, !. 
The cultured cells, which were rested for teJomcrasc activity 
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.tcrp*. (EH) by IHC and p,o S cM«one ,tce P ,o, (PR) by IHC in b,caM onto. (>. HAE): (b. hTERT mRNA); <c, hTERT pro.cn); <d. ER) >nd («. PR). 



with the TRAP assay, cave positive results with all procedural 
controls (MCF-7 and 5 breast cancer samples) (Fig- I). 

1SH revealed that hTERT mRNA was strongly expressed 
in the nuclei and cytoplasm of almost all of the MCF-7 human 
cancer cells (data not shown). Expression of hTCRT mRNA 
was detected in?56 (88%) of the 64 breast cancers and in none 
of the pbyllodes tumors of the breast (Table )) with ibe anli- 
. sense probe, whereas no expression was detected with the anti- 
sense probe ueaied with RNase (data not shown). The levels of 
expression were hetcioseneous within the carcinomatous 
regions- Strong expression of hTERT mRNA was not confined 
lo the carcinomatous regions but was also detected in 
infiltrating lymphocytes (Fie. 2a and b). Higher expression 
levels of boih signals of hTERT mRNA were delected in 
some sections containing both carcinomas and lymphocytes, 
compared wiih the adjacent non-cancerous mammary gland, 
but no clear differences in signal intensity were observed 
between carcinomas and lymphocytes. The signals in both 
the normal and cancer tissues were mainly present in the 
lymphocytes, ;>.nd ihe signal intensity was similar in both, 
although a precise quantitative comparison of the in sini signals 
was impossible. 



JHC revealed that hTERT protein was strongly expressed 
in the nuclei, nuclear membrane and cytoplasm of almost all 
of the MCF-7 human. cancer cells (data not shown). Expression 
of hTERT protein was delected in 52 (81%) of the 64 breasi 
cancers and in none of the phylJode tumors of the breast 
(Table II). The levels of expression were heterogeneous within 
the carcinomatous regions. As shown in Fig. 2c, strong 
expression of hTERT protein was observed in nuclei, nuclear 
membrane and cytoplasm, similar lo the pattern in MCF-7 
human cancer cells. Normal mammary gland and stromal 
cells generally showed negative immunoreactivjty against 
hTERT protein antibody. 

A nuclear signal for the ER (Fig. 2d), as assessed by IHC 
was obser/ed in 36 (56%) of the 64 breast cancers and in 
none of ihe phyllode tumors of Ihe breasi. with positive 
scores ranging from 2 lo 8 (Tables I and 11). A nuclear signal 
for the PR (Fig. 2e). as assessed by IHC. was observed in 38 
(59%) of the 64 breasi cancers and in none of the phyllode 
tumors of the breasi. with positive scoies ranging, from 2 to 8 
{Tables 1 and II). 

We used ISH and IHC to examine hTERT expression 
in 4 cases of breast cancer before and after neo adjuvant 



Table H Relationships between mRNA stilus (negative/ 
positive) by 1HC and standard clinical. pathological and. 
biological factors in the 66 tumors. 



Table III. Relationship of hTERT mRNA and protein 



No- of patients (%) 

Total hTERT hTERT 

negative positive P-value 3 



Total 

Age 
<50 
>50 

Histopathologic*! . 

type 

Scirrhous 

papillotubular 

Solid tubular 

Mucinous 

Non-invasive 

Phyllodes 

Tumor size (cmy 
Tl (<2-0> 
T2 (2.0-5.0) 
T3 (>5.0) 

Lymph node status* 
pN'O 
P N) 

P N2+pNM 
Relapse 



ER expression 

+ te2) 

- <<2) 

PR expression 
-M>2) 

- i<7) 



66 14(21.2) 52(78,8) 



26 


10(38.5) 


16(61.5) 


40 


4(10.0) 


36 (90,0) 


32 


2 (6.4) 


30 (93.6) 


20 


4 (20.0) 


16(80.0) 


6 


: 4 (66.7) 


2 (33 3) 


2 


0 (0) 


2 (100) 


4". 


2 (50.0) 


2(50.0) 


2 


2 (100) 


0 (0) 


20 


4 (20.0) 


16(80.0) 


34 


6(17.6) 


28(82.4) 


10 


2(20.0) 


8 (80J0) 


34 


6(17,6) 


28 (82!4) 


26 


6(23.1) 


20 (76.9) 


4 


0 (0) 


4 (100) 



NS 



NS 



)0 1 (10.0) 9(90.0) 
56 13(23.2) 43(79.6) 



42 6(143) 36(85.7) 
24 8(333) 16(66.7) 



42 6(143) 36(85.7) 
24 8(33.3) 16(66.7) 



NS 



NS 



NS 



NS 



NS 



L / ? lest. NS. noi significant. "Information available for 64 patients. 



chemotherapy. Before chemoiheiapy . all 4 of the breast 
carcinomas strongly expressed hTERT by both JSH arrd 
1HC After chemotherapy. hTERT expression completely 
disappeared in all 4 cases (Table 1)1). hTERT expression by 
lymphocytes was detectable by 1SH and 1HC both before and 
after chemotherapy in all 4 coses, and the level of expression- 
did not appear to be altered by.treaimeni. 

No correlation was observed between hTERT mRNA 
expression and any of the clinicopathoios.cal parameters 
age. hisiopathoiosicaJ type, lumoi <i7e. lymph node status. 







hTERT mRNA hTERT protein 


Case Age 


Neoadjuvant . 


Before After Before After 


I 80 


Anasuozole 




2 78 


Anasuozole 


+ ' + 


3 35 


FEC* 


+ + 


4 37 


AO 




"PEC 5FU (500 rng/m^Epirubicin (70 mg/m^ Cyclophosphamide 
(500 mg/m>).*AC, Doxorubicin (60 mg/nV). Cyclophosphamide 
(500 mg/m*). Before; before neoadjuvant chemotherapy. After, 
after neoadjuvant chemotherapy. positive; -,negaiiVe. 



relapses, and ibe .expression of ER and PR, Similarly, there 
was no correlation between hTERT protein expression and 
any of these clinicopathological parameters. There 
correlation between hTERT mRNA expression and hTERT 
protein expression in breast cancers (p<0.005). 

Discussion 

This study reports a comparison of hTERT mRNA expression 
by 1SH and hTERT protein expression by 1HC m tissue 
sections from breast tumors. hTERT mRNA was detected 
by ISH in 56 of the 64 breast cancers arid in MCF-7 human 
breast cancer cells. Breast cancer cell nuclei stained strongly 
positive with the specific anti-sense probe but not with the 
anti-sense probe treated with RNase (data not shown). Tissue 
lymphocytes also stained posiiivety.with the anti-sense probe, 
but the stromal cells did not. Expression of hTERT protein 
was observed by 1HC in 52 of ibe 64 breast cancers. hTERT 
mRNA and protein expressions were highly correlated in 
breast cancers <p<0.005). Deteciion of the hTERT protein by 
1HC has permitted further analysis of carcinogenesis and 
cancer diagnosis (34). 

In recent years, there has been disagreement over the 
suitability of telomeiase activity as a prognostic biologic 
marker in breast cancer thai may help to differentiate patients 
for individually based risk-related therapy. Hiyama tt ol (15), 
in a study of 140 breast canceT specimens with the TRAP 
assay, found a strong association between leiomerase activity 
and stage classification and observed telomerase activity 
in 68%W stage 1 tumors and 95% of stage IV tumors. 
Poremba 1 1 ol{\). using (issue microanays. found a statistically 
significant correlation berween rumor-specine survival (overall 
survival) and hTERT expression in breast cancer. However, 
some problems in interpretation may affect this apparent 
consensus. First, some samples of breast cancer tissue may 
be extensively contaminated by infiltrating lymphocytes 
durin- operative manipulations. especially in advanced 
causing bvercsiirnation of telomerase activity and/oi hTERT 
e*p,ession. In our previous reports, higher expression levels 



of signals Tor both hTERT mRNA and protein were Elected 
5n some sections containing born carcinoma and lymphocytes 
in thyroid and colorectal cancers (13,24). Secondly, Poremba 
et a! (1) used polyclonal antibodies against hTERT prolem as 
a signal for expression. In our bands, polyclonal antibodies 
against hTERT protein give rise to strong background signals 
and art not clearly specific for measuring expression in cancer 
tissues. We have carefully compared the reactivity against 
hTERT protein of the monoclonal antibody used in. the 
present study with thai of some polyclonal antibodies. Use of 
the monoclonal antibody in UK allowed clear demonstration 
of hTERT protein expression, with results similar to those 
oflSH for hTERT mRNA expression. Furthermore, JHC is 
technically much easier to perform than 1SH, since con- 
tamination of samples by RNase is not an issue irt JHC. 

To the best of pur knowledge, this report is the first on the 
study of hTERT expression in breast cancer as a function of 
neoadjuvant treatment. We examined hTERT expression 
in 4 cases of breast cancer before and after chemotherapy. 
Before chemotherapy, hTERT was strongly expressed in 
all 4 carcinomas, but after chemotherapy hTERT expression 
had completely disappeared in all 4 cases, hTERT expression 
by. lymphocyte's was detectable by JSB and JHC both before 
and after chemotherapy in all 4 cases, and the level of 
' expression did not appear to be altered by treatment. 

In conclusion, determination of hTERT mRNA expression 
by ISH and hTERT protein expression by JHC can be used to 
obtain information contributing to a histopathologic^! diagnosis 
during screening of breast cancers. By use of a monoclonal 
antibody, we could very clearly and sensitively demonstrate 
hTERT protein expression in breast cancel tissues but not in 
noncancerous tissues. We also demonstrated that 4 carcinomas 
with originally positive immunoreactivity against hTERT 
protein became negative afier neo adjuvant chemotherapy, 
these results suggest that determination of hTERT pjotein 
by 1CH can be used as a potential diagnostic and therapeutic 
marker to distinguish breast cancers. 
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Expression of the ubiquitui-proccasomc pathway and muscle loss in 
experimental cancer cachexia. 

Pharmaceutical Sciences Research Institute, Aston University, Birmingham, UK. 

Muscle protein degradation is thought to play a major role in muscle atrophy in cancer 

Z, VTVT thC im P° rtance of the ubiquitin-proteasome pathway wSL 

been sugg ted to be ^ ^ dcgradative p - ^ P p Ll lossTn 

c achex,a, the express™ of mRNA for proteasome subunits C2 anfc5 as weT. as tie 
ub.quum-conjugatmg enzyme, E2(I4k), has been detennino! in gaslrocnemru and 
pectoral muscles of mice bearing the MAC16 adenocarei.oma, using competinvJ 
quant tat, ve reverse transcriptase polymerase chain reaction. Protein levels of oroteasome 

fl^f WCre deteimined by '^-^^toen^S^ffiT 
were reflected m changes in protein expression. Muscle weights correlatedllneaVrv^ 
wcght loss dunng the coun* of the study. There was a good^aelationteS^ 

in?rrofi?^^ E2 ^ 

ZZS . ° (6 f Md for 02 ^ two - fM ^ E2(14k) between 12 and 20% weight loss 
fohowed by a decrease m expression at.weight losses of 25-27%, although lossff muSle 

TlZT Tt ^ ? ntraSt> eXpreSSi ° n ° f 05 mRNA °"'y i'^ased two-foTd and wal 
cfevatod s.rmlarly at all weiglu losses between 7.5 and 27% Both P roteasome^ca 0 3 
act.v.ty, and proteasorrie-specific tyrosine release as a measure of tota proSn 
Ssl° e n ^ a,S ° maX, ' mal 3t ,8 " 20% ,oss decreased at height loss 

m„^T i ( i } 0lJy being seen at ^e" 1 ,osses a »ove 1 7% although 

s^^ r r d r B ? My with increasing weigiit ,oss - These -u/ts s Ug ;t 

, ub,< l ultln -P rotea so m e pathway plays a major role in protein loss in 
g strocnemms muscle, up to 20% weight loss, but that other factors such as depress on in 
protem synthesis may play a more important role at higher weight loss. aepnssioam : 

PMID: 16160695 [PubMed - in process] 
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Increased expression of proteasome subunits in skeletal m «scle of cancer 
patients with weight loss. 

Pharmaceutical Sciences Research institute, Aston University, Birmingham B4 VET, UK. 

Afrophy of skeletal muscle is common in patients with cancer and results in increased 
morb.d.ty and mortality. In order to design effective therapy the mechanism by vS 

pr«c 10 ^ e ' UCidated - M ° St StUdiCS SUggCSt th3 " he "biquitin'otealome 
proteose pathway ,s most important in intracellular proteolysis, although there have 
been no reports on rhe activity of this pathway in patients wiui diflhJS^ofSLht 
loss. In th,s report the expression of the ubiquitin-proteasome pathway i rlchis 
«bdom,ru S muscle has been determined in cancer patients with weighflossS-34-/. 

expression ot mRNA fpr proteasome subunits C2 and C5 while nmi-«« ~ v , 
been determined by western blotting. Overall. bi a^J^SS^^ 

without Sn " r ^.Ll ' th3t m patiCntS With0ut wei ^ K»s. with or 

wiinour cancer. The level of gene expression was dependent on the amount of weiriit 

S3 for b ° th P rote ~ -"units in patientsS weS £ of 

12 19/.. Further mcreases m weight loss reduced expression of mRNA for bom 

SSET S UmtS ' alth ° Ugh * deVated in — P-son wTp t L Ate with no 

manlo r, "° for m inCreaSe in ex P ressio » * wefght losseT less 

than 10 /..There was a good correlation between expression of proteasome ZOSdoS 
suburuts, detected by western blotting, and C2 and C5 mRNA, SSSt 
gene expression resulted in increased protein synthesis. Expre^sionSe uo Z to 
conJ»gat,ng en, yme, E2(14k), with weight.loss followed asimilar pa tZ £ Z of 

PM[D: 1 6 1 25 11 6 [PubMed - in process] 
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CeU type-specific occurrence of caveolin-lalpha and -lbcta in the hme 
caused by expression of distinct mRNAs. 

Kogojg, Ajba T, Fuiimoto T . 

School of Medicine, Showa-ku, Nagoya 466-8550, Japan, hkogo@fujita-hu.ac.jp 
Two isoforms of caveoHn-l, alpha and beta, had been thought to be generated by 
a variant mRNA (5 V mRNA) encodes the beta isoform specifically In the present stndv 

ZrfZTZ *Z one con " e,ation betwem the expression ° f the «vX S^ 

isoforms and mRNA vanants ,n culture cells and the developing mouse lung The alpha 
isoform protein and FL mRNA were expressed constantly during the lung devek>pmenT 
whereas expression of the beta isoform protein and 5'V mRNA was oeg&K^Su 
lung before , 7.5 days post coitum, and markedly increased simultaneous!?' , S , dat 
pos coitum, whence alveolar type I cells started to differentiate. Immunotu tochemTal 

cell expresses thebeta isoform predommantly, while the endothelium harbors the aloha 

S T 8 , ! eCt,ve P lasmametnb rane preparation obtained from the 

SSm I iZt eP,tlie,ia J Cd,S - ^ PrCSent rCSU,t * at *« two 

caveolui-1 isoforms are generated from distinct mRNAs in vivo and that their production 

2Z Tft l : dCP f dmt K 3t ^ ******** level. The result also sugges £ftc 
alpha and beta isoforms of caveolin-1 may have unique physiological fusions 

PM1D: 15067006 [PubMed - indexed for MEDLINE] 



493: Acta Obstet Gynecol Scand Suppl. 1992;1 55: 19-24. Related Articles. Links 

Oncogene and growth factor expression in ovarian cancer. 
KommossF, Bauknecht T, BjrmgjinG, KojijerJ^ TescbH, HjcidcrerA. 
Department of Gynaecology, Albert^udwig University, Freiburg, Germany. 

^r^?r r f" l °^ 1 beh3Vi0r ° f ° Varian probably influences both 

? r 0p !f ''f md re Wc to chemotherapy, which are the most relevant prognostic 
factors The phcnotype of different ovarian carcinomas is obviously associated S m 

Analyse of EGF-R, TGF-alpha, c-myc. and c-jun expression in 33 stage IWW, and 2 
stage I/Ifovanan carcinomas with biochemical, molecular-chemical and 
.mmunoru^stochernical methods showed a correlation between the mRNA and protein 
levels of EGF.R and TGF-alpha for tumor, with low or high expressing rates. Swever, 
the concentrate of measurable free EGF-Rs seems to depend on the amount of TGF- 

^I*!*^ 10 nb y aie ^ or S-Th e EGF-RbindingHgandTGF-alphaisproducedby 
epithelial rumor cells; stromal cells are usually TGF-alpha-negative, as shown by 
rnimunofestochemistry. High expression rates of EGF-R. TGF-alpha arid c-myc were 

£S 1 m f 10 ° Ut J ° f 35 ° Varian Carc,nomas > respectively. C-jun mRNA was 
detected ,n 18/19 cases studied. Non-malignant tissues originating from myometrium or 
ovary expressed no (or only small amounts of) EGF-R or TGF-alpha mRNA, whereas a 
high c-myc expression was found in 1/7 normal myometria, and in 2/5 normal ovaries 
There was no strong correlation between EGF-R/TGF-alpha and c-myc/c-iun 
expression.(ABSTRACT TRUNCATED AT 250 WORDS) 

PMID: 1502888 [PubMed - indexed for MEDLINE] 
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Somatostatin receptors in primary human breast cancer: quantitative 
analysis of mRNA for subtypes 1-S and correlation with receptor protein 
expression and tumor pathology. 

Kum3rJJ,Grigora!^ 

Fraser Laboratories For Diabetes Research, Department of Medicine, McGill University 
Royal Victona Hospital, 687 Pine Avenue West, H3A 1A1 Montreal Quebec, Canada 
ujendra.loimar@muhc.mcgill.ca 

Somatostatin receptors (SSTRs) have been identified in most hormone-producing tumors 
as well as .n breast cancer. In the present study, we determined SSTR1-5 expression in 
primary ductal NOS breast tumors through semi-quantitative RT-PCR and 
irru^unocytochemistry. The results from the analysis of 98 samples were correlated with 
several key h.stological markers and receptor expression. All five SSTR subtypes are 

JSfftjvSSt? ' eVd ^ brCaSt tUm ° IS With 91% of sam P ,es lowing 

SSTRl, 98% SSTR2, 96%SSTR3, 76% SSTR4, and 54% SSTR5 SSTR1-5 are 

locahzed to both tumor cells and the surrounding peritumoral regions as detected by 
immunocytochemistry. Levels of SSTR mRNA, when corrected for beta-actin levels 
were h.ghest for SSTR3 followed by SSTRI, SSTR2, SSTRS, and SSTR4. Furthermore 
°°n °" betW6en mRNA 3X1(1 P rotein «ptession with 84% for SSTRl', 
recenfl ^S', f ° 'fl*™*' 68% for SS ™W/» for SSTRS, and 78% for atl five 
receptors SSTRl, 2 and 4 were correlated with ER levels whereas SSTR2 showed an 
additional correlation with PR levels. These correlations were independent of patient age 
and h stological grade. Moreover, using immunocytochemistry, blood vessels exhibited 
receptor-specfic localization for SSTR2 and SSTRS. Our results indicate significant 
correlations between mRNA and protein expression along with receptor-specific 
correlauons with histological markers as well as ER and PR levels Differential 
distribution of SSTR subtypes in tumors and receptor-specific expression in vascular 
agonislT C ° nSldered 23 3 " OVel dia g" osis f< > r breast tumors with receptor subtype 
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A transcriptomic and protcomic analysis of the effect of CpG-ODN on human 
im -1 monocytic leukemia cells. 

KuoCC, KuoCW, Liang CM, LinngSM. 

Institute of Bio Agricultural Sciences, Academia Sinica, Taipei, Taiwan. 

The CpG motif of bacterial DNA (CpG-DNA) is a potent immunostimulating agent whose 
mechanism of actum ,s not yet clear. Here, we used both DNA mieroarray 

OniJT?^ T St,gafe *" of ^^yn-leotides containing^ CpoSfVCpG- 

ODN) on gene tanscnpt.on and protein expression profiles of CpG-ODN responsi ve THP-1 cells 
.Mieroarray analys.s revealed that 2 h stimulation with CpG-ODN up-regulated 50 gen^Ld 
down-regula ed five genes. These genes were identified as being asLiaTed with inSnTtl 
^microbial defense transcriptional regulation, signal transduction, tumor progrSsbTi U 
differentrntton, proteolys.s and metabolism. Longer stimulation (8 h with CpG-OD^n^anced 
transenpUonal expression of 58 genes. Among these 58 genes, none except one, nameryWn 
mducible s.gnahng pathway protein 2, was the same as those induced after 2 h Son 

identified several protems up-regulated by CpG-ODN. These proteins included heat shock 
proteins, modulators of inflarnmation, metabolic proteins and energy pathway proteins 
Comparison of mieroarray and proteomic expression profiles showed poor correlation Use of " 

S^^^-"^ " rCVerSe P0 ' ' merase ^TeactL 

Western blotting and functional assays, on several genes and proteins, nonetheless confirmed that 

freatment. This study also revealed that several anti-apoptotic and neuroprotective related 
proteins not prev.ously reported, are activated by CpG-DNA. These findingXYextended our 
Wledge on one activation of cells by CpG-DNA and may contribute to£^6^Zof 
mechan.sms that hnk innate unmunity with acquired immune response(s). ^tand.ng ot 
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A transcriptomic and proteomic analysis of the effect of 
CpG-ODN on human THP-1 monocytic leukemia cells 
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. approaches ,o investigate the efflct oW f ^ mim>amy md 

ODN» ™ „ w 1 °%^«»9™cIeorjdes containing the CpG motif ICnT 

and down-regulated fiv Re Ts Thet Pe J T f CpG ° DN "P-^ted 50 genes 
mation. 'n^U^^^^^fT^ **** ^ -flam- 

sion, cell differentiation pSSS?^ S,8nal «■»** P^gres- 

danced transcription/e^eSon j 5^ ^ P^O^OM* 
namely WNT1 inducible siZmfo a 8 Am ° ne ^ 58 g mes > except one. 

^ectrometry identified Zt^rZ up^ ^0^' * F» 

heat shock proteins modulator* „r ; n gW3,ed by C P G "9 I)N - P'otems included 

proteins. cL vm ZZ?^Z:i^^2t m - mM I—- - energy pathway 
Son. Use of more idi^le^TS! * T "P* 00 ? P'° fi ^ Poor ctaeb- 
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sion after CpG-ODN treatment Thk <L it correlation between mRNA and protein expres- 
roprotective related ^l^t ^'^ ^ -ti-apoptotic and neu- 
ings have extended our ^"^^1^ ^^ * ^ ^ ^ 
to further understanding of mech " S m S.TnT f " ^ ^ ™* """^ 
response(s). "* ranale """""lity with acquired 
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1 Introduction 

Manuals protect (hemselves agains, pathogen infection pri- 
manfy ma innate and adaptive immunity The innate 
unmunr system relies on a set of pattern recognition receptors 
\cg.. Toll-Ute receptors) to recognize foreign molecular struc- 
tures such as lipopolysaccharide (LPS) and bacterial DNA \7 3] 
Innate immune cells recognize these molecular structures and 
inrhate no. only innate but also adaptive immunity by produc- 
mg .mmunomodulatory cytokines and actJvatinP T and B 
.mmune cells Bacterial DNA can directly activate B cells to 
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proliferate and secrete immunoglobulins in a Tcell-independ- 
ent manner 14-6). It also induces B cells and monocytes to 
activate transcription factor NF-kB and secrete cytokines, 
imJuding interleukin (II) 12, tumor necrosis factor a (TNF-a), 
arid interferon cr/0 f7— 10J. The immurK>stimulatory activity of 
bacterial DNA has been assigned tounmethylated CpG motifs 
(GACGTTfor rnurine, GTCGTTfor human) [h| Recent evi- 
dence shows that synthetic oligodeoxynudeotides containing a 
CpG motif (CpG-ODN), like bacterial DNA with the CpG moi- 
ety (CpG-DNA), induce potent Thl -like immune responses 
that are protective against several infectious agents and 
immune disorders in animal models [12, 13]L Biologically active 
CpG-ODN, like bacteria DNA, actuates ' macrophages and 
immature dendritic cells to increase expression of MHC class 
Hand costimulatory molecules, thereby transcribing cytokine 
mRNAs, and producirig pro-inflammatory cytokines including 
TNFa.IL-1, and 1U2 1 ?, 14-16). CpG-ODN can therefore 
serve as an adjuvant and immunomodulatox in vaccines 
against a wide variety of targets, including infectious agents, 
cancer antigens and allergens |17J. 

It has been suggested that unmethylated CpG-DNA-medi- 
ated immune activation functions through a toll-like receptor 9 
(TLR9) signaling pathway |18). Endocytosis and sequentially 
endosomal maturation as well as bin ding of heat shock protein 
(HSP) 90 to CpG-DNA are essential for induction of TLR9 sig- 
nal transduction (19, 20). It has also been shown that recogni- 
tion of CpG-DNA causes TUR to form a dirrier, which recruits 
the adaptor molecule, myeloid differentiation factor 88 
(MyD88), through interaction between their C-terminal ToB fth- 
1R domains. This lecruirment of MyD88 to the ToB/lUR do- 
main of TLR9 initiates a signaling pathway that sequentially 
involves lL-lR-assooated kinase 1 and TNF-a receptbr-asso- 
dated factor 6 |18, 21, 22J. Studies using gene-deficient mice 
and RAW264.7 cells transiently transfected with dominant- 
negative forms of these molecules have indicated that ihe 
MyD88-mediated signaling pathway is essential for CpG-DNA- 
induced activation of NF-kB and c-Jun NH r terminal kinase 
(JNK), as well as subsequent production of cytokines in mono- 
cytic cells 1 18, 21, 22]. The precise mechanism of action of CpG- 
DNA and CpG-ODN. nonetheless, is still not thoroughly 
understood. To further elucidate the molecular events after 
binding of CpGODN to JTR9. in this study, we treated CpG- 
ODN responsive THP-1 cells with Cpt-ODN and evaluated 
changes by using DNA miooarray and proteomk approaches. 
We have discovered up-regulation of more than 50 disting- 
uished genes/proteins and identified induction of several anri- 
apoptoric and neuroprotecting genes by CpG-ODN treatment. 

2 Materials and methods 
2.1 Reagents 

Phosphorothioate-modified CpG-ODN and GpC-ODN were 
synthesized by MDBio (Taipei. Taiwan). Human specific 
ODN sequences are: CpG-ODN. 5' TCG TCG ITTTGT CGT 

© 2005 WILEV-VCH Verlag GmbH & Co. KGaA. Weinheim 



TTT GTC GTT-3"; GpC-ODN, 5 -TGC TGC TTT TGT GCT 
ITT GTG CTT-3'. The mouse specific CpG-ODN sequence is 
5VTCC ATG ACG TTC CTG ATQ CT-3\ CHCA was from 
Sigma (St Louis, MO, USA). 

2;2 Cell culture 

Cell lines were obtained from the American Type Culture 
CoUectioTi (Rockville, MD). Mouse RAW264.7 macrophage, 
and human embryonic kidney 293 cells (HEK293) were cul- 
tured in DMEM' supplemented with 10% heat inactivated 
fetal bovine serum, 100 U/mL penicillin, 100 ug/mL strep- 
tomycin sulfate, 200mmol/L I-gJutamine, and 50 um p-mer- 
captoethanol in a humidified atmosphere of 5% C0 2 at 37*C 
The medium was changed every 2 days for all experiments. ' 
Human THP-1 monocytic leukemia cells, which have been 
shown to express TLR9 and respond to CpG-DNA- stimula- 
tion [23, 24]; were cultured in RPMI1640 with the same sup- 
plements as for RAW264.7 cell cultures. 

2.3 Human cDNA microarray 

Total RNAs extracted from cultured THP-1 cells were isolated 
withTRlzol (lnvirrogen, Leek, The Netherlands) and submitted 
to Genasia Biotechnology (Taipei, Taiwan) for further proces- 
sing. In brief, 4 ug of total RNA from CpG-ODN stimulated, or 
normal THP-1 cells was labeled with a Ouofescence marker 
(U-vision, (Taipei, Taiwan)). Different colored fluorescence dyes 
(Cy5 and Cy3) were used to distinguish total RNA from normal 
and ODN stimulated cells. The labeled RNA was used for 
hybridization with the Human 1 cDNA microchip from Agilent 
Technologies {Palo Alto, CA, USA).The chips were scanned and 
the expression partem was analyzed using genechip software. 
Genes showing up-iegulation or down-regulation of RNA levels 
were analyzed and identified on a genomic database as sug- 
gested by the manufacturer of the micrcKhip. 

2.4 Protein preparation 

THP-1 cells were seeded in a 175 cm 2 tissue culture flask at a 
density of 10 6 cells per milliliter in culnue medium. The cells 
were stimulated with or without 1.5 um CpG-ODN at defined 
times and harvested by cenhifugation at 4°C. J 000 x g foT 
l5min. Cell pellets were washed twice with ice-cold PBS, 
icsuspended and sonicated in extraction buffer containing 
25 mM Tris-Hd (pH 7.5), 2mM p-mercaptoethanol and pro- 
tease inhibitor cocktail. After centrifrigabon at 10000 x gfor 
20 mm, ammonium sulfate was added to the supernatant until 
the 6nal concenbation reached 50% saturation w/v. The solu- 
tion was stirred at 4 6 C for 30min and centrifuged at 10 000 x g 
for 30min at 4°C The supernatant fraction was then trans- 
ferred into a fresh tube, and the precipitated protein pellet 
solubilized in extraction buffer. To remove salts and other con- 
taminants, the extracts were treated with a pre-cooled {~20°C) 
solution of 10% TCA in acetone with 0.07% p-mercaptoetba- 
nol. Proteins were allowed to precipitate overnight at -20°C 
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After cenrrifugation, the pellet was washed with ice<old ace- 
tone, containing 0.07% ^mercaptoethanoi. The supernatant 
was discarded and tbe peDet dried in a SpeedVac system (Model 
AESIOIO; Savant, Holbrook, NY, USA). 

2.5 2-DE 

2-DE was performed using an IPGphore 1EF and a Hofer 
DA1X vertical unit (Amersbam Biosciences, Piscataway* N), 
USA). One milligram of dried protein sample was dissolved 
in 350 uL of rehydration buffer solution, containing 7 m urea, 
2 u thiourea, 4% w/v CHAPS, 5 mM tributyj phosphine, and 
2% IPG and loaded onto an immobilized pH 3-10 linear 
gradient strip (18cm), followed by rehydration for 16h. 1EF 
was then performed in tbe following manner 100 V for 
30min, 250V for 30rnin, 500V for 30min, 1000V for * 
30min, 4000 V for 30min, 6000 V for 55 000 Vh. At the end 
of IEF, tbe IPG strips were equilibrated for 15 min in buffer 
containing 6 m urea, 2% w/v SPS, 30%v/v glycerol, and 
50 mM Tris, pH 6.8, then reduced with 65 mM dithiorery- 
thritol (DTE) and subsequently alkylated with 135mM 
iodoacetamide for another 3 5 min. After equilibration, the 
IPG strips were immediately placed on top of a 12% SDS^ 
PAGE (1.5 mm,. 20x24 cm). The second dimension gels 
were then overlaid with molten 0.8% agarose solution in 
SDS electrophoresis buffer. Electrophoresis was performed 
at I6°C, starting at 10mA per gel for 1 b, followed by 45mA 
per gel untfl the dye front reached th e bottom of the gels. 

2.6 Staining and image acquisition 

Immediately after electrophoresis, gels were stained with 
SYPRO Ruby (Molecular Probes, Eugene, OR, USA). Jn 
brief, gels were fixed for 30min in 10% methanol, 7% acetic 
add, and then stained overnight in SYPRO Ruby stain. Tbe 
staining solution was removed and gels were washed in 30% 
methanol and 7% acetic acid for 3 h. After staining, image 
acquisition was carried out on a Typhoon 9200 (Amersbam 
Biosciences). To identify a protein, spot detection, quantifi- 
cation and matching of 2-D Jesuits were analyzed using ' 
JmageMaster software (Amersbam Biosciences). The M, of 
the proteins weie calibrated accojding to the LMW-St>S 
Marker Kit (Ameisham Biosciences), and their pi values 
were estimated fiom the position of the protein spots on the 
2-D gel and confirmed with the information supplied by the 
manufacturer. Since most of the pi values foi the truncated 
proteins had not been reported previously, the pi values of 
the truncated proteins were estimated from the position of 
the observed spots. To omit the variation due to the use of 
separate gels, after background subtraction, tbe intensity 
levels of protein spots on each gel were normalized as a pio- 
portion of one reference spot, and protein quantities were 
calculated by integrating the density over the spot area. Pro- 
tein spots that showed reproducible modulation exceeding 
-80% after CpG ODN treatment in tbiee experiments were 
further analyzed by MS. 



Proteomics 2005, 5 t 894-906 

2.7 in-gei digestion with trypsin arid extraction of 
peptides 

Protein spots were excised from stained gels and cut into 
pieces. In brief, gel spots were dehydrated with ACN for 
lOmin and dried in a vacuum centrifuge. Gel pieces were 
reswelled with 55 mM DTE in 25 mM ammonium bicarbon- 
ate (pH 8.5) at 3rC for 1 h. The solution was then exchanged 
with afkyhtion solution, which contained lOOrriM iodoaceta- 
mide in 25 mM ammonium bicarbonate (pH 8.5), at room 
temperature for th. After- alkylati on, the. gel pieces were 
washed twice with 50% ACN in 25 m M ammonium bicarbo- 
nate (pH 8.5) for 15 min. The wash solution was discarded 
and the pieces of gel were dehydrated with ACN for lOmih 
and dried in a vacuum centrifuge. Tryptic digestion was 
iniriated by resweffing the gel in 25 mM ammonium bicar- 
bonate solution with 25 ng of trypsin- (Prornega, Madison, 
WI, USA). After incubation at 37°C for 16h, tryptic peptides 
were extracted twice with 50% ACN containing 5% formic 
acid for 15 min with moderate sonication. The extracted 
solutions were pooled and evaporated to dryness in a vacuum 
centrifuge. The dried peptide mixture was dissolved in 0.1% 
formic acid and used for MS. 

2.8 MALDI-Q-TOF MS and protein identification 

Tryptic peptides analyses were performed using a Miaomass 
Q-TOF Ultima MALDI {Micromass, Wytherishawe, U.K.) 
equipped with a 337 nm nitrogen laser and operated in 
reflection positive ion mode. Peptide mixtures (1 uL) were 
piexnixed with 3 pL of the matrix (5mg CHCA in 50% ACN 
with 0.1% TFA) then spotted onto the MALDI target plate. 
Mass spectra were acquired for tbe mass range of 900- 
3500 Da and the individual spectra from MALDI MS or MS/ 
MS were processed using the Micromass. MassLynx 4.0 soft- 
ware. The generated peak list files were used to query the 
Swiss-Prot database using the MASCOT program (http:// 
www.matrixscience.com) with the following parameters: 
peptide mass tolerance, 50ppm; MS/MS ion mass tolerance, 
0.25 Da; allowance of missed cleavage, 1; and consideration 
for variable modifications such as oxidation of methionine 
and carboxyamidomethylation of cysteines. Only significant 
hits as defined by MASCOT probability analysis were con- 
sidered initially. In addition, when the PMF matches were 
between 5 and 9, at least one peptide sequence was manually 
checked by MALDI MS/MS analysis. 

2.9 RT-PCR analysis 

cDNA from THP-) cells was produced with Superscript Jl 
reverse transcriptase (Jnviriogeri) using a oligo(dT) )5 primer 
for 1 h at A2°C. PCR of cDNA was performed using specific 
primers for the gene of inteiest and control 0 actin. All PCR 
products were electrophoresed on a 1.5% agarose geJ, and 
DNA bands were visualized by staining the gel with ethi- 
diurrj bromide. 
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Z10 Immunoblotting 

Human THP-1 or mouse macrophage RAW264.7 cells 
(5 x 3 x lefyweJ]) were cultured in a six-well culture 
plate and treated with or without 13 um CpG-ODN for 
the. designated times. After stimulation, cells were harvest- 
ed by centrifugation at 1000 x g for ISmiri in a refri-. 
gerated centrifuge and washed twice with cold PBS buffer. 
The cells were fysed on. ice for 15rnin with 300 pL lysis 
buiTer (Pierce, Rockford, USA), supplemented with pro- 
tease inhibitor cocktail (Sigma). The fysates were cen- 
trifuged at 12000 x g for ISmin at 4°Q and protein 
concentrations of supernatant were determined using the 
Bio-Rad Protein Assay (Hercules, CA> USA). The fysates 
(50 ug of protein/lane) were subjected to 12% SDS-PAGE 
and transferred to NC membranes (Amersham Bio- 
sdehces). The membranes were blocked in PBS-0.1% 
Tween 20 (PBST) containing 5% non-fat skim milk at 
room temperature for 1 h, followed by staining with anti- 
ADP-ribosylation factor 3 (ARF-3) monoclonal antibody. 

Mg/mL; Sigma). The membranes were then incubated 
with horseradish peroxidase-confu gated secondary anti- 
body (dilution, 1:3000) for Ih. After washing three times 
with PBSX specific bands were detected by chemihimi- 
nescence according to the manufacturers protocol 
(Amersham Biosciences). - 

2.11 Cell transfeclion and lucif erase assay 

HEK293 cells (5 x 3 x jO'/weil) were transfected using 
FuGENE 6 (Roche Molecular Biochemicals, Indianapolis, 
IN, USA) plus 0.1 ug p5xNF-KB-luc (Stratagene, La Jolla, CA. 
USA), 0.05 ug pCDNA3.1-P-gaiadosidase, and pCDNA3.1- 
hTLR9 overnight The cells were incubated with or without 
1.5 um CpG-ODN for 8h and then lysed. NF-kB luciferase 
activity assays were performed as recommended by the 
manufacturer (Promega). f*-gaIactosidase activity was used to 
normalize the data. 

2.12 Enzyme activity assay 

Pyruvate kinase activity was assayed in a solution (1 mL) 
containing lOOmM Tris-HCl (pH 8.0), lOOmM KG, lOmw 
MgCl* 0.2 mM NADH, jOjtim PEP, 1.5 mM ADP, lunit of 
lactate dehydrogenase, and an appropriate amount of cell 
lysate from CpG-dDN untreated or treated THP-1 cells. The 
reaction was monitored at 30°C for a period of time by 
measuring the decrease in absorbance at 340nm\ PGK ac- 
tivity was assayed in a coupled reaction with glyceraldehyde 
3-phosphate dehydrogenase (CAPDH) as described by Lee 
(25). In brief, the assay was performed at 30°C in a total vol- 
ume of 1 mL containing JOOmM Tris-HCl (pH 7.9), 10mM 
MgCl?, 0.15 mM NADH, 2mM ATP, 6mM 3-phosphoglyce- 
rate, 0.1 mg/mL BSA, 50 mg ofGAPDH r and an appropriate 
amount of cell lysate. NADH consumption was monitored at 
340 nm. 



3 Results 

3.1 Effect of CpG-ODN on gene expression profiles of 
human THP-1 cells 

To elucidate the elTect of CpG-ODN on gene expression, 
THP-1, a 'cell line known to express TLR9 and respond to 
CpG-DNA (23, 24) was cultured with or without CpG- 
ODN. Since preliminary experiments showed that 15 um 
CpG-ODN caused more contrasting results between nor- 
mal and CpG-ODR treated cells, 1.5 um CpG-ODN was 
used throughout the experiments. To distinguish CpG- 
ODN treated samples from the control, total cellular RNA 
of normal and CpG-ODN treated cells was isolated and 
labeled with the fluorescence dyes> Cy5 and Cy3, respec- 
tively. The labeled RNA was then used for hybridization 
with a Human 1 cDNA microchip from Agilent Technol- 
ogies. Of the 13 000 human, genes represented on the 
gene array, a total of .55 genes changed expression sign- 
ificantly after 2 h CpG-ODN treatment Among these, 
50 genes were up-regulated while five genes were down- 
regulated by a factor > 2. These genes were sorted by 
functions and are listed in Tables 1 and 2. They included 
notably, 1L-18 receptor accessory protein, MSG A beta 
gene, thioredoxin, pro-pol -dUTPase polyprotein, Spl40; 
connexin 59 gene, Grb2-like 2, en oyl- coenzyme A hydra- 
tase, propionyl coenzyme A carboxylase, cytochrome P450, 
and WNT1 inducible signaling pathway protein 2 (W1SP- 
2) etc. The function of these genes are known to be relat- 
ed to inflainmatory responses, antimicrobial defense, 
transcriptional regulation, intracellular signal transduc- 
tion, tumor progression, cell differentiation, proteolysis 
tic 

Table 1. Genes up-regulated \> 2 fold) in human THP-1 cells after 
2 h of CpG-ODN treatment 



Gene name and description 
(changed fold > 2) 



Genebank 
number 



Inflammation and receptor 

JL-18 receptor accessory protein AF077346 

T cell receptor V beta gene X58806 

MGSA, beta gene U03019 

Antigen gene {PA) M21896 

TIED NM_004791 

Platelet activating receptor AF002986 

Antimicrobial defence 

Thioredoxin NM_003329 

Pro-Pol-dUTPase polyprotein AC004748 

Nuclear body protein Spl<10 U63420 

Transcriptional regulation 

Putative transcription factor LUZP AI986271 
General transcription factor II, I, pseudogene 1 AI700706 

Connexin 59 gene L29277 

Basic helix- loop-helix protein class B 1 AF221520 
tBHLHBl) 
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Table 1. Continued 



Gene name and description 
(changed fold> 2) 



Genebank 
number 



. Regulatory protein 
AdviHin 

Channel and transport . 
Small GTP binding protein Rab9 
FXYD domain-containing ion transport 
regulator 1 

Signal transduction 
SH3-d6main Grb2-like 2 
Tilin 

Titin associated protein (165 kD protein) 
KIAA1451 protein 

Vasoactive intestinal peptide receptor 

Enzyme and protease 

E.noyl-Coenzyme A, hydratase 

Nephrin, B-rype metallocarboxypeptidase 

Propionyl Coenzyme A carboxylase . 

Acyloxyacyl hydrolase (neutrophil) 

Cytochrome P450 

Intestinal alkaline phosphatase 

Tumor progression and cell differentiation 
Retinoblastoma V 

Human genomic DNA of 9q32 anti-oncogen e 

of flat Epitherium cancer, 

segment 6/10 , 
WNTl inducible signaling pathway protein 2 

(WISP-2) 

Structure protein 
Collagen, type IV, alpha 6 
Beta (Myosin heavy chain 

Other 

Homo sapiens Cri-du-chat region mRNA, 

clone N1BB11 
Human mRNA for lamtnin alpha 5 chain, 

partial cds. 
NIK like and Thyroxin-binding 

globulin precursor 
Hypothetical protein DKFZp434M0331 
Hypothetical protein FLJ1102 1 similar ; 

to splicing factor 
Hypothetical protein 
Chromosome 18 open reading frame 1 
Arfapiin 1 

Zinc finger protein 8 (2FP8) 
Zinc finger protein 137 (clone pHZ-30) 
OHactomedin related ER localized protein 
Cyclin-dependent kinase 8 
Iniegrin, alpha 1 
KIAA0421 protein 
KIAA1233 protein 

Unnamed protein product 
NBL4 

BC33l191_l 



AF041443 

U44103 
All 25364 



AF036268 

X90568 

X69089 

AB040884 

U11087 



AI800553 

U65090 

AB011145 

M62840 • 

U79716 

M31008 

L11910 

AB036268 



AF 100780 



D21337 
M58018 

U52827 

AB010099 

Z83850 

AL137720 
AK023985 

AL 049851 

NM. 004338 

AW408785 

M29581 

U09414 

AI738468 

BE 4 67537 

D87462 

AB007881 

AB033059 

AK026362 

X75535 

AAD39268 
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Table 2. Genes clown, regulated (> 2 fold) in human THP-1 cells 
after 2 h of CpG-ODN treatment 



Gene name and description 
(changed fold > 2) 



Genebank 
number 



Zinc-finger homeodomain protein 4 . . . . BAB03600 

: Human protein kinase MEKK2b n>RNA, AF23979B 
complete cds. 

GlypicanB mm3 
Human genomic DNA, chromosome 22q11.2, AP000362 
clone N75A12. 



Collagen, type I, alpha 1 



274615 



linger stimulation of THplj ceils with CpG-ODN (8h) 
resulted in the up-regulation of 58 genes, Tnese genes in- 
eluded notably IL-10 receptor beta, formyl peptide receptor, 
like a (FPRL1), vitamin D receptor, nuclear receptor sub- 
family 1 (UR), early B-c ell factor, protein Icmase C gamma 
(PKCJ, Nclc, Ash. phospholipase C binding protein (NAP4), 
phosphoriboxyl pyrophosphate a rmd ©transferase, dishev- 
eled 3, WISP-2 etc. Analysis of the functions of the 58 up- 
regulated genes showed that they are associated with anti- 
mflarnmation, transcriptional regulation, intracellular signal 
transduction, tumor progression, cell differentiation, pro- 
teolysis, neurodegeneration, neuroprotection etc. {Table 3).. 
We also found that the stimulation of THP-1 cells with CpG- 
ODN for different periods of times resulted in different pro- 
files. Several defense related genes such as 1L-38 receptor 
accessory, protein, Pro-Pol-dUTPase polyprotein, SpHO and 
connexin 59 were transiently up-regulated at 2h short stira- 

Table 3. Genes up-regulated (> 2 fold) in human THP-1 cells after 
8 h of CpG-ODN treatment 



Gene name and description 
(changed fold > 2) 



Genebank 
number 



Inflammation and receptor 

IL-10 receptor {beta) 

Formyl peptide receptor-like 1 (FPRL1) 

Vitamin D receptor 

NMDARl 

CD44 antigen 

Nuclear receptor subfamily 1 (LxR) 
Neuromedin B receptor 

Transcriptional regulation 

Early B-cell factor febf) 

Neurogenic differentiation 1 (Neuro D) 

MAX dimeri2ation protein (NESH protein) 

Ribosomal protein S6 Kinase 

ASH2L 

Regulatory protein 
Hypothetical prolein DKF2p434H0820 
Peroxisomal farnesylated protein 
LTBP4 * 

Neuronal peniraxin II 



U08988 

AF081535 

J03258 

232774 

AW028346 

NM_005693 

M73482 



AF208502 
AB018693 
AB037886 
AF090421 
AB022785 

AL 137555 
X75535 
AF 051 34 4 
U29195 
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Table 3. Continued 



Gene name and description 
(changed fold > 2) 



Channel and transport 
Gamma-aminobutyric 161 J A receptor 
ATP synthase sub unit F6 * 
transient receptor potential channel 1 
Choroideremia (Rab escort protein 1) 

Signal transduction 
Protein Jtinase C, gamma 
Regulator of G-protem signaling 5 
Nek, Ash and phospholipase C binding . 
. protein INAP4) 

Highly similar to adeylate kinase gene 

Enzyme and protease . 
Phenylalanine hydroxylase 
Cor boxy peptidase AT 
Xylulokinase 
Pancreatic lipase 
Ubiqurtin specific protease 12 
Transmembrane protease, serine2 
Aspartate bato- hydroxylase 
Phosphoribosyl pyrophosphate 
amidotransferase 

Tumor progression and cell differentiation 

CDC 23 

WISP-2 

Microseminoprotein, beta 
Dishevelled 3 

Structure protein 

■ Tricbohyalin 
Kertain 

Other 

Human transferrin pseudogene 
TIMP-2 

Collagen-like protein 

Human genomic DNA, chromosome 21q, 

section 60/105 
Human genomic DNA chromosome 2lq, 

section 64/105 
KIAA0136 
KIAA0379 
KIAA04S9 

KIAA1114 J . 

KIAA1451 

KIAA0756 

Zinc finger protein 267 

Hypothetical protein FU10633 

Hypothetical protein EUROIMAGE 1955967 

Myb1 bomolog like 1 

Anti2yme inhibitor 

Disinteg tin-like and metalloprotease 

Ireprolysin type) with Thiombospondin 

type 1 motif, 3 
ADP-ribosylation factor 3 fARF-3J 
Testis specific protein, V-linked 
Unnamed protein 



Genebank 
number 



IMMJ>04961 
M37104 
Z73903 
X57637 

Z15114 

AI674877 

AB005216 

AB016886 

AA203389 

X67318 

AK001205 

J05125 

AF022789 

U75329 

U03109 

D13757 



AF053977 
AF1 00780 
M34376 
NM_004423 

L09190 
AF061809 

M22376 
U44383 
U67921 
AP001716 

AP001720 

D50926 

AB002377 

AB007958 

AL049732 

AB040884 

AB018299 

AF220492 

AK001495 

AK026108 

AK001893 

D88674 

AB002364 
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Table 4. List or antimicrobial and antH inflammatory genes 
modulated by CpG-ODN treatment of THP-1 cells 



Gene name " 


Genebank 


Expression fold ' 




. number 










2b 


8h 


Connexin 59 gene 


L29277 


2.12 ±0,05 


1.74 ±0.08 


IL-18 receptor 


X58806 


2.32 ± 0.21 


1.19 ±0.13 


accessory protein 








Integrin, alpha 1 


X68742 


2.01 ± 0.03 


1.05 ±0.10 


Nuclear body protein 


U63420 


2.22 ± 0.11 


1.39 ± 0.04 


Sp140 








Pro-Pol-dUTPase 


AC004748 


2.33 i 0.18 


0.95 ±0.03 


pofyprotein 








Thioredoxin 


NM_003329 


2.20 ± 0.08 


1.07 + 0.01 


FPRLi 


AF081535 . 


0.95 ± 0.06. 


2.13 ±0.11 


IL-10 receptor 


U08988 


1-22 ± 0.18 


2.21 ± 0.07 


LxR 


NM_005693 


0.90 ± 0.03 


2.37 ±0.31 


Vitamin D receptor 


J03258- 


1.37 ± 0.23 


2.39 ±0.11 



Expression fold is designated as the ratio of CpG-ODN treated 
over control 



uJation but were down-iegubted thereafter, while anti- 
inllamrnalory associated genes such as FPRU, 1L10 recep- 
tor, vitamin D receptor and LxR were up-regulated after 8h 
stimulation (Tables J, 3 and 4). 

3.2 Verification of the microorray results with RT-PCR 
or Western blotting 

To verify the results from the naicroarray analysis,, we also 
performed RT-PCR on the up-regulated genes (Fig. 3). Con- 
sistent with results obtained in the microaTray gene expres- 
sion analysis, RT-PCR studies showed that the rnRNA levels 
of some selected genes, including ubiquitin specific pro- 
tease 12, regulator of G -protein signaling 5, NAP4 and 
ASH2L, were increased in response to CpG-ODN (Fig. 1, 
Table 3). In addition, the protein expression level of ARF-3 



- GpC-ODN 



CpG-ODN 



Untreated 



Ubiquttm specific 

Regulator of G- 
prct»ir> signaling 5 j 

NAP4 



M74491 
M98525 
AK02604? 



Figure 1. (nduclion of various genes by CpG-ODN. THP1 cells 
were stimulated with medium alone, 1.5 pw GpC-ODN (as the 
negative control) or CpG-ODN for the indicated times.-RT-PCR 
was then performed to analyze gene expression levels. f>actin 
wds used as on internal control. The experiment was repeated 
three times with similar results. 
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Untreated 



(B) RAW 



ARF-3 




Actin 



Figure 2. CpG-ODN induced ARF-3 protein expression in human 
THP-1 or mouse RAW264.7 cells. THP-1 |A> or mouse RAW264 7 
(B) cells were incubated with 1.5 jjm CpG-ODN for the indicated 
t>me potnts. The protein expression level of ARF-3 was deter- 
mmed by Western blotting of cell extracts using anti-ARF-3 anti- 
body. The experiment was repeated three times with similar 
results. " 

was shown to increase in Western Wotting analysis in cell 
lysates from THP-] cells treated with CpG-ODN for 8-24 h 
(Kg- 2A). Similar studies showed that the ARF-3 protein was 
also induced by mouse specific CpG-ODN in other TLR9 
expression cell Knes such as the mouse macrophage 
RAW264.7 cell line (Hg 2B). 

3.3 Proteins regulated in CpG-ODN stimulated THP-1 
cells 

To further assess whether there was any correlation between 
regulation of gene expression and expression of cellular pro- 
teins, a proteomic approach was adopted to identify protein 
expression profiles. THP-1 cells were treated with CpG-ODN 
for defined rimes (from 8 to 40h). and their cytoplasmic 
proteins were extracted for 2-DE analysis. Although the use 
of high concentrations of urea might give us a broader view 
of all the proteins afTected by CpG-ODN, preliminary results 
from 2 D gels showea that the resolution of the protein mix- 
tures were not satisfactory. To improve and get the best reso- 
lution from 2-DE. total proteins were roughly separated into 
supernatant and precipitated fractions using 50% saturated 
ammonium sulfate solution. To remove salts and other con- 
taminants, both protein fractions were precipitated with TCA 
solution and then subjected to 2-DE. By protein spot deter- 
mination analysis, about 500 and 450 well-resolved spots 
were observed on each pH 3.0-10.0 gel for precipitated or 
■supernatant fractions, respectively/Comparative analysis of 
2-DE between treatments and control showed that the inten- 
sities of the protein spots from the ammonium sulfate pre- 
cipitated fraction did not change, while several protein spots 
were up-regulated by at least -80% in the supernatant frac- 
tion of 8h CpG-ODN stimulated THP-1 cells (Fig. 3). 

The piotein spots were individually excised from gels for 
further idennficalion. After trypsin digestion, several protein 
spots were identified without ambiguity by MS MALD1- 
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Figure 3. The effect of CpG-ODN on the 2-DE profile of THP-1 
cells, THP-1 cells were treated with or without 1.5 um CpG-ODN 
for 8h, Total proteins were extracted and roughly separated into 
two fractions by 50% saturation with ammonium sulfate The 
supernatant fraction was then separated by 2-DE. Protein spots 
were visualised by SYPRO Ruby staining. Comparison of CpG- 
ODN treated THP-1 cells to untreated cells showed thai these 
proteins changed in intensity by over 80%. Protein spots were 
identified by trypsin digestion and MS. Localization of protein 
spots 1 (enoiy-coerizyme A hydratase), 2 tproteasome a) and 3 
(cyclophilin A; two isoforrns) are shown. The experiment was 
repeated three times with similar results. 

TOF. These proteins included HSP60, HSP90, cydophi- 
lin A, enoyl-coenzyme A hydratase, eukaryotic translation 
elongation factor, proteasome a and 0 chain and ATP syn- 
thase beta chain (Table 5). , Similar experiments on cells 
treated for a longer period of time with CpG-ODN stimu- 
lation (25 h) revealed that 27protein spots were changed in 
intensity by at least -8096. These protein spots contained 
members of HSPs (HSP27, hsc70, grp78 and grp<M), 
metabolic enzymes (phosphoglyceraie kinase (PGK) and 
pyruvate kinase (PYK)), macrophage capping protein and 
cyclophilin A (Table 6). Among these proteins, macrophage 
capping protein, PGK, PYK, cyclophilin A and HSP27 
(Figs. 4 and 5 A) were found to be up-regulated. Interest- 
ingly, we found that a truncated form of grp78 with an 
expected mass of 25 kDa and p/ of 5.3 was up-regulated 
while grp78 itself was dowrwegulated. A similar situation 
was also found for grp94 and hsc70 and their truncated 
derivatives (Table 7 and fig. 5). In addition, we also ob- 
served six down-regulated protein spots on 2-D gels in 
samples after 25 h CpG-ODN treatment. Among these six 
proteins, we have successfully identified three as 40s ribo- 
soma) protein SA, grp78 and hsc70," respectively (Table 6), 
while the other three, due to their relative low abundance, 
have not been identified yet. 
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Table 5. List of proteins modulated by 8 h CpG-ODN treatment 



901 



Protein nbme 


Accession 


Mr . 




Matched 


Cover- 


Scare 


Expression 




no. 


(theor.) 


(theor.) 


no. 


age% 




fold 


ATP synthase beta-chain git 1 4549 


56 525 


5.26 


16 


58 


171 


2.33 ± 0.06 . 


Cyclophilin A 


P05092 


17 870 


7.82 


5 


35 . 


62 


2.85 ± 0.13 


Enoyl-Coenzyrhe A 


* g»4503447»* 1 35 971 


6.61 




44 


62 . 


2.52 ±0.05 


hydrolase 














Eukaryotic translation 


gi4503481 


50 087 


6.25 


6 


25 


68 


3.41 ±0.21: 


elongation factor 
















HSP60 


pioso^ 


57 963 


5.24 


13 


27 


76 


2.78 ± 0.03 


HSP90-beta 


P08238 


83133 


4.97 


10 




65 - 


2.36±o;t0 


Proteasome a chain 


9*4506181 


25 882 


6.92 


11 


59 


80 


2.52 ±0.11. 


Proteasome p chain 


gi4506193 


26472 


8.27 


9 


46 


84 


3^5 ±0.17 



Expression fold is designated as the ratio of CpG-ODN treated over control 
a). NCBI accession number 
. b) Swiss-Prof accession number 



Table 6. List of proteins modulated by 16 and 25 h CpG-ODN treatment 



Protein name 


Swiss-Prot M r 




Match- 


Cover- 


Score 


Expression fold 




no; 


(theor.) 


(theor.) 


ed no. 


age . 




16 h 


25h 


Cyclophilin A 


P05092 


17 870 


7.82 


5 


35 


62 


2.53 ± 0.02 


2.48 ±0.15 


78kDa glucose regulat- 


P11021 


722B8 


5.07 


13 


30 


148 


0.61 ± 0.04 


0.29 ± 0.06 


ed protein (grp78) 


















HSP27 


P04792 


22768 


5.98 


12 


59 


124 


1.00 ± 0.01 


2.61 ± 0.12 


Heal shock cognate 


P1 1142 


70 854. 


5.37 


16 


34 


114 


0.64 ± 0.01 


0.31 ± 0.03 


70kDa protein 


















(hscJO) 


















Macrophage capping 


P40721 


38 494 


5.88 


9 


30 


58 


1.31 ±0.01 


2.58 ± 0.03 


protein 


















Phosphoglycerate 


P00558 


44 284 


7052 


11 


33 


71 


2.32 ±0.11 


4.23 ± 0.13 


kinase 


















Pyruvate kinase 


P14618 


57 710 


7.95 


17 


32 


114 


1.65 ± 0.07 


2.70 ±0.12 


40s ribosomal protein 


P08865 


32 833 


4.79 


5 


23 


61 


0.33 ± 0.02 


0.35 ± 0.05 



SA (RSP40) 



Table 7. List of truncated proteins detected in THP-1 cells after 25 h CpG-ODN treatment 



Protein name 


Swiss-Prot 






Matched 


Cover- 


Score 


Expression 




no. 


(obs.) 


fobs.) 


no. 


age% 




fold 


94kDa glucose- regulated 


PI 4625 


• -59700 


-5.00 


12 


14 


104 


New* 1 


protein (grp94) 
















Truncated form of grp78 


P11021 


-25000 


-5.30 


12 


22 


75 


New 


Truncated lorm of hsc70 


P11142 


-22000 


-5.80 


12 


20 


96 


New 


Truncated form of hsc70 


P11142 


-19 000 


-6.10 


11 


18 


113 


New 



signaled proteins detected in the CpG-ODN Heated gel but not in the corresponding control gel 



of genes and their corresponding proteins (Table 8), sug- 
gesting that more in-depth studies were needed. To further 
evaluate whether changes observed In protein expression 
correlated with changes in mRNA levels, we randomly chose 
two proteins (PGK and PYK) that were induced after 16b 
CpG-ODN treatment and determined their mRNA levels by 



3.4 Comparison of microarray and proleomic results 

Table 8 shows the expression of six genes and theii corre- 
sponding proteins that were modulated by 8 b treatment of 
THP-] cells with CpG-ODN. Besides enoy J- coenzyme A 
hydratase, there was poor correlation between the expression 
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Figure 4. 2-D gel electrophoretic anafysis of CpG-ODN- treated 
THP-1 cells. (A) Total cell protein from unstimulated THP-1 cells 
was subjected to 2-DE,(B) THP-1 cells were treated whb or with- 
out 1.5 P M CpG-ODN for defined times. Cellular proteins were 
extracted and separated by 2-DE. Several up-regulated proteins 
are shown in the SYPRO Ruby staining geh Comparison of CpG- 
ODN treated. THP-1 cells, to untreated cells showed that these 
proteins changed in intensity by over 80%. The experiment was 
repealed three times with similar results. 



|>27 hD» 




Figure 5. 2-D gel electrophoretic anolysis of CpG ODN-treated 
THP-1 cells. THP-l cells were Ireared with or without 1 5 M m CpG- 
ODN for defined limes. Cellular osteins were extracted and 
separated by 2-DE. Piotein spois were detected by SYPRO Ruby 
sta.n.ng. (A) Expression of HSP27 was induced by increasing the 
period of CpG-ODN stimulation A truncated form of hsc70 was 
detected on .he gel. |B) The native form of gr P 78 was detected in 
untreated cells, while the truncated form of 9 rp78 was observed 
after 25 h stimulation. The experiment was repeated three times 
with similar results. 
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Table a Comparison of gene and protein expression levels in 
. THP-1 cells after 8 h CpG-ODN treatment 



Protein name 



Gene 

expression 
fold from 
microarray 



Protein 
expression 
fold from 
2-D gel 



Enoyl-Coenzyme A hydrolase 
Eukaryotic translation 

elongation factor 
HSP60 . 
HSP90-beta 
Proteasome a chain 
Proteasome p chain 



1.72 ± 0.31 
0.95 ± 0.01 

0.94 ± 0.07 
1.58 ±0.11 
0.98 ± 0.03 
1.09 ±0.11 



2.52 ± 0.05 
3141 ± 0.21 

2.78 ±0.03 
2.36 ±0.10 
2.52 ±0.11 
3.85 ± 0.17 



RT-PCR. Our results showed that rnRNA levels of PYK 
increased after 16b. CpG-ODN treatment, while mRNA 
levels of PGK were dramatically increased after 24h stimu- 
lation (Kg. 6). In addition, we also performed enzyme activ- 
ity analysis and found that the activity of PGK and PYX were 



3.5 
3 

£ 2 -5 



(A) 

D GpC-ODN 
© CpG-ODN 




if 



24 



Time (h) 



(B) 

□ GpC-ODN 
0 CpG-ODN 



3 

_ 2.5 

ID 
O 

— 2 
^ 1.5 

CO ' 

to 

> 

or 

D.5 



4 8 
Time {h) 

Figure 6. Activities of PYK and PGK induced by CpG-ODN. THPl 
cells were stimulated whh medium atone. 1.5 m m GpC-ODN (as 
the negative control) or CpG-ODN for the indicated times. Ceil 
lysales were extracted and assayed tor (A) PYK and (D) PGK ac- 
tivities. Data lepresent mean z SEM. [n= 3). 
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GpC-ODN CpdODM 
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Kinase 
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Figure 7. Induction of PYX and PGK transcripts by CpG-ODN. 
THP1 celts were stimulated with medium alone, 1.5 jim GpC-ODN 
or CpG-ODN for the indicated, times. HT-PCR was then performed 
to analyze gene expression, fcactin was used as an internal con- 
trol. The experiment was repeated three times with similar 
results. 



indeed increased by a factor -25 after Cj>G-ODN stimula- 
tion (Kg. 7). To confirm that mRNA induced by CpG-ODN 
would also be accompanied by an in a ease in protein 
expression even though it was not detected in 2-D gel anaxyr 
sis, we used more sensitive and specific Western blotting 
analysis. As shown in Table 3 and Fig. 2, ARF-3 was identi- 
fied in the micrdarray gene profile hut not in the proteomic 
expression profile. Nevertheless, we observed enhanced pro- 
tein expression of ARF-3 after CpG-ODN stimulation by 
Western blotting. Moreover; to investigate whether the up- 
regulation of ARF-3 by CpG-ODN is mediated through the 
TLR9 pathway, TLR9-de£cient HEK293 cells were transiently 
cotransfected with hTLR9 and luciferase-reporter gene driv- 
en by a NF-KB-dependent promoter. Our data showed that 
NF-xB activity of untransfected H£K293 cells were not 
responsive to CpG-ODN stimulation, while in transfected 
HEK293 cells expressing hTLR9, NF-kB luciferase activity 
was up-regulated 12-fold after 8h CpG-ODN stimulation. 
The activation of NF-tcB induced by CpG-ODN was blocked 
by. pretrearment of the transfected cells with an ARF-3 in- 
hibitor, such as brefeldin A (Fig. 8), suggesting CpG-ODN 
induces ARF-3 and activates NF-kB after the interaction of 
CpG-ODN with TLR9. 



4 Discussion 

In this study both miooanay and proteomic approaches 
were used to evaluate the effect of CpG-ODN on gene/pro- 
tein expression profiles of THP-1 cells at several time points. 
Comparison of the gene expression prohles showed that 
stimulation of the cells with CpG-ODN for different periods 
of time resulted in different profiles (Tables The differ- 
ences in mRNA expression between the cells with short and 
long stimulation could be attributable to the low reproduci- 
bility. However, to avoid experimental variations, we not only 
used the same batch of miooanays from the same manu- 
facturer but also applied the samples of short and long term 
stimulation at the same time. Jn this way, we found that the 
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changes in expression fold of mRNA after CpG-ODN treat- 
ment were quite reproducible as shown by their mean 
±. SEM (Table 4). A more likely explanation for the difference 
in the expression level of mRNA after different periods of 
stimulation with CpG-ODN is that the transient increase or 
decrease in these mRNA by. CpG-ODN plays a significant 
role in mbciulating biological functions. For example, we 
found that the IL-18 receptor accessory protein from THP-1 
cells was up-regubted after 2h of CpG-ODN stimulation. 
The 1WS/IL18R system is known to activate Thl-mediated 
immune responses that play a critical role in host defense 
against infection [26J. Together with IL-18/1U8 R, several 
genes for antimicrobial defense were also increased, includ- 
ing thioredoxin, Prc>-Po)-dUTPase j>olyproteiri and Spl40. 
After 8h of CpG-ODN stimulation, however, none of these 
genes was activated any more (Table 4); Since sustained or 
excessive production of these antimicrobial molecules might 
lead to inflammation and cellular damage [27], a plausible 
explanation is that THP-1 cells fight against the invasion of 
pathogens by up-regulating antimicrobial defense-associated 
genes at an early stage of stimulation and then shut them 
down to avoid over-activation. Whether this explanation is - 
true remains to be verified. 

it is noteworthy that our data also identified the up-ieg- 
ulation of several anti inflammatory associated genes after 
8h of CpG-ODN stimulation. These genes included FPRL1, 
1L-10 receptor, vitamin D receptor, and UR (Table 3). FPR 
and FPR LI have been defined as chemotactic factors 
involved in host defense against bacterial infection and in the 
clearance of damaged cells. Additional studies have indicated 
that FRPL3 interacts with a menagerie of structurally diverse 
pro- and anti-inflammatory ligands associated with diseases, 
inducting amyloidosis, Alzheimer's diseases, prion disease 
and HIV (28, 29]. Therefore, FPRU may play an important 
role in regulating and/or balancing the production of pro- 
and anti-inflammatory molecules in CpG stimulated THP-1 
cells. Additionally, a recent study has demonstrated that LxRs 
and their ligands act as negative regulators of macrophage 
inflammatory ^ene expression and inhibit the expression of 
inflammatory mediators such as inducible nitric oxide syn- 
thase, cydooxygenase and 1L-6 in response to bacterial 
infection or LPS stimulation |30]. Of interest, we found that a 
transcription factor gene connexin 59, a regulator of. 1L-6 
expression, was up-regulated after 2 h of CpG-ODN stimula- 
tion. It is thus likely that CpG-ODN stimulation of THP-1 
cells for 2 b may induce the expression of the pro-inflamma- 
tory cytokine 1L-6 through the up-regulatipn of the connexin 
59 gene, while 8h of CpG-ODN treatment may counter- 
balance the initial inflammatory response by inducing LxR to 
inhibit JL-6 production. 

Signal transduction molecules play an important role in 
cellular activation. Intracellular signal transduction systems 
employing various intermolecular interactions through 
docking elements, including SH2 and SH3 domains, have 
been reported |31-33J. Here we found that THP-1 celis 
treated with CpG-ODN for 2 h up-regulated gene expression 
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of Grb2-Bke protein (which contains an SH3 domain), while 
8h of stimulation induced Ndc, Ash and phospholipase C 
binding protein (NAP4 which contains an SH2 domain). It is 
thus possible that Grb2-like. protein arid NAP4 may play 
important roles in CpG-ODN mediated signaling pathways. 
Furthermore, recent studies have also revealed that binding 
of CpG DNA to TLR9 results in activation of ?NK [34J. Since 
JNK is activated by Nek adaptor protein and Nc* interacting 
Hrjase [35, 36], it is possible that CpG-ODN may activate JNK 
via ur>regulation of NAP4. Although a recent publication 
described the gene expression profiles of a cultured mouse 
macrophage ceB line after CpG-DNA stimulation [34], their 
microarray results were only conducted at one time-point 
{6h stimulation). Moreover, they did not report the meas- 
urement of protein expression profiles in response to CpG- 
ODN stimulation. ' 

Comparison of the gene and protein expression profiles 
showed that there was discordance between mRNA and pro- 
tein levels flaWeS). Similar discordance between the 
expression pattern of genes and proteins was also reported in 
other system using different stimuli |37-MJ. The dis- 
cordance between mRNA and protein levels could be due to 
screening capability such as detection sensitivity, choice of 
cut-off point, quantitativity of microarray and 2-D gels, as 
well as time discrepancy between gene and protein expres- 
sion [39, 40, 42, 43]. Alternatively, it could also be explained 
by posMrancriptional events, such as alternative splicing or 
PTM [39, 40, 42, 43). Another possible explanation is that 
most of the spots observed in the 2-D gels are isoforms of 
some proteins. The intensity of each spots does not neces- 
sary represent total amount of a certain protein and thus 
does not correlate with its mRNA level. Our find ing that 
mjcroarray results correlated better with Western blotting 
results [c.g. t ARF-3 in Fig. 2), an approach more suitable than 
2-D gels for determining, the total amount rather than iso- 
forms of a given protein, seems to suggest that formation of 
isoforms should be.carefufly taken into consideration when 
one tries to correlate mRNA and protein expression data. 

Using a proteomics approach, we found that CpG-ODN 
treatment up-regulated the expression of many proteins 
including HSPs, metabolic enzymes, structural proteins, as 
well as macrophage capping protein, cydophilin and protea- 
some a and 0 chain etc HSPs are the most aWidant and 
ubrqmtous soluble intracellular proteins. They are up-regu- 
lated by various stressors including temperature, glucose 
deprivation, microbial infection and. cancer |44J. They func- 
tion as molecular chaperones to prevent protein aggregation 
and contribute to the folding of nascent and altered proteins. 
In addition, they are able to regulate immune responses, 
including production of inflammatory cytokines and che- 
molcines and activation or maturation of immune cells [45 
46]. Beside HSPs, cydophilin as well as proteasome a and 0 
cham have also been reported to be involved in the immune 
response [39, 47]; proteasome p chain is consistently un- 
regulated in human neutrophils following LPS exposure |39] 
Our finding that the protein levels of HSPs, cydophilin and 
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proteasome a and 0 chain were increased after CpG-ODN 
treatment suggests that these.molecuies might play a role in 
the immunostimulatjng effect of CpG-ODN. To what extent 
these proteins contribute to the immune responses of the 
cells to CpG-ODN is currently under study. Proteomic anal- 
ysis also showed that truncated forms of grp7», grp94 and 
hsc70 were induced, a phenomenon similar to calreticulin. 
observed by Richards and his coworkers [4*]. The expression 
of full length hsc70 and grp78 were decreased, while the 
levels of their truncated derivatives was increased after CpG- 
QDN treatment. These-results suggest that the degradation 
of these proteins has been enhanced We also found that 
proteosome a and 0 chains as well as ubiquibn specific pro- 
tease 12 were increased by CpG-ODN. Whether these 
enzymes or other enzymes were responsible for the genera- 
tion of truncated hsc70 and grp78 remains to be ducidated. 

Cells rely on multiple signaling pathways to determine 
their fates of survival, proliferation or apoptosis [49]. In fact, ■ 
apoptosis plays an important role in regulating pathogen' 
infection. To be able to grow and replicate in the target cells 
pathogens may have to block apoptosis. Results from several 
laboratories have made it dear that HSP70 and HSP27 pro- 
tect cells not only from heat, but also from most apoptotic 
stimuli |48, 50] by binding to Akt and subsequently mediat- 
ing anti-apoptotic activity through activation of Akt |51-53]_ 
Since our data revealed that CpG-ODN induced the expres- 
sion of HSP90 and HSP27, it is possible that CpG-ODN 
might prevent apoptosis by up-regulation of HSPs. 

Interestingly, our microarray data also showed that CpG- 
ODN mediated the induction of a set of genes associated 
with tumor progression and cell proliferation. Among these, 
one gene, WISP-2, was up-regulated by. CpG-ODN after both 
. 2 and 8h stimulation. WISP genes were first identified as 
downstream targets of the Wnt-10-catenin signaling path- 
way. They belong to the CCN family of growth factors that 
have been receiving increasing attention lately due to some 
of the family members having been reported to be involved 
in angiogenesis and turn ori genesis [54]. It would be inter- 
esting to evaluate whether CpG-ODN plays a role in angio- 
genesis and rumorigenesis by regulating W1SP-2. In addi- 
tion, we found that some genes associated with neurode- 
generation or neuroprotection, such as FPRU, NMDAR 
(NM DA) receptor, PKC and dishevelled 3 were up-regulated. 
To our knowledge, this is the first report to suggest an asso- 
ciation between these genes and CpG-ODN stimulation. As 
mentioned above, FPRU plays a crucial role in proin- 
flammatory aspects of systemic amyloidsis and neurodegen- 
erarive disease such as Alzheimer's disease and prion disease 
(28). NMDAR, PKC and disheveled are involved in modulat- 
ing amyloid precursor protein metabolism, which is central 
to the pathogenesis of Alzheimer's disease [55-57]. Most 
notably, recent studies have shown that the TLR4-dependent 
pathway is involved in neujodegenerarion of the central 
nervous system |58J. Whether CpG-ODN moieties of patho- 
gens play any role in neurodegenerative diseases such as 
Alzheimer's remains to be elucidated. 
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Figure 8. ARF-3 participates in the CpG-ODN-TLRS-NF-icB path- 
way. HEK293 cells were cotransfected with P 5xNF->cB and human 
TLR9J After overnight transfection, the cells were incubated with 
or without 1.5 fiM CpG-ODN for 8 h in the presence or absence of 
increasing concentrations of the ARF-3 inhibitor brefeldiri A. 
After incubation, cells were rysed and NF-xB luciferase activity 
was measured. Data represent mean ± SEN*. |/i = 3). 

Exposure of cells to LPS ot microbial infection has been 
known to induce several genes encoding metabolic enzyme 
(34, 39) : Our microarray data also revealed that a large number 
of genes encoding proteins involved in energy synthesis and 
fatty acid oxidation, such as enoyl- coenzyme A bydratase, 
propionyl coenzyme A carboxylase and cytochrome p450 . 
were activated by CpG-ODN treatment. In addition, we found 
that other proteins such as ARF-3 were up-regulated (Table 3, 
Fig. 8). ARFs are 20kDa GTPases of the ras superfamily that 
are critical to vesicular trafficking, including exocytic protein 
transport and endocytosis |59, 60). This study demonstrates 
for thefirst time that ARF-3 is involved in the activation of N F- 
kB induced by CpG-ODN (as shown in Fig. 8) 

CpG-DNA/ODN has been shown to elicit primarily 
responses via the TLR9/MyD88 dependent pathway [38, 21, 
22]. Chromosome location analysis showed that instead of 
localizing on one or two chromosome, the genes/proteins 
modulated by CpG-ODN stimulation are scattered on all 
chromosomes except chromosomes 23 and 24. These results 
seem to suggest that CpG-ODN either affects multiple chro- 
mosomes simultaneously or subsequently via cascades of 
cellular messengers. More studies are needed to elucidate its 
mechanism of actions. 



5 Concluding remarks 

In summary, by using microarray and pioteomic approaches 
to evaluate the effect of CpG-ODN at different time points, 
we have found that genes/proteins regulated by CpG-ODN 
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are related to Mammatory responses, antirniaobial defense, 
transcriptional regulation, intracellular signal transduction] 
tumor progression, cell differentiation, proteolysis, anti-apop^ 
tosis as well as neurodegeneration and neuroprotection. Our 
results may help delineate the CpG-ODN mediated pathway 
and contribute to further understanding of mechanisxm that 
link innate immunity with acquired immune response(s). 
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- " dmarkB "' ^> Skaih^, 3^ 

Institute of Medical Biochemistry, University of Oslo, Norway. 

cDNA probes, iso^e^^l^ T t6Sticular cel1 ^ ™ng 

mRNA and protein^ ^nedt^^ "f^ measur «. Amounts of 
germ cells (pachytene ^e3oc^ , Sert0 ' cultured peritubular cells, 
whole te.t<*from r *X s Pf.^.ds), L*ydigcell tumours as well as 

between the amount of Si «S£ " ^ ^ WaS 3 g00d COlTeIation 

of cell, such as germ oe^U^ZTZ^ZlT^ ^ *** 

amounts of RII beta mRNA were fo \tT^T * 9 ' Ways me ^ ^ 

immunoreactive protelnwasTow Rir^enm the * 

developing rat testis after 30 to 40 days 0 f fJe S f } ™ f ° Und in ^ 
only partially reflected at the protein bvei 86 am ° UntS ° f mRNA were 

spermatids were practically dl^Zf both tffifT a ° d r ° Und 

spermatid differentiation, ^^^ifw ^ 
protein. Cell specific distribution nf «h7, P , md 311 inbrease in RH alpha 

described. In some typefof ceU oW VB V W 1 , PKA subuni * * testicular cell type? is . 
demonstrated, whicSlyT^t^ 

for some of these mRNAs i^^SjTlf dlff * rences ln franslational efficiencies 
alphan,mei 0 t^ 

PMID: 8107013 [PubMed- indexed for MEDLINE] 
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Quantification of CK20 gene and protein expression hi colorectal cancer 
by RT-PCR and immunolnstochemistry reveals inter- and intratnmonr 
heterogeneity. 

Lassmann S , Bauer M, Soong R . Schreglmann J . Tahiti' IC Nalirie: J Rucer R 
Hofler R Werner M . * 

Pathologisches tnstitui, Universitatsklinikum Freiburg, Albertstrasse 19, 79104 Freiburg 
Germany. Iassmann@ukl.uai-freiburg.de ' ' 

Cytokeratin 20 (CK20) is an epithelial protein expressed almost exclusively in the 
gastrointestinal <GI) tract and is widely used as immunohistochemical marker for routine 
diagnosis. In contrast, CK20 gene expression is not an established marker for the 
classification of tumours and the detection of disseminated cancer cells in colorectal 
cancer. Recently, real-time reverse transcriptase polymerase chain reaction (RT-PCR) has 
provided the means for reproducible and quantitative investigation of molecular markers 
This report directly compares CK2Q mRNA and protein expression in serial sections of 
archival, formalin-fixed, paraffin-embedded (FFPE) colorectal adenocarcinomas. CK20 
expression was detected by immunohistochemistry (IHC) in 60/63 (95 2%) cases by 
conventional RT-PCR in 58/60 (96.7%) and by quantitative RT-PCR using the 
LightCycler (LightCycler is a trademark of a Member of the Roche Group) System in 
29/32 (90.6%) microdissected cases, one case yielding variable results. Despite the high 
detection rate of all three techniques, marked heterogeneity of CK20 expression was seen 
between different cases and also within individual cases. CK20 expression profiles were 
not related to particular histopathological features of the tumours. A good correlation (r = 
0.8964) was found between CK20 mRNA and protein expression by comparing 
quantitative RT-PCR with IHC in 32 cases. This was also true for selected heterogeneous 
tumour cells within individual cases. Both RT-PCR and IHC are therefore valuable tools 
for CK20 detection in colorectal adenocarcinoma, with real-time RT-PCR providing 
supplementary quantitative information. This suggests a promising supportive role for 
quantitative RT-PCR in molecular pathology. Copyright 2002 John Wiley & Sons, Ltd. 

Publication Types: 

♦ Evaluation Studies 
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Expression of the multidrug resistance-associated protein (MRP) mRNA 
and protein in normal peripheral blood and bone marrow haemopoietic 
eells. 

Lcgrand Q, PcrrotJY, Tang R . Simonto G , GurbuxaniS. ZittounR , Marie Jr . 

Laboratoire de Cinetique et de Cultures Cellulaires, Hotel Dieu, Paris, France. 

We studied the expression of multidrug resistance-associated protein (MRP) in normal 
haemopoietic cells from peripheral blood and bone marrow. The MRP mRNA levels 
were estimated by RT/PCR and in situ hybridization (ISH) assay, and the protein levels 
by flow cytometry. 21 samples of peripheral blood and 21 samples of bone marrow (1 1 
normal bone marrow donors, 10 patients in complete remission after chemotherapy for 
large cell lymphoma or acute myeloid leukaemia) were analysed. In peripheral blood the 
mean MRP mRNA level in CD34- cells was statistically higher than in the other cells (3- 
fold by the methods used). The levels of MRP in CD3+ varied from one individual to 
another (4.5-34,8 units by RT/PCR and 5-23 grains/cell by ISH); however, this was 
proportional to the variation in all the cell lineages of same individual (r - 0.84). In bone 
marrow the mean MRP levels of the various cell lineages (including CD34+) were 
similar to the basal level in HL60 cells. Individual expression levels were again variable; 
however, mere was no difference between untreated normal bone marrow and post 
chemotherapy normal bone marrow. MRP protein expression was determined by flow 
cytometry with the monoclonal antibody MRPm6. The CD4+ lymphocytes exhibited a 
higher MRP protein expression than die other cell lineages, including CD8+ cells. There 
was a good correlation between the three methods used (RT/PCR and ISH, P = 0.000 1 , r 
= 0.87; RT/PCR and flow cytometry, P = 0.0001 , r = 0.85; ISH and flow cytometry P = 
0.002, r = 0.67): 

PM1D: 8757504 [PubMed - indexed for MEDLINE] 
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Vascular endothelial growth factor enhances cardiac allograft 
arteriosclerosis. 

Lemstrom KB, KrebsR, Nykancn AI , Tikkanen JM , Sihvota RK Aaltola EM . 
QzxnJLIi W*wjJ> AljtahLK, Yla-Herttuala S, Koskiticn PK . 

Cardu>pulmonary Research Group, Transplantation Laboratory, University of Helsinki . 
and Helsinki University Central Hospital, Helsinki, Finland. Karl.Lemstrom@helsirdci.fi 

BACKGROUND: Cardiac allograft arteriosclerosis is a complex process of alloimmune 
response, chronic inflammation, and smooth muscle cell proliferation that includes cross 
talk between cytokines and growth factors. METHODS AND RESULTS: Our results in 
rat cardiac allografts established alloimmune response as an alternative stimulus capable 
of inducing vascular endothelial growth factor (VEGF) mRNA and protein expression in 
cardiomyocyr.es arid graft-infiltrating mononuclear inflammatory cells, which suggests 
that these cells may function as a source of VEGF to the cells of coronary arteries. Linear 
regression analysis of these allografts with different stages of arteriosclerotic lesions 
revealed a strong correlation between intragraft VEGF protein expression and the 
development of intimal thickening, whereas blockade of signaling downstream of VEGF 
receptor significantly reduced arteriosclerotic lesions. In addition, in cholesterol-fed 
rabbits, intracoronary perfusion of cardiac allografts with a clinical-grade adenoviral 
vector that encoded mouse VEGF(164) enhanced the formation of arteriosclerotic 
lesions, possibly secondary to increased intragraft influx of macrophages and 
neovascularization in the intimal lesions. CONCLUSIONS: Our findings suggest a 
positive regulatory role between VEGF and coronary arteriosclerotic lesion formation in 
the allograft cytokine microenvironment. " 
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CHAPTER 29 
Regulation of transcription 



1 



3 



3 



The phenotypic differences that distinguish the 
various kinds of cells in a higher eukaryote are 
largely due lo differences in the expression of 
genes that code for proteins, tbaL is, those tran- 
scribed by RNA polymerase II In principle, the 
expression of these genes might be regulated at 
any one of several stages. The concept of the 
"level of control' implies that gene expression 
is not necessarily an au lorn a tic process once it 
\ has begun. It could be regulated in a gene- 
. specific way at any. one of several sequential 
steps. We can distinguish (at least) five poten- 
tial control points, forming the series: 

Activation of gene structure 
I 

Initiation of transcription 
I 

Processing the transcript 
I 

Transport to cytoplasm 
I 

Translation of mJWA 

The existence of the first step is implied by 
rhe discovery that genes may exist in either of 
two structural conditions: Relative to the state 
of most of the genome, genes are found in 
an "active'' state in the .cells in which they 
are expressed (see Chapter 27). The change of 
Nocture is distinct from 'the act of transcrip- 
t! on, and indicates that the gene is "transcrib- 
*We." This suggests that acquisition of the 
"active* structure must be the first step in gene 
expression. 

Transcription of a gfi j)r, in ihr: active Mate ,. s 



controlled at the stage of initiation, Aat is, by 
the interaction of RNA polymerase with its pro- 
moter. This is now becoming susceptible' to 
analysis in the in vitro systems (see Chapter 
28). For most genes, this is a major control 
point; probably it is the most common level of 
regulation. 

There is at present no evidence for control 
at subsequent stages of transcription in eultary- 
olic cells, for example, via antilermmation 
mechanisms. 

The primary transcript is modified by capping 
at the 5' end, and usually also by polyadenyla- 
tion at the 3' end; Introns must be spliced out 
from the transcripts of interrupted genes, the 
mamre RNA must be exported from the nucleus 
to the cytoplasm. Regulation of gene expression 
by selection of sequences at the level of -nuclear 
RNA^ might involve any or all of these stages, 
but the one for which we have most evidence 
concerns changes in splicing; some genes are 
expressed by means of alternative splicing pat- 
terns whose regulation controls the type of pro- 
tein product (see Chapter 30). 

Finally, the translation of an mKNA in the cyto- 
plasm can be specifically controlled. There is little 
evidence for the employment of this mechanism in 
adult. "somaiic cells, but it does ocaxr in some 
embryonic situations, as described in Chapter 7. 
The mechanism is presumed to involve the block- 
ing of initiation of translation of some mlUNAs by 
specific protein factors. 

But baring acknowledged thai control of gene 
expression can occur at multiple stages, and 
that production of RNA cannot inevitably be 
equaled with production of protein, il is clear 
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848 i Chapter 29 



that the overwhelming majority of regulatory 
events occur at the initiation of transcription. 
Regulation of tissue-specific gene transcription 
lies at the heart of eu&aryotic differentiation; 
indeed, we see examples in Chapter 58 in 
which proteins that regulate embryonic devel- 
opment prove to be transcription factors. A reg- 
ulatory transcription factor serves to provide 



common control of a. large number of target 
genes, and v?e seek to answer two questions 
about this mode of regulation; what identifies 
the common target genes to the transcription 
factor; and hov? is the activity of the transcrip- 
tion factor itself regulated in response to intrin- 
sic or extrinsic signals? 



Response elements identify genes under common 
regulation 



The principle that emerges from characterizing 
groups of genes under common control is that 
they share a promoter element that is recognized 
by a regulatory transcription factor. An element 
that causes a gene to respond to such a factor 
is called a response element; examples are the 
. HSE (lieat shock response element), . GKE 
(glucocorticoid response element), SRE (serum 
response element). 

The properties of some inducible transcription 
factors and the elements that they recognize are 
summarized in Table 29.1. Response elemenls 
have the same general characteristics as 
upstream elemenls of promoters, or enhancers. 
Tbey contain short consensus sequences, and 
copies of the response elements found in dif- 
ferent genes are closely related, but not neces- 
sarily identical. The region bound by die factor 
extends for a short distance on either side of 



'■: • '. Table 29.1 . inducible transcription "factors, bind tb*'l 
V " response elements that Identity groups of promoters " 
'or'enhancers subject to coordinate cpntroL: 



Regulatory Agent Module Consensus Foctoi 

Heat shock ' HSE CMNGAANN7CCNNG HSTF 

Glucocorticoid GRE TGGTACAAATGTTC1 Receplor 

PhOfbol estei 1 RE 7GAC7CA API 

Semrn SRE CCATATTAGG SRF 



the consensus sequence. In promoters, the ele- 
ments are not present at fixed distances from 
the startpoinl, but are usually <200 bp upstream 
of it The presence of a single element usually 
is sufficient to confer, the regulatory response, 
but sometimes there are multiple copies. 

Response elements may be located in pro- 
moters or in enhancers. Some types of elements 
are typically found in one rather than the other 
usually an USE is found in a promoter, while a 
GRE is found in an enhancer. We assume that 
all response elements function' by the same 
general principle. A gene is regulated by a 
sequence at the promoter or cnfwncer that is 
recognized by a specific protein. 77ie protein 
Junctions as a transcription factor needed Jot 
UNA polymerase to initiate. Active protein is 
available only under conditions when the gene is 
to be expressed; its absence means that the pro- 
moter is not activated by this particular circuit. 

An example of a situation in which many 
genes are controlled by a single factor is pro- 
vided by the heat shock response. This is com- 
mon to a wide range of proiaryotes and 
enkoryotes and involves multiple controls of 
gene expression: an increase in temperature 
turns oJT transcription of some genes, turns on 
transcription of the heat shock genes, and 
causes changes in the translation of wRNAs. 
The control of the heat shock genes illustrates 
the differences between prokaryoiie and 
eukaryotic modes of control. In bacteria, a ne* 
sigma factor is synthesized that directs RNA 
polymerase holoenzyme to recognize an alter 
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Retinal preconditioning and the induction ofheat-shoclc protein 27. 
LiY, RothS, LascrM, MzJX, Crosson CE . 

cSZmIs% h ^ m ° l0gy ' MediCa ' UniVCrSity ° f S ° Uth Car0,ina > Char,est0 ". ^uth 

!^°n E: / rief P Cri0dS ° f iSbhemia have bcen shown t0 Protect the retina from 
potential y damagtng penods of ischemia. This phenomenon has been termedTsThemic 
precond.t.onmg or lS chemic tolerance. In the present study the cellular cSes in Lvels 
of heat shock protein (Hs P )27, -70, and -90 mRNA and expression of S e S na 
associated wtth ischemic preconditioning were evaluated. METHODS- IJnStol 

■Sf» H l ^ da y s ' toallowthe retina to reperfuse. Retinas were dissected the 

mRNA and protein .solated, and Northern and Western blot analyses conducted to detect 
changes ^expression of Hs P 27, -70, and -90. tmmunohistochemiea. studt wte US S t0 
»dent,fy retmal regions where Hsp changes occurred. Selected animals were surZSo 
a second .schemic event, 60 minutes in duration, to correlate the changeTTn SSL of 
Hsp with funct,onal protection of the retina from ischemic injury. RESVLT^lnTZl 

hours after retinal precondtt.omng, levels of Hs P 27 mRNA were elevated above control 
levels, and 24 hours later, mRNA levels increased 200% over basal le^s Hsp27 

S Hsp27 ? 0tem ,eve,s were increased by 200% over basal levels 24 hours after 

eve. L r To°ntr m8 ' ? ^ «" 72 ^ «« ^returned to co rol 
levels, in contrast no consistent change m Hsp?() Qr 9Q mRNA 

observed during the course of the study. Immunohistochemical studies demons^ that 
. he mcrease m expression of Hsp27 was localized toneuronal and non-neuronaSs n 
he,nner layer, of the retina. Electroretinography studies demonstrated a strong 
correlation brtween the protection of retinal function from ischemic injury and the 

7hs^ S 0 ^ CO f NCLUSIONS : ThCSe reSUkS Pf0Vide CViden - SSduction 
o Hs P 27 ,s a gene-spec.fic event assocated with ischemic preconditioning in the retina 

c 1 ZZZl Tt SSi ° n ° f HSP2? OCCUrS in - b ° th neUr ° nal - d non-neurpna. rt n7 
cells, and appears to be one component of the neuroprotective events induced by 

ischemic preconditioning in the retina. . .' y 

PMID: 12601062 [PubMed - indexed for MEDLINE] 
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Enhanced expressions of amchidonic acid-sensitive tandera-pore domain 
potassium channels in rat experimental acute cerebral ischemia. 

Lj_ZB, Zhang HX l.i i TJ. 

tl^TJ !f ated o MS ■ iC3 ' ChineSC Academy of Medical Sciences and Peking Union 
Medical College, Beijing 100050, China. ^ 

To further explore the pathophysiological significance of arachidonic acid-sensitive 
potass.um channels, RT-PCR and Western blot analysis were used to investigate the 
express.on changes of TREK channels in cortex and hippocampus in rat experimental 
acute c«rebraf ,sc hernia in this study. Results showed that TREK-1 and TRAAK mRNA 
m cortej TREK-1 and TREK-2 rrtRNA in hippocampus showed significanYmtr^ h 
after rnrddle cerebral artery occlusion (MCAO). While the mRNA expression levels of 
Uie all three channel subtypes increased significantly 24 h after MCAO in cortex and 

Sw5? PUS fi S3mC tim6 ' thC Pr ° tei ' n ex P ressions the three channel proteins 
i^ ^ficant mcrease 24 h after MCAO in cortex and hippocampus, but only 
1 showed mcreased expression 2 h after MCAO in cortex and hippocampus 

Immunohis tochemical experiments Verified that all the three channel proteins had higher 
expression levels m cortical and hippocampal neurons 24 h after MCAO These resuS 

TlEKchM^hr^A betWeC " TREK channe,s md a ™ te ««ebral ischemTa 

1 REK channels might provide a neuroprotective mechanism in the pathological process. 

PMBD: 15652517 (PubMed - indexed for MEDLINE] 



IT; 



21 3|A^ralBioI.2001 Jan;46( 1 ):23-31. Re.atedArtfc.es. Links 

Increasiug expression of tissue plasminogen activator and plasminogen 
achvator wuibitor type 2 in dog gingival tissues with progressive 
inflammation. 

Lindberg P, Kmnby B, Lecander T T.^NV 

STTh? ( i?? B, ? h Sciences > Malm ° University, S-214 21 Malmo, Sweden, 
pia.lindberg@od.riiali.se 

Urokinase and tissue-type plasminogen activators- (u-PA and t-PA) are serine proteases 
meLuo ^ Plasm ' n ^ en ' A nt0 P'asmin, which degrades matrix proteins and actiSeT 
metaloproteinases The P As are balanced by specific inhibitors (PAI-l and PA 1-2) 

nCi P ™r i ° n " PA PAI " 2 WaS ^onstrated in human gingival 

tissues. The mm now was to mvestigate the production and localization of t- pf and PAI- 

hlTtwT T m • d °? S ^ thrCe ******* Periodontal conditions; clinically 
healthy gmgiv^ chrome gingivitis and an initial stage of ligature-induced loss of 
attachment. Atthe start of the experiment the gingiva showed clear signs of 
SZETah 7 ^t 1 ^ were ob ^ined after 21 days period of intense 

sTme S ^ ^ indUCCd by Placin ^ ™ bber 'igatures'around the neck of 

n^lS f T^ taken fr ° m ^ renting the different conditions and 

JSTll ? h y b " d,zation 3,1(1 immunohistochemistry. In clinically healthy 

STs Z " T J'" ° t_PA Sign3lS ° r Stainin g were seen » ^ive tissue. Both 
n^A scaling and 

Snlc? 7 ^"f™^ l - PA ^ well as antigen were founding sulcular 

Se wiTvo! P H t0 3 ^ i,aX ^ 38 gingivUis - 0^-na.ly the connective 
bin I t i ^ COnnectio " ««h ^ssels. PAI-2 mRNA was seen in a - 

Hun outer layer of the sulcular and junctional epithelia in clinically healthy gingiva, but 
no S^als were seen in connective tissue. PAI-2 antigen was found primal fn thY 
outerlayer of me sulcular and junctional epithelia. Somecells in the connective tissue 
more ST t T? *?> ^ Were main >y ^"d in the same locations, but 

Z^^f^^T? ^ COnnCCtiVe tisSUe " ^staining was seen in 

the outer ha f of the sulcular and junct.onal epithelia as well as in the upper part of the 
connective t.ssue, close to the sulcular epithelium. In sites with loss of atcLent, PAI- 
2 mRNA was found throughout the sulcular and junctional epithelia, as was the antigen 
which stained intensely. No PAI-2 mRNA was seen in connective tissue- the antigen was 

p A u i?Ar 2 , in espec,a,Iy n r vessds - Th,s study shows -** the 

fi^jf t l Wth ex P enmenta I gingival inflammation in the dog, and 

farthermore, he two techniques demonstrate a strong correlation between the 
topographical distnbut.on of the site of protein synthesis and the tissue location of the 
antigens for bofli t-PA and PAI-2. The distribution correlates well with previous 
findings in humans. 
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Effect of duration of fixation on quantitative reverse transcription 
polymerase chain reaction analyses. 

Macabc o-Ong M , Ginzingcr PG , Deklcer R McMillan A . Regezi JA WW DT 
Jordan RC ' ' 



Oral Pathology, Department of Stomatology, University of California San Francisco 
California 94143-0424, USA. 

Increasingly, there is the need to analyze gene expression in tumor tissues and correlate 
these findings with clinical outcome. Because there are fewjissue banks containing 
enough frozen material suitable for large-scale genetic analyses, methods to isolate and 
; quantify messenger RNA (mRNA) from formalin-fixed, paraffin-embedded tissue 
sections are needed. Recovery of RNA from routinely processed biopsies and 
quantification by the polymerase chain reaction (PCR) has been reported; however, the 
effects of formalin fixation have not been well studied. We used a proteinase K-salt 
precipitation RNA isolation protocol followed by TaqMan quantitative PCR to compare 
the effect of formalin fixation for 24, 48, and 72 hours and for I week in normal (2), oral 
epithelial dysplasia (3), and oral squamous cell carcinoma (4) specimens yielding 9 fresh 
and 36 formalin-fixed samples. We also compared mRNA and protein expression levels 
using immunohistochemistry for epidermal growth factor receptor (EGFR), matrix 
metalloproteinase (MMP)-l , p21, and vascular endothelial growth factor ( VEGF) in 1 5 
randomly selected and routinely processed oral carcinomas. We were able to extract 
RNA suitable for quantitative reverse transcription (RT) from all fresh (9/9) and 
formalin-fixed (36/36) specimens fixed for differing lengths of time and from all (15/15) 
randomly selected oral squamous cell carcinoma. We found that prolonged formalin 
fixation (>48 h) had a detrimental effect on quantitative RT polymerase chain reaction 
results that was most marked for MMP-1 and VEGF but less evident for p21 and EGFR. 
Comparisons - of quantitative RT polymerase chain reaction and immunolustocheniistry 
showed that for all markers, except p21, there was good correlation between mRNA and 
protein levels. p21 mRNA was overexpressed in only one case, but protein levels were 
elevated in all but one tumor, consistent with the established translation^ regulation of 
p2L These results show that RNA can be reliably isolated from formalin-fixed, paraffin- 
embedded tissue sections and can produce reliable quantitative RT-PCR data. However, 
results for some markers are adversely affected by prolonged formalin fixation times. 
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Id-1 and Id-2 arc overcxpressed iu pancreatic cancer and iu dysplastic 
lesions iu chronic pancreatitis. 

Maruvama H , KjceffJ, WUdiS, Prjess H , Buchter MAY . l sraf 1MA l£^jvr 
Dmsion of Endocrinology, Department of Medicine, University of California, Irvine, 

Id proteins antagonize basic helix-loop-helix proteins, inhibit differentiation, and enhance 
cell prol.ferat.on. In this study we compared the expression of M l , Id-2 and id-3 in the 
normal pancreas, m pancreatic cancer, and in chronic pancreatitis (CP). Northern blot 
analysts demonstrated that all three Id mRNA species were expressed at high levels in 
pancreauc cancer samples by comparison with normal P r CP samples. Pancreatic cancer 
™ ^ uen «y coexpressed all three Ids, exhibiting a good correlation between Id 
mRNA and protein levels, as determined by immunoblotting with highly specific antMd 
ant.bod.es. tmmunohistochemistry using these antibodies demonstrated the presence of 
taint ld-I and Id-2 immunostaining in pancreatic ductal cells in the normal pancreas 
whereas Id-3 .mmunoreactiviry ranged from weak to strong.^ the cancer tissues, many 
of the cancer cells exhibited abundant Id-1, Id-2, and Id-3 immunoreactivity. Scoring on 
the basis of percentage of positive cells and intensity of immunostaining indicated that 
Id-1 and Id-2 were increased significantly in the cancer cells by comparison with the 
respect.ve controls. Mild to moderate Id immunoreactivity was also seen in the ductal 
cells m the CP-I,ke areas adjacent to these cells and in the ductal cells of small and 
interlobular ducts in CP. In contrast, in dysplastic and atypical papillary ducts in CP Id-I 
and Id-2 immunoreact.vity was as significantly elevated as in the cancer cells These 
findings suggest that increased Id expression may be associated with enhanced 
proliferative potent.al of pancreatic cancer cells and of proliferating or dysplastic ductal 
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ld-1 and ld-2 Are Overexpressed in Pancreatic 
Cancer and in Dysplastic Lesions, in 
Chronic Pancreatitis 



Haruhisa rvlaruyama,* Jorg Kleeff/ Stefan Wildi/ . 
Helmut Friess* Markus W. Buchler* 
Mark A. Israel * and Murray Korc* 

from the JXvision of Endocrinology, Diabetes, and Metabolism*. 
Departments of Medicine, Biological Chemistry and 
Pharmacology, University of California, Irvine, California; the ' 
Department of Visceral and Transplantation Surgery* University 
of Bent, Bern, Switzerland; and the Pteuss laboratory* 
Department of Neurological Surgery, University of California, 
San Francisco, California . 



Jd proteins antagonize basic betibc-Joop-belix pro- 
teins, inhibit differentiation, and enhance cell prolif- 
eration. In this study we compared the expression of 
Id-3, ld-2, and Id-3 in the normal pancreas, in pan- 
creatic cancer, and in chronic pancreatitis (CP). 
Northern blot analysis demonstrated that all three Id 
mRNA species were expressed at high levels in pan- 
creatic cancer samples by comparison with normal or 
CP samples. Pancreatic cancer eel) lines frequently 
coexpressed ali three Ids, exhibiting a good correla- 
tion between Id mRNA and protein levels, as deter- 
mined by immunoblotting with highly specific anti-Id 
antibodies, lmmun ©histochemistry using these anti- 
bodies demonstrated the presence of faint ld-1 and 
)d-2 immunostaining in pancreatic ductal cells in the 
norma] pancreas, whereas ld-3 immunore activity 
ranged from weal; to strong. In the cancer tissues, 
many of the cancer cells exhibited abundant Id 3 , 
ld-2, and ld-3 immunoreaciivity. Scoring on the basis 
of percentage of positive cells and intensity of immu- 
nostaining indicated that Id ] and ld-2 were increased 
significantly in the cancer cells by comparison with 
the respective controls. Mild lo moderate Id immuno- 
reactivity was also seen in the ductal cells in the 
CP- like areas adjacent to these cells and in the ductal 
cells of small and interlobular ducts in CP. In con- 
trast, in dysplastic and atypical papillary ducts in CP, 
Id- J and ld-2 immnnoreactiviry was as significantly 
elevoicd as in the cancer cells. These findings suggest 
that increased Id expression may be associated with 
enhanced proliferative potential of pancreatic cancer 
cells and of prolife rating or dysplastic ductal cells in 
CP. (Am J Pathol 199% i55:8J5-S22) 



Basic helix-loop-helix (bHLH) proteins play an important 
role as transcription factors in cellular development, pro- 
liferation, and differentiation. 1 * 2 The basic domain of the 
bHLHs is required for binding to an E-box DNA se- 
quence, thus promoting transcription of specific target 
genes. The HLH domain promotes dimer formation with 
various members of the bHLH protein family. 1 - 2 Ho- 
modimers of the class B family of bHLH proteins, includ- 
ing MyoD, NeuroD, and numerous other proteins, are 
known to activate tissue-specific genes. 3 ^ These tissue- 
specific bHLHs typically form heierodimers with widely 
expressed class A bHLHs, which include proteins en- 
coded by E2A,£2-2, HEB, and other genes (also termed 
E-proteins). 6-9 These heterodimers activate transcription 
of genes that are associated with differentiation. 

Id genes encode a family of four HLH proteins that lack 
the basic DNA binding domain. 1 - 10 They act as dominant- 
negative HLH proteins by forming high affinity het- 
erodimers with other bHLH proteins, thereby preventing 
them from binding to DNA and inhibiting transcription of 
dillerentiation-associaled genes. to ~ 12 Id gene expres- 
sion is down-regulated on differentiation in many cell 
types in vitro and in vivo 3-18 In addition, Id proteins seem 
to be required tor cell cycle progression through GJS 
phase in certain cell types, and interaction between ld-2 
and pRB is associated with enhanced proliferation in 
some cell lines in v/iVo. 19 " 23 

Pancreatic cancer is the fifth leading cause ol cancer 
death in the United States, with a mortality rate that vir- 
tually equals its incidence rate. 24 This malignancy is often 
associated with the over expression of a variety ol myo- 
genic growth factors and their receptors', and by onco- 
genic mutations of K-ras and inactivation ol the p53 tumor 
suppressor gene. 25 We have recently reported that pan- 
creatic cancers overexpress the HLH protein ld-2, and 
that enhanced expression ol this protein is evident in the 
cytoplasm of the cancer cells within the pancreatic tumor 
mass. 26 It is not known, however, whether the expression 
of other Id proteins is altered in this malignancy, or 
whether their expression is altered in chronic pancreatitis 
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(CP), an inflammatory . disease that is characterized by 
dysplaslic ducts, foci of proliferating ductal cells, acinar 
cell degeneration, and fibrosis. 27 We now report that 
there is a five- !o sixfold increase in Id-1 and ld-2 mRNA 
levels and a twofold increase in ld-3 mRNA levels in 
pancreatic cancer by comparison with the normal pan- 
creas. In contrast, overall Id mRNA levels are not in- 
creased in CP. 



Patients and Methods; 

Normal human pancreatic tissue samples from 7 male 
and 5 female donors (median age 41.8 years, range. 
14-^68 years), CP tissues from 13 males and 1 female 
(median age 42.1 years; range 30-56 years), and pan- 
creatic cancer tissues from 10 male and 6 female donors 
(median age 62.6 years; range 53-83 years) were ob- 
tained through an organ donor program and from surgi- 
cal specimens from patients with severe symptomatic 
chronic pancreatitis or pancreatic cancer. A partial 
. duodenopancreatectomy (Whipple/pylorus-preserying 
Whipple; n = 13), a left resection of the pancreas (n = 2), 
or a total pancreatectomy (n = 1) were carried out in the 
pancreatic cancer patients. According to the TNM clas- 
sification of the Union Internationale Contre le Cancer 
- (UICC) 6 tumors were stage 1, 1 was stage 2, and 9 were 
stage 3 ductal cell adenocarcinoma. Freshly removed 
tissue samples were fixed in 10% formaldehyde solution 
for 12 to 24 hours and paraffin-embedded for histological 
analysis. In addition; tissue samples were frozen in liquid 
nitrogen immediately on surgical removal and maintained in 
-80°C until use for RNA extraction. All studies were ap- 
proved by the Ethics. Committee of the University of Bern, 
Bern, Switzerland, and by the Human Subjects Committee 
at the University of California, Irvine, California. 

Northern Blot Analysis 

Northern blot analysis was carried out as described pre- 
. viously. 26 - 28 Briefly, total RNA was extracted by the single 
step acid guanidinium thiocyanate phenol chloroform 
method. RNA was size-fractionated on 1.2% agarose/1.8 
mol/L formaldehyde gels, electrotransf erred onto nylon 
membranes, and cross-linked by UV irradiation. Blots 
were prehybridized and. hybridized with cDNA probes 
and washed under high stringency conditions. The fol- 
lowing cDNA probes were used: a 979 bp human Id-1 
cDNA probe, a 440-bp human ld-2 cDNA probe, and a 
450-bp human ld-3 cDNA probe, covering the entire 
coding regions of Id-1. ld-2, and ld-3, respectively. A 
BamHI 190-bp fragment of mouse 7S cDNA that. hybrid- 
izes with human cytoplasmic RNA was used to confirm 
equal RNA loading and transfer. Blots were then exposed 
at -80°C to Kodak BioMax-MS films and the resulting 
autoradiography were scanned to quantify the intensity of 
the radiographic bands. 26 - 28 For each sample the ratio of 
!d mRNA expression to 7S expression was calculated. To 
compare the relative increase in expression ol the re- 
spective Id mRNA species in the cancer and CP sam- 
ples, the same normal samples were used lor normal/ 
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Figure 1. mRNA expression of J cM, ld-2, and Jd-3 in pancreatic cancer and 
chronic pancreatitis. Tola) RNA (20 jig/lane) from six noimnl, eighi cancer- 
ous, and seven chronic pancreatitis tissue samples were; subjected to North- 
em bio* analysis using 37 P-bbeled cDNA probes (500.000 cphVmJ) specific 
for Jd-1, ld-2, and Jcl-3. respectively. A 7S cDNA probe (50,000 cpm/ml) was 
used as a loading and transfer control. Exposure times of the normal/cancer 
blots were 1 day for all Id probes, and 2 dap for the normal/CP blots. 
Exposure time was 4 boors for mouse 7S cDNA. By comparison with the 
normal samples, id-1 and ld-3 mRNA levels were elevated. in 8 and 9 cancer 
Samples, respectively, whereas ld-2 was, elevated in 6 cancer samples. 

cancer and normal/CP membranes, the median score lor 
Id-1, ld-2. and ld-3 mRNA levels in these normal samples 
was set to 100. Statistical analysis was performed with 
SigmaStat software (Jandel Scientific, San Raphael. CA). 
The rank sum test was used, and P < 0.05 was taken as 
the level of significance. 

Ceil Culture and Western Blot Analysis 

PANC-1. MlA-PaCa-2, ASPC-1, and CAPAN-1 human 
pancreatic cetl lines were obtained from ATCC (Manas- 
sas, VA). C01O357 human pancreatic cells were a gilt 
from Dr. R.-S. Metzger (Durham, NC). Cells were routinely 
grown in DMEM (COLO-357, MIA-PaCa-2. PANC-1) or 
FtPMl (ASPC-1. -CAPAN-1) supplemented with 10% fetal 
bovine serum. 100 U/ml penicillin, and 100 ^g/m\ strep- 
tomycin. Foi immunoblol analysis, exponenlially growing 
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' Figure 7. Dea-silomeinc analysis of Northern Wots- A mora digraphs of 
Northern bids from 12 normal. 1 4 CP, and )6 pancreatic cancers were 
analyzed by densitometry. mUNA levels were determined by calculating ihe 
ratio of Ihe optical density for. the respective Jd mRNA species in relation to 
. the optical dtnsiry of moose 7S cDNA. To compare the relative increase in 
expression of the respective Id mRNA species in ihe cancer and CP. samples, 
the same normal samples were used for normal/cancer and normal/CP 
membranes. Normal pancreatic tissues are indicated by circles, CP tissues bv 
triangles, and cancer tissues by squares. Data a JC expressed as median 
scores i SI). By comparison viih the normal samples, only the cancer 
samples exhibited significant increases: 6..5-fold {P < 0.0)) lor Id-], fivefold 
(P < 0.0)) for Id-?, and twofold IP = 0.027) for ld-3 



cells (60-70% confluent) were soJubilized in lysis buffer 
containing 50 mmol/L Tris-HQ. pH 7.4. 150 mrnOl/L NaCI; 
1 rnmol/L EDTA, 1 ;tg/ml peps latin A, 1 mrnol/L phenyl- 
. rnelhylsullonyl Muoride (PMSF), and 1% Triton X-100. Pro- 
teins were subjected to sodium dodecyl sulfate polyacryl- 



Figure 3. Id mRNA and protein expression in pancreatic oncer cell lines. 
Upper panels:. Total UNA (20 jig/lane) from > pancreatic cancer cell lines 
were subjected to Northern blot analysis using 37 P- labeled cONA probes 
(500,000 cpm/ml) specific Tor ld-1, )d-2, and ld-3, respectively. Exposure 
times were ) day (or all Jd probes. Lower panels: Immunoblotting. Cell 
lysates (30 /ig/lane) were subjected to SDS-PAGE. Membranes were probed 
wiih specific Id-), Jd-2, and ld-3 antibodies, Visualization was performed by 
enhanced cbemiluminesceoce. 



amide gel electrophoresis (SDS-PAGE), transferred to 
ImmobiJon P membranes, and incubated for 90 minutes 
with the indicated antibodies and for 60 minutes with 
secondary antibodies against rabbit fgG. Visuali2ation 
was performed by enhanced chemiluminescence. 



immunohisiochemtsiry 

Specific rabbit anti-human Id- 1 (C-20). ld-2{C-20). and 
Id- 3 (C-20. all from Santa Cruz Biotechnology, Santa 
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Figure 4. Norma) and canceious pancreatic tissues were subjected to immu- 
ncttlatning using highly specific anti-ld-1 (A-C), anti-Id- 2 (D-F). and ami-»ti-3 
(G-D antibodies as desciibcd in the Methods section. Motleraie to strong Jd- 1 
imrmiDOreactiviry ttos present in ihe cytoplasm of duel-like cancer cells (A 
and C, left panel). In the normal pnnoeas ilieie was weak id-1 immunorc- 
aciK'ity in the ductal cells (B). Piesbsorprion with ihe fd-Kspecific blocking 
peptide abolished the Id- J immunoreactiviry (C, right panel). Suong ld-2 
immuncweaciiviry was observed in the cytoplasm of the cancer cells that 
exhibited duct-like structures t.D and F. left panel), wheieas in the normal 
pancreas, there was'only weak ld-2 immunoi e-JCiiviry in the ductal cells (E). 
Pieabsorption with ihe Id- 2- specific blocking peptide abolished ihe ld-2 
imimmoreaciivtiy (F. ligh! panel). Moderate to strong ld-3 immimoreaciivity' 
was present in the duci-lifce oncer cells (G .mil I, left panel). Moderate to 
Strong ld-3 inirrmnoreaciiviiy was al.to preseni in ihe duci:»l t ells of normal 
pancreatic tissue samples (H). ld-3 immunoieacirv ity wo.s completely abol- 
ished by pieabsoiption wiih il>e kl-3 sjjeolic blocking peptide (I. fight 
panel). A, D, and G constitute seiial sections of a pancreatic cancer sample, 
revealing coexpression of the three Id proteins. Scale bats. 25 pm. 

Cruz, CA) polyclonal antibodies were used foi immunhis- 
tochemistry. These affinity-purified rabbit polyclonal anti- 
bodies specifically react with ld-1, ld-2, and ld-3, respec- 
tively, of human origin, as detei mined by Western 
blotting. Paraffin-embedded sections (4 j_tm) were sub- 
jected to immunostaining using the streptavidin- peroxi- 
dase technique. Where indicated, imrnunostaining for all 
three td proteins was performed on serial sections. En- 



' dogenous peroxidase activity was blocked by incubation 
for 30 minutes with 0.3% hydrogen peroxide in methanol. 
Tissue sections were incubated for 15 minutes "(23°C) 
with 10% normal goat serum and then incubated for 16 
hours at 4°C with the indicated antibodies in PBS con- 

. taining 1% bovine serum albumin. Bound antibodies 
were detected with biotinylated goat anti-rabbit IgG sec- 
ondary antibodies and slreptavidin-peroxidase complex^ 
using diaxninobenzidine tetrahydrochloride as the sub- 
strate. Sections were counlerstained with Mayer's hema- 
toxylin. Preabsorptipn with ld-1-, ld-2-, or ld-3-specific 
blocking peptides completely abolished immunoreactiv- 
ity of Ihe respective primary antibody- The immunohisto- 
chemical results were semiquanlitatively analyzed as de- 
scribed previously. 29 ; 30 The percentage ol positive 
cancer ceils was stratified into four groups: 0, no cancer 
cells exhibiting immunoreactivity; 1, <33% of the cancer 
cells exhibiting, immunoreactivity; 2, 33 to 67% of the 
cancer cells exhibiting immunoreactivity; 3 >67% of the 
cancer cells exhibiting immunoreactivity. The intensity of 
the immunohistochemical signal, was also stratified into . 
four groups: 0, no immunoreactivity; 1, weak immunore- 
activity; 2, moderate immunoreactivity; 3, strong immu- 
noreactivity. Finally, the sum of Ihe results of the cell- 
score and the intensity score was calculated. Statistical 
analysis was performed with SigmaStat software. The 
rank sum test was used, and P < 0.05 was taken as the 
level of significance. 



Results 

Northern blot analysis of total RNA isolated from 12 nor- 
mal pancreatic tissues and 16 pancreatic cancers re- 
vealed the presence of Ihe 1.2-kb ld-1 transcript and the 
1.6rkb Id2 mRNA transcript in 11 of the 12.norma! pan- 
creatic samples, and the 1.3-kb ld-3 mRNA transcript in 
all normal pancreatic samples (Figure 1A, 2). In the can- 
cer tissues, ld-1 mRNA levels were elevated in 8 of 16 
samples, ld-2 mRNA levels were elevated in 9 of these 
samples, and ld-3 mRNA levels were elevated in 6 of 
these samples (Figure 1A. 2). Concomitant overexpres- 
sion of all three Id species was observed in 6 of the 
cancer samples (38%). In contrast, none of the Id mRNA 
species were overexpressed in CP by comparison with 
normal controls (Figure 1B, 2). Densitometry analysis ol 
all of the autoradiograms indicated that there was a 6.5- 
fold increase (P < 0.01) in ld-1 mRNA levels, a fivefold 
increase (P < 0.01) in id-2 mRNA levels, and a twofold 
increase (P = 0.027) in ld-3 mRNA levels in the pancre- 
atic, cancer samples in comparison to normal controls 
(Figure 2). In contrast, there was no statistically signifi- 
cant difference in the expression levels of ld-1, ld-2. and 
ld-3. in CP tissues in comparison to the corresponding 
levels in the normal pancreas {Figure 2). 

Next, we assessed the expression of the three id 
genes in 5 human pancreatic cancer celt lines by North- 
ern and Western blot analyses. Id- 1 mRNA was present 
at varying levels in all. 5 ceil lines (Figure 3). ASPC-1. 
CAPAN-1. MIA-PaCa-2, and PANC-1 expressed moder- 
ate to high levels ol Id- 1 mRNA, whereas COLO- 357 cells 
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expressed relatively low levels of this mRNA moiety. 
Weslern blotting wilh a highly specific ami- Id- 1 antibody 
confirmed the presence of Ihe approximately 14-kd ld-1 
protein in the 4 cell lines that expressed high levels of 
ld-1 mRNA (Figure 3). Furthermore, the three eel) lines 
with Ihe highest .ld-1 mRNA expression (CAPAN-1, MIA- 
PaCa-2, and PANC-1) also exhibited Ihe highest ld-1 
protein expression. Variable levels of the 1.6-kb ld-2 
mRNA transcript were present in all 5 cell lines. In addi- 
tion, a minor band of approximately 1.2 kb was visible in 
COLO-357 arid MIA-PaCa-2 cells. Immunoblot analysis 
with a highly specific anti-id^2 antibody revealed two 
bands of approximately 16 and 18 kd at relatively high 
levels in all of the cell lines with exception of PANC-1 
cells, in which the 16-kd band was relatively fain! (Figure 
.3). With ihe exception of MIA-PaCa-2 cells, there was a 
good correlation between ld-2 mRNA and protein levels 
(Figure 3). ld-3 mRNA was present at high levels in 
MIA-PaCa-2 cells, at moderate levels in COLO-357 cells, 
and at low levels in PANC-1 cells, ld-3 mRNA was not 
detectable in ASPC-1 and CAPAN-1 cells (Figure 3). 
Immunoblol analysis wilh a highly specific anti-ld-3 anti- 
body revealed an approximately 14-kd band thai was most 
abundanl in MIA-PaCa-2 cells, and was also readily appar- 
ent in COLO-357 and PANC-1 cells. In contrast, only a faint 
ld-3 band was seen in ASPC-1 and CAPAN-1 cells. Thus, 
with (he exception of PANC-1 cells, there was a good cor- 
relation between ld-3 mRNA and protein levels. 



To determine the localization of ld-1, ld-2, and ld-3, 
immunostaining was carried out using the same highly 
specific anti-Id antibodies. In the pancreatic cancers, 
moderate to strong ld-1 immunoreactivity was present in 
the cancer cells in 9 of 10 Randomly selected cancer 
samples. An example of moderate ld-1 immunoreactivity 
is shown in Figure 4A. and of strong immunoreactivity in 
Figure 4C (left pane!). In contrast, in the normal pancreas, 
fainl ld : 1 immunoreactivity was present only in the ductal 
cells ol pancreatic ducts (Figure 4B, arrowheads). Pre- 
absorption wilh the ld-1-specific blocking peptide com- 
pletely abolished the immunoreactivity (Figure 4C. 
righl panel). The cancer cells also exhibited strong ld-2 
(Figure 4, D and F, lell panel) and moderate to strong ld-3 
immunoreactivily. An example of moderate ld-3 immuno- 
reactivity is shown in Figure 4G. and of strong immuno- 
reactivity in Figure 41 (lefl panel). In contrast, only faint 
ld-2 immunoreactivity was present in the ductal cells in 
the normal pancreas (Figure 4E), whereas ld-3 immuno- 
reactivity in these ceils was more variable and ranged 
from moderate to occasionally strong (Figure 4H). Islet 
cells and acinar cells were always devoid ol Id immunore- 
activity. Preabsorption ol !he respective antibody with the 
blocking peptides specific tor ld-2 {Figure 4F, righl panel) 
and id 3 (Figure 41. right panel) completely abolished in> 
rnurxweaciivity. Analysis ol serial pancreatic cancer sec- 
tions teveafed thai there was otlen colocateation ol the 
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Figure 6. Jnmwnohistocheinislry of atypical papillary epithelium in CP tissues. Serial seciion analysis of some CP samples revealed the presence of large dnct-like 
Stmctores wjih atypical pnpillaxy epithelium. Mild to moderate Id- J (A) and ld-2 (B) immnnoreactiviry and weak ld-3 (C) immunoreucliviiy was present in the 
cytoplasm of the cells forming these large ducts with papillary structures. Some CP samples also exhibited moderate Id-3 immuooieactivity in ihese Cells (D). Sole 
bar, 25 pirn. 



three Id proteins. An example of serial sections from a 
pancreatic cancer tissue is shown in Figure 4, A, D, and G. 

ld-1. ld-2, and id-3 immunoreactivity was also present 
at moderate levels in the cyloplasm of ductal cells within 
CP-like areas adjacent to the cancer cells (Figure, 5, A-C). 
As in the normal pancreas, islet cells (outlined by arrow- 
heads) did not exhibit Id immunoreactivity. In 4 of 9 CP 
samples, there were loci of ductal cell dysplasia of rela- 
tively large interlobular ducts, all ol which exhibited mod- 
erate to strong ld-1, ld-2, and ld-3 immunoreactivity (Fig- 
ure 5, D-F). Five of 9 CP samples also contained foci of 
large ducts exhibiting atypical papillary epithelium. Serial 
section analysis of one of those CP samples revealed 
mild to moderate ld-1 and ld-2 immunoreactivity arid 
weak lcf-3 immunoreactivity in the cells of these atypical 
papillary ducts (Figure 6, A-C). In contrast, in some ot 
these CP samples, moderate to strong ld-3 immunoreac- 
tivity was also observed (Figure 6D). However, most of 
the ductal cells forming the typical ductular structures ol 
CP, such as large interlobular ducts and small proliferat- 
ing ducts, exhtbiied generally only weak to occasionally 
moderate Id immunoreactivity (data not shown). 



The immunohislochemical data tor ld-1. ld-2, and ld-3 
are summarized in Table 1. In the case of ld-1 and ld-2. 
the cancer celts as well as the dysplastic and atypical 
papillary ducts in CP exhibited a significantly higher 
score than the ductal cells in the normal pancreas. In 
contrast; due to the marked variability in ld-3 immuno- 
staining in the normal pancreas, the differences between 
normal and cancer cells and normal and dysplastic cells 
did not achieve statistical significance. 



Discussion 

Id proteins constitule a family of HLH transcription factors 
that are imporlant regulators of cellular dilferenliation and 
proliferation. 1 - 2 To date, four members of the human Id 
family have been identified. Their expression is 

enhanced during cellular proliferation and in response to 
mitogenic stimuli, 1931 and overexpression ol Id genes 
inhibits differentiation and/or enhances proliferation in 
several different cell types. 15 3? 3y1 7he lorced expression 
ot ld-1 in mouse small intesiinal epithelium results in 
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Table 1. Histological Scoring 



ld-1 



ld-2 



ld-3 



Normal [n = 6) 
Cancer \n = 10) 
CP {n = 9) 



Ductal cells 
Cancer cells 
' TypicaJ CP lesions {n - 9) 
Dysplastic ducts (n = 4) 
Atypical papillary ducts (n = 5). 



2.0 ± 0.4 
4.5* ± 0.5 

Z7 ± 0.5 
5.3* ± 0.2 
4.4* ± 0.2 



2.3 ± 0.2 
5.2 s ± 0.3 

3.1 ± 0.6 
5.8* ± 0.2 
5.2* t 0.2 



2.5 ± 0.9 
4.5 ± 0.6 
3.4 ± 0.7 
5.3 ± 0.4 
5.0 ± 0.4 



Scoring of the histological specimens was pet formed as described in the Patients and Methods section. Values are the means - SO of the number 
ol samples indicated in parenthesis. P values are based on comparisons with the respectfve controls in the normal samples. 
\ P < 0.02; f P.< 0.01; *P = 0.004; 5 /> = 0.001. 



adenoma formation in these animals. 35 The growth-pro- 
moting effects of Id genes are thought to occur through 
several mechanisms. For example'; ld-2 can bind to mem- 
bers of the pRB tumor suppressor family, thus blocking 
their growth-suppressing activity, 20 - 21 and ld-1 and ld-2 * 
can antagonize the bHLH-rhediated activation of known 
inhibitors of cell cycle progression such as the cyclin- 
dependent kinase inhibitor P21. 23 

In the present study; we determined by Northern blot 
analysis that a significant percentage of human pancre- 
atic cancers expressed increased ld-1. Id-2. and ld-3 
mRNA levels. Increased expression was most evident for 
ld-1 (6.5-fold) and ld-2 (fivefold). In contrast, ld-3 mRNA . 
levels were only twofold increased in the cancer samples, 
partly because this mRNA was present at relatively high 
levels in the normal pancreas. Immunhistochemical anal- 
ysis confirmed the presence of ld-1. ld-2, and ld-3 in the 
cancer cells within the tumor mass, whereas in the normal 
pancreas taint ld-1 and ld-2 immunoreaclivity and mod- 
erate to occasionally strong ld-3 immunoreactivity was 
present in some ductal cells. Pancreatic acinar and islet 
cells in the normal pancreas were devoid of ld-1. ld-2. 
and ld-3 immunoreactivity. In the cancer samples, all 
three Id proteins often colocalized in ine cancer cells. 
Coexpression of all three Id genes was also observed in 
cultured pancreatic cancer cell lines, which often exhib- 
ited a close correlation between Id mRNA and protein 
expression. However, in MIA-PaCa-2 there was a diver- 
gence of ld-2 mRNA and protein levels, and in PANC-1 
cells, ld-3 mRNA levels did not correlate well with ld-3 
protein expression. These observations suggest that in 
these cells, the halt-life of either Id mRNA or Id protein 
may be altered by comparison with the other cell lines. 
Interestingly, ld-2 immunoblotting revealed two closely 
spaced bands of approximately 16 and 18 kd in 4 of 5 
cell lines. In view ol the lact that two possible initiation 
codons have been reported tor the ld-2 gene. 36 our 
observation raises the possibility (hat the two ld-2-immu- 
noreactive bands may represent separate translation 
products of the ld-2 gene. 

Pancreatic cancers often harbor p53 tumor suppressor 
gene mutations 37 and exhibit alterations in apoptosis 
pathways. Thus, these cancers often exhibit increased 
expression of anti-apoptotic proteins such as Bcl-2 39 and 
abnormal resistance to Fas- tigand- mediated apopto- 
sis. 39 It has been shown recently that forced, constitutive 
expression of Id genes together with the expression ot 
anti-apoptotic genes such as Bcl^2 or BclX, can result in 



malignant transformation of human fibroblasts, 11 raising 
the possibility that the enhanced Id expression in pan- 
creatic cancers together with increased expression of 
anti-apoptotic genes may contribute to the malignant 
potential of pancreatic cancer cells inyfvo. 

In the CP tissues there was no significant increase in 
ld-1, ld-2, and ld-3 mRNA levels in comparison to the 
normal pancreas. Immunohistochemical analysis of pan- 
creatic, cancer samples revealed colocalization of weak 
to moderate ld-1, ld-2. and ld-3 immunoreactivity in pro- 
liferating ductal cells in the CP-like regions adjacent to 
the cancer cells, indicating that Id expression was not 
restricted to the cancer cells. Similarly, analysis of CP 
samples indicated weak ld-1, ld-2. and ld-3 immunore- 
activity in the cells of small proliferating ducts and large 
* ducts without dysplastic changes. In general, Ihete was a 
correlation between weak immunoreactivity and low Id 
mRNA levels. However, in samples that harbored large 
ducts with papillary structures there was moderate Id 
immunoreactivity, and in the cells forming dysplastic 
ducts there was moderate to strong Id immunoreactivity. 
In these CP samples. Id mRNA levels were relatively 
higher than in the CP samples that were devoid of these 
histological changes. Overall, however, increased Id ex- 
pression, most notably of ld-1 and ld-2, distinguished a 
subgroup of pancreatic cancers from CP (Table 1). 

Epidemiological studies have shown that the risk ol 
developing pancreatic cancer is increased up to 16-fold 
in patients with pre-existing CP in comparison to the 
general population. 40 The mechanisms that contribute to 
neoplastic transformation in CP are not known. Although 
there is no established tumor progression model for pan- . 
creatic cancer, such as the adenoma-carcinoma se- 
quence of colorectal carcinoma, 41 it is generally ac- 
cepted, that K-ras and p16 mutations occur relatively 
early in pancreatic carcinogenesis, whereas p53 muta- 
tions occur late in this process. 37 - 41 " " 3 Increased Id ex- 
pression may contribute to malignant transformation ot 
cultured cell lines in vitro- 1 and has been linked to cell 
invasion in a murine mammary epithelial cell line. 44 In 
view of the current findings that ld-1. ld-2, and ld-3 are 
overexpressed in pancreatic cancer and in dysplastic/ 
metaplastic ducts in CP. these observations raise the 
possibility that elevated levels of ld-1, td-2. and. to a 
lesser extent, ld-3 may represent relatively early markets 
ol pancreatic malignant transformation and may contrib- 
ute to the palhobiology ot pancreatic cancer. 
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c-fos and estrogen receptor gene expression pattern in the rat uterine 
epithelium during the estrous cycle. 

Mendo 2a -Rodr,> ,.e7.rA Mcrchant-La rios IT , Segura-Valdez MT. 
MendozaN .CruzME, Arteaga-Lopez P Camacho-Arrovo 1 Donu^zlT Cerbon 

Facultad de Quimica, Universidad Nacional Autononia de Mexico, Ciudad Universitaria 
Coyoacan 04510, Mexico, D:F, Mexico. uruversitana, 

nwTT' f^^ ^ °^ ectomized « fr ?8m treated animals support the idea that c-fos 
K£ * th ? pr ? hferat,on of uterine epithelial cells. However, these studies invite us 
Strode £S : P ^1 ^r f0S " .^es of the endometrium during the 

™T 7pS PrCSent StUdy W3S to ^"nine the e-fos and estrogen 

receptor (ER) gene expression pattern in the rat uterine epithelium during the estrous 
cycle in whidumtural and eyelid changes of steroid hormones occur, andcorrelate these 
changesAvith the proliferation status of this cellular types. Proliferation was assessed 
during the estrous cycle using bromodeoxyuridine incorporation to DNA. ERalpha and 
beta proteins were assessed by immunohistochemistry. The regulation of c-fos gene 
express.on in the uterus of intact animals during the estrous cycle was evaluated using 
both n situ hybridization and immunohistochemistry. Estradiol (E(2)) and progesterone 

Inili? T f Tf 3886886(1 by ^MmmwioMsay. The results indicated that . 
luminal (LE)^ and glandular epithelia (GE) presented maximal proliferation during the 
metestrus (M) and the diestrus (D) days. However, during the proestrus (P) day only LE 
presented proliferation, and during the estrus (E) day only the stromal cells proliferated 
A marked .mmunostaming for ERalpha was detected in both LE and GE cells during the 

un1et£7 H d r iniSh6d ° n ^ P md E da * In ™^ ERbetawas 

undetectable ,n both epitheha during all stages of the cycle. The highest c-fos mRNA 

level was detected in both epithelia on the M day, followed by a sigVuficant reduction 

dunng the other days of the cycle. The highest protein content was observed on the M 

and D days, and the minimal value was detected on the E day. The c-Fos protein level in 

LE was increased during M and D days, presenting a high correlation with the cellular 

pro iferat.on pattern of this cell type. In conclusion, the overall results indicate that c-Fos 

protein presented a good correlation with uterine epithelial cell proliferation of LE. In the 

found. Both m LE and GE, c-fps mRNA did not strictly correlate with its protein levels 
c-tos seems to have a postranscriptional regulation in uterine epithelial cells during the ' 
rats estrous cycle..Copyright 2003 Wiley-Liss, Inc. 
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The distributions of [3H]MK-801 binding and the NMDA NR1 subunit mRNA were 
studied using receptor autoradiography and in-situ hybridization in rat and human brain 
whole-hemisphere coronal sections. Receptor protein detected by radioligand 
autoradiography and the mRNA for the key subunit of the receptor presented similar 
distributions in the forebrain, with a few areas showing an imbalance between the levels 
of mRNA and receptor protein. Human frontal cortex showed a relative abundance of 
NMDARl mRNA as compared to [3H]MK;-80 I binding. The same area in rat brain did 
not show any difference in the two distributions. In comparison, the rat claustrum 
presented a relative excess of NMDARl mRNA which was not'detected in human 
sections. Human caudate nucleus exhibited relatively high levels of [3HJMK-801 binding 
that were unmatched in rat caudate. The hippocampi of either species presented similar 
levels of [3H]MK-801 binding and NMDARl mRNA, but when the two signals were 
measured in specific subfields of the hippocampal formation, the differential distribution 
of the two signals reflected the anatomy of hippocampal connections assuming a 
preferential dendritic distribution for MK-80I binding. Interestingly, rat and human 
hippocampi also showed some important species-dependent difference in the relative 
distribution of the receptor protein and mRNA. The data presented show an overall good 
correlation between the mRNA for the key subunit of the NMD A receptor and the 
functional receptor detected with radioligand binding and highlight the presence of local 
differences in their ratio. This may reflect different splicing of the mRNA for the 
NMDARl subunit in specific brain areas of rat and human. The species-dependent 
differences in die relative distribution of the mRNA for the key subunit of the NMDA 
receptor and that of a marker of functional receptors also highlights important differences 
in the NMDA function in rat and human brain. 
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Abstract 

Translation initiation is regulated m response to 
nutrient availability and myogenic stimulation and is 
coupled with ceB cycle progression and cell growth. 
Several alterations in translational control occur in 
cancer. Variant mRNA sequences can alter the 
translational efficiency of individual mRNA molecules, 
which in turn play a role in cancer biology. Changes irT 
the expression or availability of components of the 
translational machinery and in the actuation of 
translation through signal transduction pathways can 
lead to more global changes, such as an increase in 
the overall rate of protein synthesis and translational 
activation of the mRNA molecules involved in cell 
growth and proliferation. We review the basic 
principles of translational control, the alterations 
encountered in cancer, and selected therapies 
targeting translation initiation to help elucidate new 
therapeutic avenues. - 

Introduction 

The fundamental principle of molecular therapeutics in can- 
cer is to exploit the differences in gene expression between 
cancer cells and norma) cells. With the advent of cDNA array 
technology, most efforts have concentrated on identifying 
differences in gene expression at the level of mRNA, which 
can be attributable either to DNA amplification or to differ- 
ences in transcription. Gene expression is quite complicated, 
however, and is also regulated at the level of mRNA stabi5ry, 
mRNA translation, and protein stabi&ty. 

The power of transitional regulation has been best recog- 
nized among developmental biologists^ because transcription . 
does not occur in early embryogenesis in eukaryotes. For ex- 
• ample, in Xenopus, the period of transcriptional quiescence 
continues until the embryo reaches midbJastula transition, the 
4000-cell stage. Therefore, all necessary mRNA molecules are 
transcribed during oogenesis and stockpiled in a translalionalry 
inactive, masked form. The mRNA are translalionalry activated 
at appropriate times during oocyte maturation, fertilization, and 
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early embryogenesis and thus, are under strict fractional 
control. 

Translation has an established role in cell growth. Basi- 
cally, an increase in protein synthesis occurs as a conse- 
quence of mitogenesis. Until recently, however, little was 
known about the alterations in mRNA translation in cancer, 
and much is yet to be discovered about their role in the 
development and progression of cancer. Here we review the 
basic principles of translational control, the alterations en- 
countered in cancer, and selected therapies targeting transla- . 
lion nnitiatiort to elucidate potential new therapeutic avenues. 

Basic Principles of Translational Control 
Mechanism of Translation initiation 
Translation initiation is the main step in translational regulation. 
Translation initiation is a complex process in which the initiator 
tRNA and the 40S and 60S ribosomal subunits are recruited to 
the "5* end of a mRNA molecule and assembled by eukaryotic 
translation initiation factors into an 80S ribosome at the start 
codon of the mRNA {fig. t). The 5* end of eukaryotic mRNA is 
capped, i.e., contains the cap structure m 7 GpppN (7-methyh 
guarKJstnr>tj'tphospho-5'-f]TjoniJcleosjdeX Most translation in 
eukaryotes occurs in a cap-dependent fashion, /.&, the rap is 
specifically recognized by the elF4E. 3 which binds the 5' cap. 
The eIRF translation initiation complex is then formed by the 
assembly of e(F4E, the RNA be0case e)F4A, and e)F4G, a 
scaffolding protein that mediates the binding of the 4 OS ribo- 
somal subunit to the mRNA molecule through interaction with 
the elF3 protein present on the 40S ribosome. e!F4A and elF4B 
participate in melting the secondary structure of the 5'UTR of 
the mRNA. The 43S initiation complex (40S/elF2/Met-tRIW 
GTP complex) scans the mRNA in a 5'-»3' direction until ft 
encounters an AUG start codon. This start codon is then base- 
paired to the anticodort of initiator tRNA, forming the 48S initi- 
ation complex- The initiation factors are then displaced from the 
4BS complex, and the 60S ribosome joins to form the 80S 
ribosome. 

Urifike most eukaryotic translation, translation initiation of 
certain rnFWAs, such as the picomavirus RNA, is cap inde- 
pendent and occurs by internal ribosome entry. This mecha- 
nism does not require e!F4E. Either the 43S complex can bind 
the initiation codon directly through interaction with the IRES in 
the 5' UTR such as in the erx^phatomyocarditis virus, or it can - 



The abbrevblions used are: eJME. euXaryolic initiation factor At; ITTR, 
untranslated Fegxirr, WES, internal ribosome entry site; 4E-BP1, eukaryolic 
initiation faclor 4E-binding protein 1; S6K. ribosomal p70 S6 Kinase; mTOR. ' 
mammafian target ot rapamycirr. ATM, ataxia telangiectasia mutated; PDk[ 
phosphatkJyfinosilol 3- kinase; PT£N phosphatase and tensin homolwj de-' 
feted from Chronxttome 10. PP2A, proiem phosphatase 2A; 7GF-p3. trans- 
forming growth tactor-03; PAP. porylA) polymerase; EPA, cfeosopentoenac 
X3&,rrx±^-7, mctenoma dirtereiittalion- associated gene 7. 



L613 95? 9303 ' DURA 7 JQH t^ m ^ } -no.-.n, 



. * MAY 25 2006 15:39 FR C.I . JCIST 

72 Translation Initiation tn Cancer 



613 952 9303 TO = 03246606 



P. 04 




BiiKfing of eIF4F complex lo 
mRNA 5 y cap 



AUG 



Unwinding of secondary structure 
and formation of 4S5 




Scanning, 

release of initiation factors, 
SOS formation 



F>9- J- Translation initiation in eukaryoies. The 4E-BPs are hyperphos- 
phorytated to release e)F4E so that it can interact with the 5* cap, and the 
cIF4F initiation complex is assembled. The interaction ol poty(A) binding 
protein with the initiation complex and circirlarizalion ol the mRNA is not 
depicted in the diagram. The secondary structure ol the 5' UTR is melted, 
the 4 OS ribosomal subonit is bound to elF3, and the ternary complex 
consisting of e\F2, OTP. and the MeMRNA are recruited to the mRNA. The 
ribosome scans the mRNA in a 5'->3' direction until an AUG start codon 
is found in the appropriate sequence context The initiation factors are 
released, and the large ribosomal subunii is recruited. 



mitia&y attach to the IRES and then reach the initiation codon by 
scanning or transfer, as is the case with the pofiovirus (1). 

Regulation of Translation Initiation 
Translation initiation can be regulated by alterations in the 
expression or phosphorylation status of the various factors 
involved. Key components in translations! regulation that 
may provide potential therapeutic targets follow. 

c(F4E. elF4E plays a central role in translation regulation. 
It is the least abundant of the initiation factors and is con- 
sidered the rate- limiting component for initiation of cap- 
dependent translation. elF4E may also be involved in mRNA 
splicing, mRNA 3' 'processing, and mRNA nucteocytoplas- 
mic transport {2). eIRE expression can be increased at the 
transcriptional level in response to serum or growth factors 
(3). elRE overexpression may cause preferential translation 
of mRNAs containing excessive secondary structure in their 
5' UTR that are normally discriminated against by the trans- 



rational machinery and thus are inefficiently translated (4 -7). 
As examples of this, overexpression of elF4E promotes in- 
creased translation of vascular endothelial growth factor, 
fibroblast growth factor-2, and cyclin D1 (2, 8, 9). 

Another mechanism of control is the regulation of e(F4E 
phosphorylation. elF4E phosphorylation is mediated by the 
mHc^er^activated protein kinase-interacting kinase 1, which 
is activated by the mitogen- actrvated pathway, activating 
extracellular signal-related kinases and the stress-activated 
pathway acting through p38 mitogen-activated protein ki- 
nase (10-13). Several mitogens, such as serum, platelet- 
derived growth factor, epidermal growth factor, insulin, 
angiotensin II, sre kinase bverexpression, and ras over- 
expression, lead to elF4E phosphorylation (14). The phos- 
phorylation status of elF4E is usually correlated with the 
transitional rate and growth status of the ceft; however, 
elF4E phosphorylation has also been observed In response 
to some cellular stresses when translational rates actually 
decrease (15). Thus, further study is needed to understand 
the effects of elF4E phosphorylation on elF4E activity. 

Another mechanism of regulation is the alteration of elF4E 
availability by the binding of elF4E to the eIF4E-binding pro- 
teins (4E-BP, also known as PHAS-Q. 4E-BPs compete with 
eIF46 for a binding site in elF4E. The binding of e)F4E to the 
best characterized elF4E-blnding protein, 4E-BP1, is reguv 
lated by 4E-BP1 phosphorylation. Hypophosphorylated 4E- 
BP1 binds to elF4E, whereas 4E-BP1 hy phosphorylation 
decreases this binding. Insulin, angiotensin, epidermal 
growth factor, platelet-derived growth factor, hepatocyte 
growth factor, nerve growth factor, insulin-like growth factors 
I and H, interleukin 3. granulocyte-maCTophage colony-stim- 
ulating factor + sleel factor, gastrin, and the adenovirus have 
all been reported to induce phosphorylation of 4E-BP1 and 
to. decrease the ability of 4E-BP1 to bind elF4E (15, 16). 
Conversely, deprivation of nutrients or growth factors result*. . 
in 4E-BP1 dephosphorylation, an increase in elF4E binding, 
and a decrease in cap-dependent translation. 

p70 S6 Kinase. Phosphorylation of ribosomal 40S protein 
S6 by S6K is thought to play an important role in translational 
regulation. S6K -/- mouse embryonic cells proliferate more 
slowly than do parental cells, demonstrating that S6K has a 
positive influence on ceH proliferation (17). S6K regulates the 
translation of a group of mRNAs possessing a 5' terminal 
ofig^pyrimidine tract (5' TOP) tound at the 5' UTR of ribosomal 
proteh mRNAs and other mRNAs coding tor components of 
the translational machinery. Phosphorylation of S6K is regu- 
lated in part based on the availability of nutrients (1 8, 19) and is 
stimulated by several growth factors, such as platelet-derived 
growth factor and insulin-like growth factor I (20). 

elF2a Phosphorylation. The binding of the initiator tRNA 
to the small ribosomal unit is mediated by translation initia- 
tion (actor etF2. Phosphorylation of the tr-subunit ol eJF2 
prevents formation ol the elF2VGTP/Met-tRJMA complex and 
inhibits global protein synthesis (21, 22). elF2a is phospho- 
rylated under a variety ol conditions, such as viral infection, 
nutrient deprivation, heme deprivation, and apoptosis (22). 
elF2£r is phospbory feted by heme- regulated inhibitor, nutrient- 
regulated protein kinase, and the IFN-induced. double- 
stranded RNA-activated protein kinase (PKR; Ref. 23). 
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The mTOR Signaling Pathway. The macrofide antibiotic 
raparnycin (SiraHmus; WyettvAyerst Research, CoHegeville, 
PA) has been the subject of intensive study because it in- 
hibits signal transduction pathways involved in T-ceff activa- 
tion. The raparnycin- sensitive component of these pathways 
is mTOR (also called FRAP or RAFT1). mTOR is the mam- 
malian homologue of the yeast TOR proteins that regulate 6, 
progression and translation in response to nutrient avaBabi^ 
ity (24). mTOR is a serine-threonine kinase that modulates 
translation initiation by altering the phosphorylation status of 
4E r BP1 and S6K (Fig. 2; Ref. 25). 
4E-BP1 is phosphorytated on multiple residues. mTOR phos- 
. pborytetes the Thr-37 and Thr-46 residues of 4E-BP1 m vftm 
(26); however, phosphorylation at these sites is not associated 
with a loss of eJF4E binding. Phosphorytation of Thr-37 and 
- Thr-46 is required for subsequent rjhospbcrylation at several 
COOB- terminal, serurn-sensrtive sites; a combination of these 
phosphorylation events appears to be needed to inhibit the 
binding of 4E-BP1 to elF4E (25). The product of the>l7M gene, 
P38/MSK1 pathway, and protein, kinase Ca also play a rote iri 
4E-BP1 phosphorylation (27-29}. 

S6K and 4E-BP1 are also regulated, in part; by PI3K and its 
downstream protein kinase Akt PTEN is a phosphatase that 
negatively regulates PI3K signaling. PTEN null cells have 
constitutivery active of Akt, with increased S6K activity and 
S6 phosphorylation (30). S6K activity is inhibited both by 
PI3K inhibitors wortmannin and LY294002 and by mTOR 
inhibitor raparnycin (24). Akt phosphorylates Ser-2448 in 
rnTOR in vitro, and this site is phosphorytated upon Akt 
activation in vivo (31-33). Thus. mTOR is regulated by the 
PI3K/AM pathway; however, this does not appear to be the 
only mode of regulation of mTOR activity. Whether the PI3K 
pathway also- regulates S6K and 4E-BP1 phosphorylation 
independent of mTOR is controversial. 

Interestingly, mTOR airfophospboryfatjon is blocked by wort- 
mannin but not by raparnycin (34). This seeming inconsistency 
suggests that mTOR-responsive regulation of 4E-BP1 and S6K 
activity occurs through a mechanism other than htrinsic mTOR 
kinase activity. An alternate pathway for 4E-BP1 and S6K phos- 
phorylation by mTOR activity is by the inhibition of a phospha- 
tase. Treatment with carycu&n A, an inhibitor of phosphatases 1 
and 2A. reduces raparnycin- induced dephosphorybtion of 4E- 
BP1 and S6K by raparnycin (35). PP2A interacts with tuff-length 
S6K but not whh a S6K mutant that is resistant to dephospho- 
rylation resulting from raparnycin. mTOR phosphorylates PP2A 
in vitro; however, how this process alters PP2A activity is not 
known. These results are consistent with the model that phos- 
- phorylation of a phosphatase mTOR prevente de^ 
rylation of 4E-BP1 and S6K, and conversely, that nutrient dep- 
rivation and raparnycin block inhibition ol the phosphatase by 
mTOR 

Polyadenylation. The pory(A) tart in eukaryotic mRNA is 
important in enhancing translation initiation and mRNA sta- 
bility. Pofyadfcnylation plays a key role in regulating gene 
expression during oogenesis and earty embryogenesis. 
Some mRNA that are translation^ inactive in the oocyte are 
poryadenylated concomitantly with transitional activation in 
oocyte maturation, whereas other mRNAs that are transla- 
tion^ active ouring oogenesis are deadenylated and trans- 
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rationally silenced (36-38). Thus, control of pory(A) tail syn- 
thesis is an important regulatory step in gene expression. 
The 5' cap and pory(A) tail are thought to function synergis- 
tically to regulate mRNA translational elficiency (39, 40). 

RNA Packaging. Most RNA-binding proteins ar e assem- 
bled on a transcript at the time of transcription, thus deter- 
mining the translational fate of the transcript (41). A highly 
conserved family of Y-box proteins is found in cytoplasmic 
messenger ribonucleoprotein particles, where the proteins 
are thought to play a role in restricting the recrurtment of 
mRNA to the translational machinery (4t-43). The major 
mRNA-associated protein, YB-1, destabilizes the interaction 
of elF4E and the 5' mRNA cap in vitro, and overexpression of 
YB-1 results in translational repression in vivo (44). Thus, 
alterations in RNA packaging can also play an important rote 
in translational regulation. 

Translation Alterations Encountered in Cancer 
Three main alterations at the translational level occur in cancer: 
variations in rnRNA sequences that increase or decrease trans- 
tationaf efficiency, changes in the expression or availability of 
components of the translational machinery, and activation of 
translation through abenantty activated signal bansduction 
pathways. The first alteration affects the translation of an indi- 
vkfual mRNA that may play a role un carcinogenesis. The sec- 
ond and third alterations can lead to more global changes, such 
as an increase in the overall rate of protein synthesis, arid the 
translational activation of several mRNA species. 

Variations in mBNA Sequence 

Variations in mRNA sequence affect the translational effi- 
ciency of the transcript. A brief description of these variations 
and examples of each mechanism foKow. 

Mutations. Mutations in the mRNA sequence, especialry 
in the 5' LTTR, can alter its translational efficiency, as seen in 
the foUowing examples. 
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c-myc Safto et af. proposed that translation of full-length 
c-myc is repressed, whereas in several Burkftt lymphomas 
that have deletions of the mRNA 5' UTR, translation of c-myc 
b more efficienl (45). More recently, it was reported that the 
5' UTR of c-myc contains an IRES, and thus c-myc transla- 
tion can be initiated by a cap- independent as well as a 
cap-dependent mechanism (46, 47). In patients with multiple 
myeloma, a C-*T mutation in the c-myc IRES was identified 
(48) and found to cause an enhanced initiation of translation 
via internal ribosomal entry (49). 

BHCA1, A somatic point mutation (117 G-»Q in position 
-3 with respect to the start codon of the BRCA1 gene was 
identified in a highly aggressive sporadic breast cancer (50). 
Chimeric constructs consisting of the wild-type or mutated 
BRCA1 5' UTR and a downstream tuciferase reporter dem- 
onstrated a decrease in the translation^ efficiency with the 5' 
UTR mutation. 

. CycJin-dependent Kinase Inhibitor 2A. Some inherited 
melanoma kindreds have a G->T transversion at base -34 
of cyclin-dependent kinase inhibitor^, which encodes a 
cycHn-dependent kinase 4/cych'n- dependent kinase 6 kinase 
mhibitor important in 6 t checkpoint regulation (51). This 
■ mutation gives rise lb a novel AUG translation initiation 
- codon, creating an upstream open reading frame that com- 
petes for scanning ribosomes and decreases translation 
from the wild-type AUG. 

Alternate Splicing and Alternate Transcription Start 
Sites. Amotions in splicing and altemale transcription sites 
can lead to variations in 5' UTR sequence, length, and second- 
ary structure, ultimately impacting translation^ efficiency. 

ATM. The ATM gene has tour noncoding exons in its 5' 
UTR that undergo extensive alternative splicing (52) The 
contents of. 12 different 5' UTRs that show considerable 
diversity in length and sequence have been identified These 
divergent 5' leader sequences play an important role in the 
translattonal regulation of the ATM gene. 

mum. In a subset of tumors, overexpression of the onco- 
protem mdm2 results in enhanced translation of the mdm2 
mRNA Use of different promoters leads to two mdm2 tran- 
septs that differ only in their 5' leaders (53). The longer 5' 
UTH contains two upstream open reading Irames. and this 
mRNA B loaded with ribosomes inefficiently compared with 
the short 5' UTR. 

BRCA1. In a normal mammary gland, BRCA1 mRNA is 
expressed with a shorter leader sequence (5'UTRa). whereas 
. '"sPOradK; breast <^cerfe^^ 

wrth.a longer leader sequence (5' UTRb); the translation^ 
etfictency of transcripts containing 5' UTRb is 10 times lower 
man that of transcripts containing 5* UTRa (54) 

TGF-03. TGF-fJ3 mRNA includes a U-kb 5' UTR which 
exerts an mhibitory effect on translation. Many human breast . 

^TH C * e " 5* C ° ntain 3 nOVe ' TGF '^ ******* with 3 5 ' 
that * 870 nucleotides shorter and has a Mold greater 
translatronal efficiency than, the normal 7GF-03 mRNA (55) 
Alternate Polyodeny lotion Sites. Multiple polyadenyl- 
ahon , , d ; ng |o me generation Q , tra 

with d,ff enng 3' UTR have been described for several mRNA - 
spec.es. such as the RET prolo^ncogene (56). ATM gene 
* '* t,ssue Potior of metalloproteinases-3 (57). RHOA 



proto-oncogene (58), and cafroodulin-l (S9)l* Although the 
effect" of these alternate 3' UTRs on translation is not yet 
known, they may be important in RNA-protein interactions 
that affect Iransrational recruitment. The role of these alter- 
ations in cancer development and progression is unknown. 

Alterations in the Components of the 
Translation Machinery 

Alterations in the components of translation machinery can 
take many forms. 

Oyerexpresssion of elF4E. Overexpression of elF4E 
causes malignant transformation in rodent cells (60) and the 
deregulation of HeLa cell growth (61). Pofcmovsky et at. (62) 
found that eIF4E overexpression substitutes for serum and 
individual growth factors in preserving viability of fibroblasts, 
which suggests that elF4E can mediate both proliferative and 
survival, signaling. 

Elevated levels of elF4 E mRNA have been found in a broad 
spectrum of transformed ce!J lines (63). elF4E levels are 
elevated in ad ductal carcinoma in situ specimens and inva- 
sive ductal carcinomas, compared with benign breast spec- 
imens evaluated with Western blot analysis (64, 65). Prelim- 
, inary studies suggest that this overexpression is attributable 
to gene amplification (66). 

There are accumulating data suggesting that eJF4E overex- 
pression can be valuable as a prognostic marker. eIF4E over- 
expression was found in a retrospective study to be a marker of 
poor prognosis in stages I to III breast carcinoma (67). Verifica- 
tion of the prognostic value of elF4E in breast cancer is now 
under way in a prospective trial (67). However, in a different 
study. elF4E expression was correlated with the aggressive 
behavior of non-Hodgkin's lyrnphornas (68). In a prospective 
analysis of patients with head and neck cancer, elevated levels 
of elF4E in histologically tumor-free surgical margins predicted 
a significantly increased risk of local-regional recurrence (9). 
These results all suggest that'eIF4E overexpression can be 
used to select patients who might benefit from more aggressive 
systemic therapy. Furthermore, the head and neck cancer data 
suggest that elF4E overexpression is a field defect and can be 
used to guide local therapy. 

Alterations in Other Initiation Factors. Alterations in a 
number of other initiation factors have been associated with 
cancer. Overproduction of elF4G, similar to elF4E, leads to 
malignant transformation in vitro (69). etF-2o is found in 
increased levels in bronchioloafveolar carcinomas of the lung 
(3). Initiation factor e!F-4A1 is overexpressed in melanoma 
(70) and hepatocellular carcinoma (71). The p40 subunit of 
translation initiation factor 3 is amplified and overexpressed 
in breast and prostate cancer (72), and the elF3-p1 10 subunit 
is overexpressed in testicular seminoma (73). The role that 
overexpression of these initiation factors plays on the devel- 
opment and progression of cancer, if any, is not known. 

Overexpression of S6K. S6K is amplified and highly 
overexpressed in the MCF7 breast cancer cell fine, com- 
pared with normal mammary epithelium (74). In a study by 
Barlund ef a/. (74), S6K was amplified in 50 of 668 primary 
breast tumors, and a statistically significant association was 
observed between amplification and poor prognosis. 
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Overexpression of PAP. PAP catalyzes 3' poty{A). syn- 
thesis. PAP is overexprcssed in human cancer cells com- 
pared with norma) and viraJfy transformed cells (75). PAP 
enzymatic activity in breast tumors has been correlated with 
PAP protein levels (76) and, in mammary tumor cytosote, was 
found to be an independent factor for predicting survival (76). 
Utile is known, however, about how PAP expression or ac- 
tivity affects the transitional profile. 

Alterations in RNA-bindtng Proteins. Even less is known 
about alterations in RNA packaging in cancer. Increased ex- 
pression and nuclear localization of the RNA- binding protein 
YB-1 are indicators of a poor prognosis for breast cancer (77), 
non-small cell lung cancer (78), and ovarian cancer (79). How- 
ever, this effect may be mediated at least in part at the level of 
transcription, because YB-1 increases diemoresistance by en- 
hancing the transcription of a multidrug resistance gene (80). 



Activation of Signal transduction Pathways 
Activation of signal transduction pathways by loss of tumor 
suppressor genes or overexpression of certain tyrosine kinases 
can contribute to the growth and aggressiveness of tumors. An 
important mutant in human cancers is the tumor suppressor 
gene PTEN, which leads to the activation of the Pt3K/Akt path- 
way. Activation of PI3K and Akt induces the oncogenic trans- 
formation of chicken embryo fibroblasts. The transformed cells 
show constitutive phosphorylation of S6K and of 4E-BP1 (8t). 
A mutant Akt that retains kinase activity but does not phos- 
pborytate S6K or 4E-BP1 does not transform fibroblasts, which 
suggests a correlation between the oncogenicity of P13K and 
Akt and the phosphorylation of S6K and 4E-BP1 (81). 

Several tyrosine kinases such as platelet-derived growth 
factor, insulin- like growth factor, HER2/neu, and epidermal 
growth factor receptor are overexpressed in cancer. Be- 
cause these kinases activate downstream signal transduc- 
tion pathways known to alter translation initiation, activation 
of translation is likely to contribute to the growth and. aggres- 
siveness of these tumors. Furthermore, the mRNA for many 
of these kinases themselves are under translation^! control. 
For example, HER2/neu mRNA is translationally controlled 
both by a short upstream open reading frame that represses 
HER2/neu translation in a cell type-independent manner and 
by a distinct ceH type-dependent mechanism that increases 
tronsiational efficiency (82). HER2/neu translation is different 
in transformed and normal cells. Thus, "rt is possible that 
alterations at the translational level can. in part account for 
the discrepancy between HER2/neu gene amplification de- 
tected by fluorescence in situ hybridization and protein levels 
detected by immunohistochemical assays. 



Translation Targets of Selected Cancer Therapy 
Components ol the translation machinery and signal path- 
ways involved in the activation of translation initiation repre- 
sent good targets for cancer therapy. 

Targeting the mTOR Signaling Pathway: Rapamycin 
and Tumslatin 

Rapamycin inhibits the proliferation of rymphocytes. It was 
initially developed as an immunosuppressive drug tor organ 



transplantation, Rapamycin with FKBP 12 (FK506-binding 
protein, M, 12,000) binds to mTOR to inhibit its function. 

Rapamycin causes a small but significant reduction "rn the 
Initiation rate of protein synthesis (83). It blocks cell growth in 
part by blocking S6 phosphorylation and selectively sup- 
pressing the translation of 5' TOP mRNAs, such as ribosomai 
proteins, and elongation factors (83-85). Rapamycin also 
blocks 4E-BP1 phosphorylation and inhibits cap-dependent 
but not cap-independent translation (17,.86). 

The raparnydn-sensifive signal transduction pathway, acti- 
vated during rnaRgnant. transformation and cancer progression, 
is now being studied as a target for cancer therapy (87): Pros- 
tate, breast, small cell lung, gjioblastoma. melanoma, and T-ceD 
leukemia are among the cancer lines most sensitive to the 
rapamycin analogue CCt-779 (Wyeth-Ayerst Research; Ret 
87). In rhabdorrrycosarcoma cell lines, raparrrycin is either cyto- 
static or cytocidat, depending on the p53 status of the celt; p53 
wild-type cells treated with raparnych arrest in the G, phase 
and maintain their viability, whereas p53 mutant cells accumu- 
late "m G, and undergo apoptosis (88, 89). In a recently reported 
study using human primitive neuroectodermal tumor and 
meduflobrastoma models, raparrrycin exhibited more cytotox- 
icity in combination wilh cisplatin and camptothecin than as a 
single agent. In vivo, CCl-779 delayed growth of xenografts by 
1 60% after 1 week of therapy and 240% after 2 weeks. A single 
high-dose administration caused a 37% decrease in tumor 
volume. Growth inhibition in vivo was 1.3 times greater, with 
cisplatin in combination with CCl-779 than with cisplatin alone 

(90) . Thus, preclinical studies suggest that rapamycin ana- 
logues are useful as single agents and in combination with 
chemotherapy. 

Rapamycin analogues CCl-779 and RAD001 (Novartis, 
Basel, Switzerland) are now in clinical trials. Because of the 
known effect of rapamycrn on lymphocyte proliferation, a 
potential problem with rapamycin analogues is immunosup- 
pression. However, although prolonged immunosuppression 
can result from rapamycin and CCl-779 administered on 
continuous-dose schedules, the immunosuppressive effects 
of rapamycin analogues resolve in —24 h after therapy 

(91) . The principal toxicities of CCl-779 have included der- 
matotogical toxicity, myelosuppression, infection, mucositis, 
diarrhea, reversible elevations in Uver function tests, hyper- 
glycemia, hypokalemia, hypocalcemia, and depression (87, 
92-94). Phase II trials of CCl-779 have been conducted in 
advanced renal cell carcinoma and in stage Ht/iy breast 
carcinoma patients who failed with prior chemotherapy. In 
the results reported in abstract form, although there were no 
complete responses, partial responses were documented in 
both renal cell carcinoma and in breast carcinoma (94. 95). 
Thus, CCl-779 has documented preliminary clinical activity in 
a previously treated, unselected patient population. 

Active investigation is under way into patient selection for 
mTOR inhibitors. Several studies have found an enhanced 
efficacy of CCl-779 in PTEN-null tumors (30, 96). Another 
study found that six of eight breast cancer cell lines were 
responsive to CCl-779, although only two of these lines 
lacKed PTEN {37) There was, however, a positive correlation 
between Akt activation and CCl-779 sensitivity (97). This 
conelation suggests that activation of the P13K-Akt palhway, 
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regardless of whether H is attributable to a PTEN mutation or 
to over expression of receptor tyrosine kinases, makes can- 
cer cell amenable to mTOR- directed therapy. In contrast, 
lower levels of the target of mTOR, 4E-BP1, are associated 
with rapamycin resistance; thus, a lower 4E-BP1/elF4E ratio 
may predict rapamycin resistance (98). 

Another mode of activity for rapamycin and its analogues 
appears to be through inhibition of artgiogenesis. This activ- 
ity may be both through direct inhibition of endothelial cell 
proliferation as a result of mTOR inhibition in these cells or by 
inhibition of translation of such proangiogenic factors as 
vascular endothelial growth factor in tumor cells {99, 100), . 

The angipgenesis inhibitor tumstatin, another anticancer 
drug currently under study, was also found recently to inhibit 
translation in endothelial cefls (101). Through a requisite in- 
teraction with iintegrin, tumstatin inhibits activation of the 
PKJK/Akt pathway and mTOR in endothelial cells and pre- 
vents dissociation of eIF4E from 4E-BP1, thereby inhibiting 
cap-dependent translation. These findings suggest that en- 
dothelial cells are especially sensitive to therapies targeting 
the mTOR-signafing pathway. 



ment also reduces the expression of angiogenic factors (115) 
and has been proposed as a potential a^uvant therapy for head 
and neck cancers, particularly when elevated elF4E is found in 
surgfcal margins. Smalt molecule inhibitors that bind the elF4G/ 
4&BP1-bihdin9 domain of elF4E are proapoptotic (116) and 
are also being actively pursued 



Exploiting Selective Translation lor Gene Therapy 
A different therapeutic approach that takes advantage of the 
enhanced cap-dependent translation in cancer ceBs is the use 
of gene therapy vectors encoding suicide genes with highly 
structured 5' UTR These mRNA would thus be at a competitive 
disadvantage m normal cells and not translate weH, whereas in 
cancer cells, they would translate more efficiently. For example, 
the introduction of the 5' UTR of fibroblast growth factor-2 5' to 
the coding sequence of herpes simplex virus type-1 thymidine 
kinase gene, allows for selective translation ot herpes simplex 
virus type-1 thymidine kinase gene in breast cancer ceB lines 
compared with riormal mammary cefl fines and results in se- 
lective sensitivity to ganciclovir (1 17). 



Targeting elfZcc EPA, Clotrimazole, mda-7 t 
and Flavonoids 

EPA is an n-3 polyunsaturated tatty acid found in the fish- 
based diets of populations having a low incidence of cancer 
(102). EPA inhibits the proliferation of cancer cells (103), as 
well as in animal models (104, 105). tt blocks cell division by 
inhibiting translation initiation (105). EPA releases Ca 2 * from 
intracellular stores while inhibiting their refilling, thereby ac- 
tivating PKR. PKR, in turn phosphorylates and inhibits elF2a, 
' resulting in the inhibition of protein synthesis at the level of 
translation initiation. Similarly, clotrimazole, a potent antipro- 
liferative agent in vitro and in vrVo, inhibits cell growth through 
depletion ol Ca 2 " stores, activation of PKR, and phospho- 
rylation of e!F2a (106). Consequently, clotrimazole preferen- 
tially decreases the expression of cyclins A, E, and D1, 
resulting in blockage of the cell cycle in G v 

mda-7 is a novel tumor suppressor gene being developed 
as a gene therapy agent. Adenoviral transfer of mda-7 (Ad- 
mda7) induces apoptosis in many cancer cells including 
breast, colorectal, and lung cancer (107-109). Ad-mda7 also 
induces and activates PKR, which leads to phosphorylation 
of e)F2o and induction ol apoptosis (110). 

Flavonoids such as genistein and quercetin suppress tu- 
mor cell growth. AH three mammalian elF2a kinases, PKR, 
heme-regulated inhibitor, and PERK/PEK, are activated by 
flavonoids, with phosphorylation of e(F2o and inhibition of 
protein synthesis (111). 

Targeting elF4A and elF4E: Antisense RNA 
and Peptides 

Anlisense expression of eIF4A decreases the proliferation rate 
ol melanoma cefls (1 12). Sequestration ol efF4E by overexpres- 
sion ot 4E-BP1 is proapoptotic and decreases tumorigeniciry 
(113, 114). Reduction of e(F4E with antisense RNA decreases 
soft agar growth, increases tumor latency, and increases the 
rates of tumor doubling times" (7). Antisense eIF4E RNA treat- 



Toward the Future 

Translation is a crucial process in every cell. However, several 
alterations in translations! control occur in cancer. Cancer cells 
appear to need an aberrantly activated translational state for 
survival, thus allowing the targeting of translation initiation with 
surprisingly low toxicity. Components of the translational ma- 
chinery, such as elF4E, and signal transduction pathways in- 
volved in translation initiation; such mTOR, represent promising 
targets for cancer therapy. Inhibitors of the mTOR have already 
shown some preliminary activity in clinical trials. It is possible 
that with the development ol better predictive markers and 
better patient selection, response rates to single-agent therapy 
can be improved Similar to other cytostatic agents, however, 
mTOR inhibitors are mosl likely to achieve clinical utility in 
combination therapy. In the interim, pur increasing understand- 
ing of translation initiation and signal transduction pathways 
promise to lead to the identification of new therapeutic targets 
in the near future. 
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Overexpression of chemokines, fibi ogenic cytokines, and myofibroblasts in 
human membranous nephropathy. 

Meszano SA , Proguett MA, Burgos ME . Ardiles LG . Aros_CA, Caorsi I, Egido J . 
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Overexpression of chemokines, fibrogenic cytokines, and myofibroblasts in human 
membranous nephropathy. BACKGROUND: Proteinuria plays a central role in the 
progression of glomerular disease, and there is growing evidence suggesting that it may 
determine tubular cell activation with release of chemokines and fibrogenic factors, 
leading to interstitial inflammatory reaction. However, most studies on this subject have 
been performed in experimental models, and the experience in human kidney biopsies 
has been scarce. We analyzed the tissue sections of patients with idiopathic membranous 
nephropathy (IMN), a. noninflammatory glomerular disease that may follow a progressive 
disease with heavy persistent proteinuria, interstitial cell infiltration, and decline of renal 
function. METHODS: Paraffin-embedded biopsy specimens from 25 patients with IMN 
(13 progressive and 12 nonprogressive) were retrospectively studied by 
immunohistochemistry [monocyte chemoattractant protein- 1 (MCP-1), regulated on 
activation normal T-cell expressed and secreted chemokine (RANTES), osteopontin 
(OPN), platelet-derived growth factor-BB (PD-GF-BB)] and in situ hybridization [MCP- 
I, RANTES, PDGF-BB, transforming growth factor-beta 1 (TGF-betal)]. Moreover, we 
studied the presence of myofibroblasts, which were identified by the expression of alpha- 
smooth muscle actin (alpha-SMA), the monocytes/macrophages (CD68-positive cells) 
and T-eell infiltration (CD4+ and CD8+ cells). All of the patients were nephrotic and ' 
without treatment at time of the biopsy. RESULTS: A strong up-regulation of MCP-1 , 
RANTES, and OPN expression was observed, mainly in tubular epithelial cells, with a 
significant major intensity in the progressive IMN patients. A strong correlation between 
the mRNA expression and the corresponding protein was noted. The presence of these 
chemokines and OPN was associated with interstitial cell infiltration. TGF-beta and 
PDGF were also up-regulated, mainly in tubular epithelial cells, with a stronger 
expression in the progressive IMN, and an association with the presence of 
myofibroblasts was found. CONCLUSIONS: Patients with severe proteinuria and 
progressive IMN have an overexpression in tubular epithelial cells of the chemokines . 
MCP-1, RANTES, and OPN and the profibrogenic cytokines PDGF-BB and TGF-beta. 
Because this up-regulation was associated with an interstitial accumulation of 
mononuclear cells and an increase in myofibroblast^ activity, it is suggested that those 
mediators are potential predictors of progression in IMN. Finally, based on experimental 
data and the findings of this article, we speculate that severe proteinuria is the main factor 
responsible for the up-regulation of these factors in tubular epithelial cells. 
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Decreased uncoupling protein expression and intrarnyocytic triglyceride 
depiction iu formerly obese subjects. 
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r°r^ CTIVE: T ° eXamine the macular uncoupling protein expression 2 (UCP2) and 
UCP3 gene expression m morbid obese subjects before and after bariatric surgery fbilio- 
pancreatic diversion (BPD)]. RESEARCH" METHODS AND PROCEDURE Eleven 

RPrfS?, (BMI = 49 + '~ 2 ke/m(2)) were studied bef <*e BPD and 24 months a^CT 
BPD. Skeletal muscle UCP2 and UCP3 mRNA was measured using reverse 
franscnptase-competitive polymerase chain reaction and UCP3 protein by Western 

ctl n a g t Mr TT lC ^ lyCendeS WCre qUantified ^ Wgh-performanS h^u d 
chromatography. Twenty-four-hour energy expenditure and respiratory quotient (ROV 
were measured ,n a respiratory chamber. RESULTS: After BPd, flJSS^SuL 

-A 0 £ to i?a + ) o?Q ' P /m?° I} - T? intram ^ ic ^ende level d/opped (3.66 

UC?2 fnd ifpi Jma" 1 ^ m f ° f frCSh tiSSUC ' P < 0 0001 > after BPD - of 
UCl 2 and UCP3 mRNA was significantly reduced (from 35.9 +/-6.1% to 18 6+1- 4 5% 

of cyclophihn, P = 0.02; from 60.2 +/- I4.0%. to 33.4 + /- 8.5%, p = 0.03; re pect^eM 

60 3 1 Z7-TmT f°r^^y ***** (272.19 + / P 84.13 vs. 175.78 T 
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„r„? ^ ~ ° t ° 07 l ^!f^ ented die most P"WM independent variable for predicting 
UCP3 vanat,on. DISCUSSION: The strong correlation of UCP expression ^ decrease 
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2d^E? 1 m ' ram , HaS bee " rCVealed by in Situ hybridation, northern blot 

anddotblotanalys.s usmgspec.ficantisenseoligonucleotides. In addition the 
quant, ative distribution of the gamma 2S and gamma 2L subunit peptides participating in 
tmu n aSSCm > ? re -P tors/be ^^epine receptors hasten mapped by 8 

rnnmunoprecpitation w.th specific anti-gamma 2S and anti-gamma 2L antibodies 

mTwrTri? 65 1"* bra ! n regi ° nS 3Ce enriched in 8""™ 2L such a * nitons of 
the layer II of staate cortex and cerebellar Purkinje cells as well as the inferior colliculus 
upenor colliculus deep cerebellar nuclei, medulla and pons. Other neuronal types and 
regions are ennched m gamma 2S such as the mitral cells of the olfactory bulb pyramidal 
neurons of the pynform cortex, layer VI of the neocortex, granule cells of the d!lT 
gyrus and pyramidal cells of the hippocampus. Other cortical areas and cerebellar granule 
cells express both gamma 2S and gamma 2L in comparable amounts: There is a g £T 
correlation between die relative expression of gamma 2S and gamma 2L rnRNAs and the 
re Native presence of these protein subunits in folly assembled L mature recepforsTn the 
studied bram regions. The differential distribution of gamma 2S and gamma 2L might 
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Follicle-Stimulating Hormone Receptor and Its Messenger Ribonucleic Acid Are 
Present in the Bovine Cervix and Can Regulate Cervical Prostanoid Synthesis 1 
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ABSTRACT 

The hypothesis that FSH regulates the bovine cervical pros- 
taglandin E 2 (PGE 2 ) synthesis that is known to be associated with 
cervical relaxation and opening at the time of estrus was inves- 
tigated. Cervical tissue from pre-estrous/estrous, luteal, and 
postovulatory cows were examined for 1) the presence of bovine 
(b) FSH receptor (R) and its corresponding mRNA and 2) the 
effect of FSH on the PGE 2 regulatory pathway in vitro. The pres- 
ence of bFSHR mRNA in the cervix (maximal during pre-estrus/ 
estrus) was demonstrated by the expression of a reverse tran- 
scription (RT) polymerase chain reaction (PCR) product (384 
base pairs) specific for bFSHR mRNA and sequencing. Northern 
blotting revealed three transcripts (2.5, 3.3, and 3.8 kilobases 
[kb]) in cervix from pre-estrous/estrous cows. The level of FSH R 
(75 kDa) was significantly higher (p < 0.01) in Western blots of 
pre-estrous/estrous cervix than in other cervical tissues. There 
was a good correlation between the 75-kDa protein expression 
and its corresponding transcript of 2.55 kb throughout the es- 
trous cycle as described by Northern blot analysis as well as RT- 
PCR. Incubation of FSH (10 ng/ml) with pre-estrous/estrous cer- 
vix resulted in a 3-fold increase in the expression of FSHR and 
a 2-fold increase in both G protein (ot s ) and cyclooxygenase II. 
FSH (5-20 ng/ml) significantly increased (p < 0.01) cAMP, ino- 
sitol phosphate (p < 0.01), and PGE 2 {p < 0.01) production by 
pre-estrous/estrous cervix but not by cervix at the other stages. 
We conclude that bovine cervix at the time of the peripheral 
plasma FSH peak (pre-estrus/estrus) contains high levels of 
FSHR and responds to FSH by increasing the PGE 2 production 
responsible for cervical relaxation at estrus. 

INTRODUCTION 

FSH and LH are the two known pituitary hormones that 
regulate gonadal functions. FSH is essential for female fer- 
tility, specifically for folliculogenesis, and human ovarian 
failure has been related to mutations both in the FSH re- 
ceptor (FSHR) and in the FSH[3 gene [1]. However, unlike 
LH, for which extra-gonadal binding has been documented 
for the uterus in the'cow [2], pig [3], and human [4], no 
extra-gonadal effects of FSH have been reported. The uter- 
ine LH receptors have been shown to be physiologically 
active in the cow [2, 5]. The concentration of uterine LH 
receptors is related to the phase of the cycle, and binding 
of LH to uterine receptors increases uterine prostaglandin 
synthesis [2, 5]. Preliminary observations in our laboratory 
indicated that both FSH and LH receptors are also present 
in the cervix. We therefore investigated whether FSHRs are 
associated with the regulation of cervical prostaglandin, 
particularly around the time of estrus, when both cervical 
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PGE 2 production [6] and peripheral FSH concentrations are 
maximal. 

The present investigation was therefore conducted to de- 
termine the presence of FSHR and its mRNA, and its phys- 
iological activity, in the bovine reproductive tract. Since the 
cervix is a prostaglandin Ei (PGE|)- and PGE 2 -sensitive 
tissue and responds to hormonal stimulation, we also de- 
termined whether FSH could increase cervical cyclooxy- 
genase expression and production of PGE 2 in vitro, partic- 
ularly at the time near' estrus. 

MATERIALS AND METHODS 

Animals 

Cervical and ovarian tissues were collected from Hol- 
stein cows at a local abattoir. The stage of the cycle was 
determined according to signs of ovulation and the status 
of corpora lutea (CL; weight, color). These parameters al- 
lowed the classification of the cervix into three groups: 1) 
pre-estrous/estrous (follicular, 18-20 days postovulation; 
regressed CL < 1.0 g; presence of cervical mucus; n = 
54); 2) postovulatory (1-4 days postovulation; presence of 
corpora hemorrhagica; signs of follicular rupture; n = 35), 
and 3) luteal (12-16 days postovulation; CL 4-6 g; n = 
44). Bovine granulosa cells were collected from pre-es- 
trous/estrous follicles and used as a positive control for 
FSHR expression [7]. 

The cervix external os segment was taken as the initial 
3 cm of the cervix (total length of cervix is about 9 cm). 
The cervical os segments, primarily luminally oriented 
muscularis with epithelium, were sliced into horizontal 
strips (1 cm in length), pooled, and minced finely with a 
scalpel. The Kimron Veterinary Institute Animal Care and 
Use Committee approved all procedures. 

FSHR Gene Expression 

RNA isolation. Bovine cervical or ovarian tissues were 
rapidly dissected and frozen in liquid nitrogen (within 20 
min after slaughter). Total RNA was extracted using the 
acid phenol, guanidine thiocyanate technique [8], using 
TriReagent (Molecular Research Center, Inc., Cincinnati, 
OH) according to the manufacturer's instructions. 

Reverse transcription (RT)-polymerase chain reaction 
(PCR). Bovine FSHR mRNA was detected as previously 
described for the LH receptor mRNA [5]. Oligonucleotides 
corresponding to the published sequence of the bovine 
FSHR [7] were synthesized and used as primers in a PCR 
reaction. Bovine ovarian and cervical tissue cDNA ob- 
tained by RT of 2 \ig RNA was used as a template in the 
PCR reaction using a pair of primers corresponding to the 
transmembrane segments of the FSHR. The primers,, 20- 
mer each, were selected using the OL1GO Program (Oligo 
Rl primer analysis software; National Biosciences, Plym- 
outh. MN). The forward primer corresponded to position 
J 399—14 19. (5'CGGCTTTTTCACTGTCTTTG3 / ) on the 
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bovine FSHR mRNA [7]. The reverse primer was a 20-mer 
oligonucleotide complementary to position 1763-1783 
(5'CGCTTGGCTATCTTGGTGTC3'). The predicted size 
of the RT-PCR product was 384 basepairs (bp). Four mi- 
croliters of the RT reaction were used as a template for 
PCR; the reaction was allowed to proceed 35 cycles using 
2 U Termus Aquaticus DNA polymerase (recombinant) 
provided by MBI Fermentas, Vilnius, Lithuania, and 200 
prnol of each primer. The cycling parameters of the PCR 
were 98°C for 20 sec, 56°C for 45 sec, and 72°C for 45 
sec. After amplification the samples were separated on a 
1% agarose gel, stained with ethidium bromide, and pho- 
tographed under UV light. The 384-bp fragment was ex- 
tracted using the Wizard PCR Preps Kit (Promega, Madi- 
son, Wl) and sequenced with the upper primer using an 
automatic sequencer (Applied Biosystems, Foster City, 
CA). For internal control, a bovine p-actin fragment of 890 
bp was produced from an upper primer (5'ACCA- 
ACTGGGACGACATGGAG3 ' ; 21 mer) and a lower prim- 
er (5'GCATTTGCGGTGGACAATGGA3'; 21 mer) as pre- 
viously described [5]. Each PCR amplification was stan- 
dardized using ovarian granulosa cDNA to produce a spe- 
cific band of the predicted size. Standardization was based 
on temperature, cycle number, Mg level, and pH. 

Western Blots 

Cervical, bovine seminal vesicle, or muscle minces were 
sonicated for 45 sec, lysed in 500 |xl lysing buffer (1% 
Nonidet-40 [Sigma, St. Louis, MO], 2 mM EGTA, 5 nM 
MgCl ? , 1 mM PMSF in PBS) and incubated for 2 h at 4°C. 
After centrifugation, aliquots were taken for protein deter- 
mination using gamma globulin (Sigma) as reference stan- 
dard [9] and a protein dye binding method (Bio-Rad Lab- 
oratories, Richmond, CA). Sixty micrograms of protein of 
the soluble tissue extract was then separated by electropho- 
resis on 5-9%-gradient SDS -PAGE and electroblotted onto 
nitrocellulose paper (Amersham, Little Chalfont, Bucks, 
UK) as described previously [9]. The nitrocellulose mem- 
brane was washed with PBS containing 0.05% TWEEN-20 
and blocked with 10% horse serum (Kimron Veterinary In- 
stitute) in washing solution. The nitrocellulose membrane 
was then treated with either 1) an anti-peptide antibody 
(code name 179) raised against human (h) FSHR peptide 
265-295, diluted 1:500 (donation of Dr. J.A. Dias, Wads- 
worth Center, New York State Department of Health); 2) 
rabbit antiserum for bovine G protein (ot ? ) diluted 1:1000; 
UBJ, Lake Placid, NY); or 3) rabbit anti-bovine cycloox- 
ygenase polyclonal antiserum (diluted 1 :200; Kimron Vet- 
erinary Institute) [9]. Different dilutions were used for each 
antibody, and the final dilution used was 75% of the dilu- 
tion, which gave a maximal signal as previously described 
[9]. The nitrocellulose paper was then incubated with horse- 
radish peroxidase-conjugated goat anti-rabbit lgG (Sigma 
Israel, Rehovot, Israel; diluted 1:2000 in washing buffer) 
for 1 h at room temperature. The presence of FSHR, G 
protein (a s ), or cyclooxygenase was then visualized by 
means of a color reaction as follows. The nitrocellulose 
paper was incubated in a substrate solution containing 3'3'- 
diaminobenzidine (0.5 mg/ml; Sigma) in a mixture of PBS 
containing 0.5% CaCl 2 and 6% H 2 0 2 - The antibody to 
hFSHR recognized the 75-kDa protein of the FSHR. The 
antibody to G (a s ) recognized both the 42-kDa and 87-kDa 
forms of this protein, and the antibody to cyclooxygenase 
recognized the 72-kDa form (cyclooxygenase II). Extract 
of bovine seminal vesicles that are known to have a high 



content of cyclooxygenase was used as positive control as 
previously described [9]. The densitometric scans were ob- 
tained using a bio-imaging system (B.I.S. 2020; Rhenium 
Dingo, Jerusalem, Israel) and processed with Tina 2.0 soft- 
ware (Fuji, Japan). Linearity of detection [9] was deter- 
mined for densitometry for both Western and Northern 
blots. Each Western blot was evaluated in the absence of 
the first antibody, and no signal was detected. 

Northern Blot Analysis 

For Northern blots, 10 p,g of total RNA was denatured 
in 20 jxl 50% formamide/2.2 M formaldehyde in single- 
strength 3-(«-morpholino) propanesulfonic acid (MOPS) 
buffer (0.04 M MOPS, 10 mM sodium acetate, 1 mM 
EDTA) at 65°C for 10 min. Samples were placed on ice, 
and 5 pj of loading buffer (0.5% SDS, 0.25% bromophenol 
blue, 25% glycerol, 25 mM EDTA) was added. Samples 
were subjected to electrophoresis through a 1.2% agarose 
formaldehyde gel and transferred by capillary blotting to 
nylon membranes (Nytran; Schleicher and Schuell, Keene, 
NH). RNA was subsequently UV cross-linked to mem- 
branes. The RT-PCR product fragments of 384 bp for 
FSHR and 890 bp for p-actin were used to generate bioti- 
nylated probes using random primer biotin labeling of DNA. 
for chemiluminescence . (NEBIot phototype kit; Biolabs, 
Beverly, MA) according to the manufacturer's instructions. 
The blotted membrane was prehybridized at 65°C for 1 h 
in prehybridization solution (6-strength SSC [3 M NaCl, 
0.3 M sodium citrate], 5-strength Denhardt's reagent [1%. 
ficoll, 1% polyvinylpyrrolidone, 1% BSA], 0.5% SDS, and 
100 |xg/ml denatured salmon sperm DNAs) and hybridized 
overnight in the same solution with the denatured biotiny- 
lated probe to the target RNA. The membrane was then 
washed in double-strength SSC, 0.1% SDS at room tem- 
perature for 10 min and washed again in 0.1 -strength SSC, 
0.1% SDS at 68°C for 15 min followed by subsequent 
chemiluminescence, development, and detection on x-ray 
film (RX; Fuji Film, Tokyo, Japan). To obtain quantitative 
data for the specific mRJNA, we used the densimetric value 
for p-actin mRNA to normalize each specific mRNA value. 
To remove FSHR probe, membranes were stripped using 
water for 15 min, then incubated in 0.4 M NaOH, 0.1% 
SDS at 80°C for 30 min. The membrane was rinsed again 
in 0.2 M Tris-HCl, 0.1 -strength SSC for 30 min at 25°C. 
After stripping, prehybridization and hybridization with p- 
actin probe were performed as described for FSHR mRNA. 

Cyclic AMP Determinations 

Cervical tissue minces (50 mg/ml) were incubated for 
10 min in wells of 1.5 ml containing 1.0 ml of Tissue Cul- 
ture Medium- 199 (TCM-199 without serum (Biological In- 
dustries, Beit Haemek, Israel) in the absence or presence 
of physiological (5, 10 ng/ml) and pharmacological doses 
(20, 40 ng/ml) of FSH (USDA-bFSH-8-1 ; no further in- 
crease in cAMP production was seen at 50 or 100 ng/ml) 
or forskolin, a stimulator of adenylate cyclase (10 p,M; Sig- 
ma). The dose of forskolin was selected to give a 3-fold 
increase in cAMP. At the end of the incubation, tissues were 
removed, blotted on filter paper to remove mucus, and in- 
cubated overnight at 4°C with 400 pi of 3% HC10 4 . The 
solution was then neutralized with 150 pJ of KHC0 3 and 
centrifuged, and 50-|jl1 aliquots were taken for the radiore- 
ceptor assay as described by Brown et al. [10] and modified 
for endometrium by Miyazaki et f al. [11]. Standards (0- 
1000 pg of cAMP) or samples were incubated at room tern- 
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FIG. 1 . RT-PCR amplification of bovine cervix FSHR and p-actin mRNA. 
RT-PCR was performed as described in Materials and Methods for FSHR 
and p-actfn. Separate RT-PCR amplification products of FSHR (estimated 
384 bp) and p-actin (estimated 890 bp) were combined and separated on 
agarose gel electrophoresis and stained with ethidium bromide. RT-PCR 
was performed using 2 jjug RNA of bovine granulosa cells as a positive 
control (lane 1), cervical tissue at the pre-estrous/estrous (lanes 3-4), post- 
ovulatory (ianes 5-6), and luteal phases (lanes 7-8). Lanes 9-1 0 represent 
^amplification, using as template the PCR product, cDNA, from the lu- 
teal-phase amplification. Lane 2 represents negative control in the ab- 
sence of RNA. The tissue was taken from 4 cows at each stage of the 
estrous cycle, and two representative samples from each stage are shown. 
SM, size markers. 



perature with 100 uJ of cAMP binding protein prepared 
from bovine adrenal extracts as described [10]. The buffer 
used for the assay was added to form a final reaction vol- 
ume of 400 jjlI. After 2-h incubation in a cold room, the 
reaction was stopped by adding 300 jxl of a charcoal-dex- 
tran solution; centrifugation followed, and the supernatant 
was removed for counting in a scintillation counter The 
sensitivity of the assay was 60 pg/tube. Quadruplicate as- 
says were made for each value determined. The within- 
assay and between-assay coefficients of variance were 8% 
and 10%, respectively. 



* - 1401 

BTFSH GCTTTTTCAC TGTCTTTGCC AGTGAGCTCT CAGTCTACAC TCTGACCGCC . 
Cervix GCTTTTTCAC TGTCTTTACT AGTGAGCTCT CAGTCTACAC TCTGACCGCC 

BTFSH ATCACGCTGG AAAGATGGCA TACCATCACC CATGCCATGC AGCTCGAATG 
Cervix ATCACGCTGG AAAGATGCCA TCCCATCACC CGTGCCATGC AGCTCGAATG 

BTFSH CAAAGTGCAG CTCCGCCATG CTGCCAGCAT CATGCTGGTG GGCTGGATCT 
Cervix CAAAGTGCAG CTCCGCCATG CTGCCAGCAT CATGCTGGTG GGCTGGATCT 

BTFSH TTGCTTTTGC AGTTGCCCTT TTTCCCATCT TTGGCATCAG CAGCTACATG 
Cervix TTGCTTTTGC AGTTGCCCTT TTTCCCATCT TTGGCATCAG CAGCTACATG 

BTFSH AAGGTGAGCA TCTGCCTGCC CATGGACATT GACAGCCCCT TGTCACAACT 
Cervix AAGGTGAGCA TCTGCCTGCC CATGGACATT GACAGCCCCT TGTCACAACT 

BTFSH CTATGTCATG TCCCTCCTTG TGCTCAATGT CCTGGCCTTT GTGGTCATCT 
Cervix CTATGTCATG TCCCTCCTTG TGCTCAATGT CCTGGCCTTT GTGGTCATCT 

BTFSH GTGGCTGCTA CACTCACATC TACCTCACGG TGAGGAACCC CAACATCACA 
Cervix GTGGCTXCTT CACTCACATC TACCTCACGG TGAGGAACCC CAACATCACA 

1761 

BTFSH TCCTCTTCTA GTG 
Cervix TCCTCTTCTA GTG 

FIG. 2. Automated nucleotide sequencing and homology between Bos 
taurus mRNA FSHR (CenBank accession number L22319) from nucleo- 
tides 1401 to 1 763 and the PCR products obtained using the upper primer' 
as described in Materials and Methods. A homology of 97.5% was found 
between Bos taurus and our amplified cDNA obtained by RT-PCR from 
cervical mRNA at pre-estrus/eslrus, suggesting that they are complemen- 
tary. 



Measurement of Phospholipase C (PLC) Activity 

Cervical minces (30 mg) were incubated in 1 ml TCM- 
199 containing 5 mCi [ 3 H]myo-inositol for 90 min. Slices 
were rinsed with 3 ml TCM-199 and incubated for another 
60 min to remove unincorporated [ 3 H]myo-inositol. At the 
end of the preincubations, slices were incubated in 1 ml 
TCM-199 containing 10 mM LiCl to inhibit inositol phos- 
phate (IP) hydrolysis, allowing 3 H-IP, 3 H-IP 2 , and 3 H-IP 3 to 
accumulate in the tissue after stimulation of PLC, in ab- 
sence or presence of 0-20 ng/ml of FSH or 0-20 ng/ml 
LH (USDA bLH-I-1). Incubations were terminated by add- 
ing 2 ml ice-cold chloroform/methanol/hydrochloric acid 
(5:10:0.1, v:v:v) and then 1 ml chloroform and 1 ml EDTA 
(5 mM) and extracted as described by Kisielewska et al. 
[12]. 3 H-1P, 3 H-IP 2 , and 3 H-IP 3 were separated from the 
labeled inositol compounds by using Dowex AG 1 -X8 (for- 
mate form). 3 H-Labeled IP was e luted from the column by 
sequential elution into 10 fractions. Aliquots of each frac- 
tion were added to scintillation vials containing 5 ml scin- 
tillation fluid, and radioactivity was determined using a 
scintillation counter. Total activity of PLC is expressed as 
cpm of total IPs/30 mg/30 min. 

RIA for PGE 2 and PCF 2a 

Aliquots of 100 jxl were taken at the end of the incu- 
bation period for specific RJA of PGE 2 and PGF 2a , which 
were performed without chromatographic separation. The 
antisera for PGF 2a (Sigma) reacts preferentially with PGF 2u 
but cross-reacts with PGF la (60%) and to a negligible ex- 
tent (< 0.1%) with prostaglandins of the A, B, and E series. 
The antisera. to PGE 2 (Sigma Israel) reacts preferentially 
with PGE 2 but cross-reacts with ?GE } (20%), PGA,, PGA 2 , 
PGF Ia , and PGF 2a (< 10%) and to a negligible extent with 
PGBi and PGB 2 (< 0.1%). The intraassay coefficients of 
variation were 9% and 11%, and the interassay coefficients 
of variance were 12.3% and 13% for PGF 2a and PGE 2 . 

Statistical Analysis 

ANOVA was performed with a significance level of p 
< 0.05. Data were further analyzed using Tukey's proce- 
dure (p < 0.05 or p < 0.01) to assess significance between 
, treatments. Student's /-test was used where appropriate. 
Values are expressed as means ± SEM. 

RESULTS 

FSHR mRNA Expression in the Cervix 

RT-PCR demonstrated that the FSHR gene is expressed 
in bovine cervix as indicated by the presence of the 384- 
bp fragment corresponding to bFSHR mRNA. The mRNA 
was maximally expressed in the cervix during the pre-es- . 
trous/estrous phase. In contrast, the expression of the gene 
was reduced in postovulation cervix and not detectable in 
luteal-phase cervix (Fig. 1). However, reamplification of the 
PCR product, cDNA, from the luteal phase produced a de- 
tectable signal (Fig. 1 ). 

To demonstrate that the absence of FSHR mRNA in lu- 
teal-phase cervix was specific, bovine p-actin mRNA was 
used as internal control. PCR demonstrated that the specific 
890 bp of the 3-actin cDNA band was produced by the 
cervical tissues of all the stages. The band for p-actin was 
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FIG. 3. Northern blot for analysis of total RNA (10 u,g) extracted from 
bovine cervical tissues collected from pre-estrous/estrous, postovulatory; 
and luteal-phase cows, and bovine granulosa cells as a positive control 
Relative migration of 28S and 1 BS ribosomal RNA is indicated; To localize 
mRNA for FSHR and (3-actin, our RT-PCR products (384-bp and 890-bp 
fragments, respectively) were used to generate biotinylated probes using 
biotin random primer. Figure shows results for FSHR mRNA in bovine 
granulosa cells as a positive control (lane A); and in cervical tissues at 
pre-estrus/estrus (lanes B-C), postovulation, (lanes D-E) and the luteal 
phase (lanes F-G). The corresponding (3-actin band was obtained by re- 
probing of the membranes previously tested for FSHR mRNA as described 
in Materials and Methods. Columns show mean ± SEM of the optical 
density of the ratio FSHR:p-actin, in which the luteal phase is given an 
arbitrary value of 1.0. "Significantly different from luteal-phase value (p 
< 0.01); n = 5 for each stage. 




FIG. 4. Presence of FSHR protein in cervical tissues as determined by 
Western blot. Soluble cell extracts (60 p.g protein) were used to determine 
the amount of FSHR as described in Materials and Methods. Western blot 
proteins were transferred to nitrocellulose paper and probed with anti- 
serum to peptide hFSHR, 265-296. Western blot displays a representative 
result for bovine granulosa cells, used as positive control (lane A); bovine 
seminal vesicles (lane B) and muscle, used as a negative control (lane C); 
and cervical tissues from three stages of the estrous cycle: pre-estrous/ 
estrous (lane D), postovulatory (lane E), and luteal (lane F). Histogram 
summarizes the results for the three stages of the estrous cycle. Columns 
show mean ± SEM of the relative optical density in which the luteal phase 
is given an arbitrary value of 1.0. Significantly different from luteal-phase 
value [p < 0.01); n = 7 for each stage. 



present even though the 384-bp band corresponding to the 
FSHR was absent (Fig. 1). 

The nucleotide sequence (Fig. 2) of the 384-bp fragment 
obtained from pre-estrous/estrous cervix was compared 
with that for the Bos taums FSHR reported by Houde et 
al. [7]. It was found that there was a 97.5% homology be- 
tween the 384-bp fragment and the nucleotide range 1410- 
1763 found in the Bos taurus FSHR. 

The Northern blot contained the expected three tran- 
scripts of 2.55, 3.3, and 3.8 kilobases (kb) present in the 
bovine granulosa. However, the high (6.8-kb) and low (1.6- 
kb) transcripts found in the ovary were not found in the 
cervix. The three transcripts were found in pre-estrous/es- 
trous cervix, but only one transcript (2.55 kb) was seen in 
luteal cervix (1/8 of the. pre-estrus/estrus level; n = 5) or 
postovulatory cervix (1/3 of the prc-estrus/cstrus level; n = 
5) (Fig. 3). 
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FIG. 5. Effect of incubation of cervical tissues with FSH on the expres- 
sion of the F5HR protein. Cervical tissue from 5 pre-estrous/estrous cows 
was incubated in the absence or presence of FSH (0 or 1 0 ng/ml). Western 
blot is representative for FSH protein concentration after 3 h of incuba- 
tion, with granulosa cell extract serving as positive control. Lane A, Gran- 
ulosa cells (GC); lane B, cervical tissue at Time 0; lane C, cervical tissue 
after 3 h of incubation in TCM-199; lane D, cervical tissue after 3 h of 
incubation with 10 ng/ml of FSH. Lane E represents muscle tissue incu- 
bated with FSH (10 ng/ml) for 3 h as a negative control. Histogram sum- 
marizes the results of six experiments. Columns are means ± SEM. *Sig- 
nificantly different (p < 0.01) from the. column representing the control 
at 3 h (a given value of 1). 



FSHR Protein in the Cervix 

The presence of FSHR protein in the cervix was dem- 
onstrated as determined by Western blot analysis using a 
specific antipeptide antibody for hFSHR. The antibody rec- 
ognized, a 75-kDa protein in both bovine granulosa and 
cervical preparations identical with the predicted molecular 
size of FSHR. The signal for the 75-kDa protein was stron- 
gest in pre-estrous/estrous cervix when compared with 
postovulation- (3-fold) or luteal-phase (6-fold) cervix (Fig. 
4). With the use of 60 p,g protein in each lane, this protein 
was observed to be expressed throughout the estrous cycle. 

In Vitro Effect of FSH on Induction of FSHRs 

A time-course (0.75, 1.5, and 3 h) and dose-response (0, 
10, and 20 ng/ml) study was carried out to determine in 
vitro effect of FSH on its own receptor. Cervical minces 
(100 mg) from cows in the pre-estrous/estrous, postovula- 
tory, and luteal phases were used. It was found, as deter- 
mined by Western blot, that FSH induced its own receptor 
in cervical tissues at pre-estrus/estrus, with a 3-fold increase 
(p < 0.01) observed at 10 ng/ml after 3 h of incubation 
(Fig. 5). However, no effect of FSH was seen af earlier 
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FIG. 6. Effect of incubation of cervical tissues with FSH on the synthesis 
of G protein. Cervical tissues from 5 pre-estrous/estrous cows were in- 
cubated in the absence or presence of FSH 00 ng/ml). Both the 87-kDa 
G protein (a, p, 7 complex) and 42-kDa protein (ctj are elevated at 3 h 
of incubation. Granulosa cell extract was used as a positive control. Lane 
A, Granulosa cells (GC); lane B, cervical tissue at Time 0; lane C, cervical 
tissue after 3 h of incubation in TCM-1 99; lane D, cervical tissue after 3 
h of incubation with 10 ng/ml of FSH. Columns are means ± SEM of 42- 
kDa protein. *Significantly different (p < 0.05) from the column repre- 
senting the control at 3 h (with an arbitrary value of 1 .0). 



times at the 10 ng/ml level. When 20 ng/ml was used, a 
stimulatory effect (2-fold) was seen after 1.5 h. No signif- 
icant response was observed when tissues of the postovu- 
latory or luteal phase were used at any dose or time tested 
(data not shown). 

Effect of FSH Treatment on G Protein (a J 

A time-course and dose-response study was carried out 
to determine whether cervical FSHR was coupled to G pro- 
tein. Cervical minces (100 mg) from cows in the pre-es- 
trous/estrous, postovulatory, and luteal phases were incu- 
bated in the presence of FSH (0-20 ng/ml) for 0.75, 1.5, 
and 3 h. It was found in pre-estrous/estrous cervical tissue 
that, in the presence of FSH (10 ng/ml), there was a sig- 
nificantly (p < 0.05) higher level of G protein (ot s ) at the 
level of both the 87-kDa"(a, p, 7 complex) and the 42-kDa 
(a s ) proteins after 3 h of incubation (Fig. 6). FSH, at either 
10 or 20 ng/ml/ did not have a stabilizing effect on the 
level of G protein (a s ) at shorter times of incubation (0.75 
or 1.5 h). No significant response was observed when tis- 
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FIG. 7. Effect of FSH on IP production in cervical tissue. Cervical tissues 
from six cows at each of three stages of the cycle were incubated for 30 
min in the absence or presence of FSH (10 ng/ml) or LH (ng/ml). IPs 
produced were determined in four replicates by amount of labeled com- 
pound (cpm) produced. Columns are means ± SEM (*p < 0.05; **p < 
0.01; significantly different from control value for each stage of the cycle). 



sues of the postovulatory or luteal phase were used at any 
dose or time tested (data not shown). 

IP Pathway Activation by FSH 

For IP measurement; cervical tissue minces (30 mg) 
from pre-estrous/estrous, postovulatory, and luteal-phase 
cows were incubated for 30 min in the absence or presence 
of FSH (10 ng/ml) or LH (10 ng/ml). FSH significantly (p 
< 0.01) increased the level of IP, IP 2 , and IP 3 at pre-estrus/ 
estrus. However, FSH significantly (p < 0.05) inhibited IP 
level in the postovulatory stage. In contrast, LH, but not 
FSH, stimulated IPs significantly (p < 0.01) at the luteal 
phase (Fig. 7). 

Adenylate Cyclase Activation by FSH 

A dose-response analysis was used to determine whether, 
the cervical FSHR was coupled to adenylate cyclase. In 
these experiments, cervical tissue minces (50 mg) obtained 
from pre-estrous/estrous, postovulatory, and luteal-phase 
cows were incubated with FSH (0-40 ng/ml). FSH in- 
creased cAMP accumulation in a dose-dependent manner 
when incubated with pre-estrous/estrous cervical tissues, 
with maximal stimulation 2.5 times (p < 0.01) that of con- 
trol in the presence of 10 ng/ml (Fig. 8 A). In contrast, cer- 
vical tissue from luteal or postovulation did not respond to 
FSH even though tissues from all phases of the cycle re- 
sponded to forskolin with a significant (p < 0.01) increase 
in cAMP (Fig. 8B). 

Effect of FSH on PGE Production by Cervical Tissues 

A dose-response analysis was carried out to determine 
whether FSHR was associated with cervical prostaglandin 
production. PGE 2 and PGF 2o production by cervical minces 
(100 mg/ml) in the presence of FSH (0-20 ng/ml) was de- 
termined by R1A. FSH stimulated (p < 0.01) basal PGE 2 
production in a dose-dependent manner and caused a 3-fold 
increase in PGE 2 production at 10 ng/ml in pre-estrous/ 
estrous cervical tissues (n = 8, Fig. 9). FSH had no effect 
on the small amount of basal PGF 2u (< 1 ng/100 mg) pro- 
duced by cervical tissue (data not shown). FSH did not 
elevate PGE 2 in cervical tissues from the luteal phase; fur- 
thermore, a small but significant (p < 0.05) inhibition was 
observed at postovulation (Fig. 9). 
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FIG. 8. Effect of FSH on adenylate cyclase production in cervical tissue. 
A) Cervical tissues from six cows at pre-estrus/estrus were incubated (4 
replicates) for 10 min in the absence or presence of FSH (0, 5, 10, 20, or 
40 ng/ml) or forskolin (10 jtmol/L), and the amount of cAMP produced 
was measured by a protein binding assay. B) Cervical tissues from six 
cows at each of the three stages of the estrous cycle were incubated (4 
replicates) for 10 min in the absence or presence of FSH (10 ng/ml) or 
forskolin (10 |iM). The amount of cAMP produced was measured by a 
protein binding assay. Columns are means ± SEM. Columns with asterisks 
were statistically different from their own control (*p < 0.01; **p < 0.05). 
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FIG. 9. FSH dose response on production of PCE 2 by cervical tissues. 
Cervical tissues from 8 pre-estrous/estrous, postovulatory, or luteal-phase 
cows were incubated in the absence or presence of FSH (5, 1 0, or 20 ng/ 
ml) for 6 h, and the effect on the production of PCE 2 was measured by 
RIA. *p < 0.01; **p < 0.05. 
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FIG. 10. Effect of incubation of cervical tissues with FSH on the ex- 
pression of cyclooxygenase. Cervical tissues from 5 pre-estrous/estrous 
cows were incubated in the absence or presence of FSH (10 ng/ml). The 
72-kDa cyclooxygenase II protein was elevated at 3 h of incubation. Lane 
A, bovine granulosa cells (GC); lane 6, cervical tissue at Time 0; lane C, 
cervical tissue after 3 h of incubation in TCM-199; lane D, cervical tissue 
after 3 h of incubation with 10 ng/ml of FSH. Columns are means ± SEM 
of 72-kDa protein. 'Significantly different (p < 0.01) from the column 
representing the control at 3 h (with an arbitrary value of 1 .0). 



Cyclooxygenase Activation by FSH 

To determine whether the presence of FSHR was asso- 
ciated with the expression of cyclooxygenase, cervical 
minces were incubated for 3 h in the presence or absence 
of FSH (10 ng/ml). Tissues from pre-estrous/estrous, post- 
ovulatory. and luteal-phase cows were extracted, separated 
on SDS-PAGE, and tested for the cyclooxygenase II (72 
kDa) using a specific antibody. It was found that in cervical 
tissues from six pre-estrous/estrous cows, FSH induced a 
200% increase in the expression of cyclooxygenase after 3 
h of incubation (Fig. 10). In contrast, no effect on cyclo- 
oxygenase expression was seen in cervical tissues from lu- 
teal-phase cows, and an insignificant (p > 0.05) elevation 
at the postovulatory phase was observed (data not shown). 

DISCUSSION 

Results of this study demonstrate by both PCR amplifi- 
. cation and Northern blot analysis that FSHR mRNA is pres- 
■ ent in the bovine cervix. The bands corresponding to the 
receptor (75 kDa) and its mRNA were maximally expressed 
during the pre-estrous/estrous phase. Incubation of cervical 
tissues with FSH increased the expression of cyclooxygen- 



ase and the production of PGE 2 (and possibly PGE,) at the 
pre-estrous/estrpus phase. The stimulation of cervical cy- 
clooxygenase was associated with activation of both the 
PLC and adenylate cyclase second messenger G-protein re- 
lated pathways. 

Northern blot analysis of bovine cervical tissues from 
pre-estrous/estrous cows revealed multiple transcripts for 
bovine cervical FSHR; The sizes of the major transcripts 
were 2.5 kb, 3.3 kb, and 3.8 kb, which were similar to those 
obtained in bovine granulosa ceils [7], rat testis and ovary 
[13, 14], and human ovary [15] and myometrium [16]. Oth- 
er minor transcripts have been reported in all of these spe- 
cies. The RT-PCR product yielded a single band of 384 bp, 
which was the expected molecular size, so the different 
isoforms of the FSHR are probably the result of differential 
splicing of the same transcript. Isoforms of FSHR have 
been characterized in human [17] and primate [18] ovary 
and in ovine testis [19]. Since only a single 2.5-kb isomer 
was present in luteal-phase cervix (which did weakly pro- 
duce FSHR), it would appear that there are different iso- 
forms present in the bovine cervix as well. 

The nucleotide sequence homology between our RT- 
PCR 384-bp product was 97.3% identical to the comparable 
region of the bovine FSHR, showing that our amplified 
cDNA was complementary to the Bos taurus mRNA FSHR. 
The PCR product of 384-bp cDNA was found primarily in 
the pre-estrous/estrous cervix. 

Using specific antibody raised in rabbits against hFSHR 
(amino acid sequence 265-296) for Western blot analysis 
resulted in a major signal for a 75-kDa protein. A protein 
of similar molecular mass has been reported in rat and hu- 
man ovary [20]. A strong signal for this protein was seen 
in the pre-estrous/estrous cervix compared with cervix from 
the other stages of the cycle. This cervical receptor was 
regulated by FSH itself, similar to the regulation of the 
FSHR in the rat ovary [14]. 

FSH increased cAMP production by cervical tissue from 
the pre-estrous/estrous phase but did not elevate cAMP in 
the postovulatory and luteal phases. FSH elevation of IPs 
displayed the same pattern. It would therefore appear that 
the cervical FSHR is associated with signal transduction 
pathways in a way similar to the accepted mechanism of 
LH action in the ovary and testes, i.e., activation of the 
second messenger pathways— adenylate cyclase [21] and 
phosphatidyl-inositol (PLC) [22, 23]. The inhibitory effect 
of FSH on IPs and cAMP at postovulation occur when 
FSHR is low. Since FSH activity is biphasic, i.e., is stim- 
ulated by a low dose and inhibited by a high dose, an in- 
hibitory effect at low concentrations could be the result of 
the change in the ligand-receptor ratio. 

FSH induced about a 200% increase in the expression 
of cyclooxygenase at pre-estrus/estrus but had no effect on 
the two other stages of the cycle. This was similar to the 
response of cyclooxygenase to LH that was observed for 
the bovine endometrium [2] and uterine vein [6]. However, 
in the endometrium, LH increased cyclooxygenase during 
both the luteal and postovulatory phases but not at estrus 
[2]. Furthermore, the induction of cyclooxygenase by go- 
nadotropin in the endometrium was associated with an in- 
creased level of PGF 2ft , while in the cervix PGE 2 was the 
major product. 

The bFSH was of immunological grade and devoid of 
residual biological contamination of LH, thyroid-stimulat- 
ing hormone, growth hormone, prolactin, or ACTH. The 
effects on increasing cAMP, PGE 2 , and cyclooxygenase as 
well as induction of FSHR were therefore specific. Fur- 
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thermore, it was found that highly purified hFSH (10-20 
ng/ml) (donation of J.A. Dias) had the same effects as 
bFSH when incubated with cervical tissue under the same 
conditions (2.5-fold increase in cAMP; 3-fold increase in 
PGE 2 ; 2-fold increase in cyclooxygenase). 

The relaxation and opening of the gravid cervix is due 
to active biochemical and structural changes in the cervical 
connective tissue that are mediated in part by prostaglan- 
dins [24]. It is thought that stretching of the cervix is a 
factor in causing the release of these prostaglandins [25, 
26]. The effect of PGE in causing cervical softening in the 
ewe is also well documented [27-2-9]. In both nonpregnant 
and pregnant cows, PGE 2 causes an increase in cervical 
opening within 3 h of treatment [30]. PGE] administered 
intra-cervically in a jelly was shown to decrease cervical 
resistance within 24 h [31]. 

Recently, we reported [6] that oxytocin caused a signif- 
icant stimulation of PGE 2 production in vitro in cervical 
tissues from pre-estrous/estrous cows but had no effect on 
PGE 2 production in cervical tissue from other stages of the 
estrous cycle of the cow. Similarly, in an initial report [31], 
it was shown that administration of oxytocin to pre-estrous 
cows increased the concentration of PGE 2 in the cervical 
exudate. However, peripheral oxytocin concentrations dur- 
ing estrus are lower than during the luteal phase [32, 33], 
and elevations of both peripheral oxytocin and cervical 
oxytocin receptor concentration are necessary for oxytocin 
to cause cervical softening towards parturition [24]. Fur- 
thermore, progesterone in vitro induced a dose-dependent 
inhibition of PGE 2 release by cervical tissues from pre- 
estrous/estrous cows, and this was associated with a de- 
crease in both basal and oxytocin-stimulated PGE 2 produc- 
tion [6]. It would therefore seem that hormones other than 
oxytocin are responsible for the increase in cervical PGE 2 
in the pre-estrous/estrous cow. The present report indicates 
that FSH, which has its peak peripheral concentration at the 
time of estrus, could be the hormone that increases cervical 
PGE 2 , as the FSHR expression is maximal at this time and 
FSH "in vitro increases PGE 2 production by the cervix. 

The bovine cervix at pre-estrus/estrus has high levels of 
FSHR protein and its corresponding mRNA. Activation of 
the receptor by FSH is associated with the G-protein-cou- 
pled receptor family that mediates the cAMP and IP sig- 
nalling pathways. These signalling pathways then increase 
the expression of cyclooxygenase and production of PGE 2 . 
The expression of the FSHR was maximal at the time of 
the FSH peak in the blood and suggests a physiological 
role for FSH in the relaxation and opening of the cervix at 
estrus. 
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The aipha(v)beta6 uategrin receptor for Foot-and-mouth disease virus is 
expressed constitutive^ on the epithelial cells targeted iu cattle. 

Monaghan P , Gold S, Simpson J, Zhang Z , Weinreb PH , Violcttc SM . Alexanderscn 
SL Jackson T . 

Institute for Animal Health, Pirbright Laboratory, Ash Road, Pirbright, Surrey GU24 
ONF, UFC 

Field strains of Foot-and-mouth disease virus (FMDV) use a number of alpha(v)- 
integrins as receptors to initiate infection on cultured cells, and integriris are believed to 
be the receptors used to target epithelial cells in animals. In this study, 
immunofluorescence confocal microscopy and real-time RT-PCR were used to 
investigate expression of two of the integrin receptors of FMDV, alpha(v)beta6 and 
alpha(v)beta3, within various epithelia targeted by this virus in cattle. These studies show 
that alpha(v)beta6 is expressed constitutively on the surfaces of epithelial cells at sites 
where infectious lesions occur during a natural infection, but not at sites where, lesions 
are not normally formed! Expression of alpha(v)beta6 protein at these sites showed a 
good correlation with the relative abundance of beta6 mRNA, In contrast, alpha(v)beta3 
protein was only detected at low levels on the vasculature and not on the epithelial cells 
of any of the tissues investigated. Together, these data suggest that in cattle, 
alpha(v)beta6, rather than alpha(v)beta3,,serves as the major receptor that determines the 
tropism of FMDV for the epithelia normally targeted by this virus. 
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Uroldnase-mediated posltranscriptional regulation of uroldnase-rcceptor 
expression in non small cell lung carcinoma. 

Montuori N . Mattiello A, Mancini A, TagUalatcla P . Caputi M . Ros$i G, Ragno P . 

Istituto di Endocrinologia ed Oncologia Sperimentale, Consiglio Nazionale delle 
Ricerche, Naples, Italy. 

The urokinase-type plasminogen activator {uP A) and its cellular receptor (uPAR) are 
involved in the proteolytic cascade required for tumor cell dissemination and metastasis, 
and are highly expressed in many human tumors. We have recently reported that uPA, 
independently of its enzymatic activity/is able to increase the expression of its own 
receptor in uPAR-transfeeted kidney, cells at a posltranscriptional level. In fact, uPA, 
upon binding uPAR, modulates the activity and/or the level of a mRNA-stabilizing factor 
that binds the coding region of uPAR-mRNA. We now investigate the relevance of uPA- 
mediated posttranscriptional regulation of uPAR expression in non small cell lung 
carcinoma (NSCLC), in which the up-regulation of uPAR expression is a prognostic 
marker. We show that uPA is able to increase uPAR expression, both at protein and 
mRNA levels, in primary cell cultures obtained from tumor and adjacent normal lung 
tissues of patients affected by NSCLC, thus suggesting that the enzyme can exert its 
effect in lung cells. We investigated the relationship among the levels of uPA, uPAR and 
uPAR-mRNA binding protein(s) in NSCLC. Lung tissue analysis of 35 NSCLC patients 
shows an increase of both uPA and uPAR in tumor tissues, as compared to adjacent 
normal tissues, in 27 patients (77%); 19 of these 2.7 patients also show a parallel increase 
of the level and/or binding activity t>f a cellular protein capable of binding the coding 
region of uPAR-mRN A. Therefore, in tumor tissues, a strong correlation is observed 
among these 3 parameters, uPA, uPAR and the level and/or the activity of a uPAR- 
mRNA binding protein. We then suggest that uPA regulates uPAJR expression in NSCLC 
at a posttranscriptional level by increasing uPAR-stability through a cellular factor that 
binds the coding region of uPAR-mRNA. Copyright 2003 Wiley-Liss, Inc. 
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Vascular endothelial growth factor expression correlates with matrix 
metalloproteinases MT1-MMP, MMP-2 aud MMP-9 in human 
glioblastomas. 

MunautC, Noel A, Hougrand O , Foidart JM. BoniygrJ T^r^jyi- 

Laboratory of Tumour and Development Biology, University of Liege, Liege, Belgium. • 

Vascular endothelial growth factor (VEGF) is the major endothelial mitogen in central 
nervous system neoplasms and it is expressed in 64-95% of glioblastomas (GBMs) 
Tumour cells are. the main source of VEGF in GBMs whereas VEGF receptors (VEGFR 
1, its soluble form sVEGFR-I , VEGFR-2 and neuropilin-l) are expressed predominantly 
by endothelial cells. Infiltrating tumour cellsand newly.formed capillaries progress 
through the extracellular matrix by.local proteolysis involving matrix metalloproteinases 
(MMPs). Recent studies have shown that VEGF expression and bioavailability can be. 
modulated by MMPs. We reported previously that the expression of MT1 -MMP in 
human breast cancer cells was associated with an enhanced VEGF expression We used 
quantitative RT-PCR, Western blot, gelatin zymography and immunohistochemistry to 
S feeXpreSSi0n of VEGF, VEGFR- 1, VEGFR-2, sVEGFR-1, neuropilin-l,MTl- 

• MMP - 2 . MMP-9 and TIMP-2 in 20 human GBMs and 5 normal brains The 
expression of these MMPs was markedly increased in most GBMs with excellent 
correlation between mRNA and protein levels; activated forms of MMP-2 and MMP-9 
were present in 8/18 and 7/18 of GBMs. A majority of GBMs (17/20) also expressed high 
*^7d aS prev,ous| y re Por*ed, with strong correlation between VEGF and MT1- 

MMP gene expression levels, and double immunostaining showed that VEGF and MT1- 
MMP peptides co-localize in tumour and endothelial cells. Our results suggest that the 
interplay between metalloproteinases and VEGF previously described in experimental 
tamours may also be operative in human GBMs. Because of its dual ability to activate 
MMP|-2 ajid to up-regulate VEGF, MTl-MMP might be of central importance in the 

f^Zr, BMS re P resent ™ interesting target for anti-cancer treatments. Copyright 
2003 Wiley-Liss, Inc. 
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MT1-MMP, MMP-2, MMP-9 and TIMP-2 in a series of 20 GBMs 
and 5 normal brains. Using quantitative RT-PCR,. gelatin zyroog- 
raphy, Western blot and immunohistocbemistry, we showed a 
strong correlation between the expression of VEGF, MT1-MMP, 
MMP-2 and MMP-9 in GBMs. These results are in accordance 
with previous in vino studies and add to the evidence of an 
interplay between VEGF and MMPs in the progression of human 
GBMs. 



MATERIAL AND METHODS 

Patients 

We studied 20 GBMs diagnosed at the Laboratory of Neuropa- 
tbology-CHU Uege between 1997 and 2001. The series included 
17 primary GBMs (ie./no previous history, of lower grade diffuse 
astrocytoma) and 3 secondary GBMs (£.«■., previous history of 
lower grade diffuse- astrocytoma). Clinical information on these 20 
cases have been reported previously as part of a larger series. 31 The 
gender ratio was 1/1 , and the age at time of diagnosis ranged from 
41-79 years (mean 56 years^ Normal brain cortex and white 
matter were obtained from 5 patients with intractable epilepsy 
treated by partial temporal lobectomy. Histological examination of 
these specimens showed severe hippocampal sclerosis; frozen tis- 
sue was sampled from microscopically normal inferior temporal 
gyri. Our study was approved by the Ethical Committee of the 
Faculty of Medicine of the University of Liege. 



RNA extraction and cDNA synthesis 

Total RNA was extracted from cryosections with RNeasy Mini 
Kit (QJAGEN GmbH, Hilden, Germany) according to the manu- 
facturer's protocol. Total RNA (1 jxg) was reverse transcribed with 
a ThermoScript reverse transcriptase (ThermoScript RT-PCR Sys- 
tem, Invitrogen, Carlsbad, CA) and random hexamers is primers. 

Primers 

: Primers pairs used in our study are described' in Table L Primers 
for the VEGF gene were chosen to distinguish between YEGF, B?> 
VEGF 165> VEGF I43 and VEGF I2I mRNA isoforms. Intron-span- 
ning primers and probes for the TaqMan system (primers for 
VEGFR-1 (Rt-lXsVEGFR-1, VEGFR-2 (KER/Flk-1) and NRP1) 
were designed to meet specific criteria by using Primer Express 
software (Perkin Elmer, Foster City, CA)l All primers were syn- 
thesized by Eurogentec (Liege, Belgium). The 5'- and 3'-end 
nucleotides of the probe were labeled with a reporter (FAM. — 
6-carboxy-flubrescein) and a quencher dye (TAMRA = 6-car- 
boxy-tetramethylrhodaroine). We conducted BLASTn (National 
Center for Biotechnology Information; Bethesda) searches against 
dbEST and the mm redundant set of GenBank, EMBL, and DDBJ 
database sequences to confirm the total gene specificity of the 
nucleotide sequences chosen for the primers. The specificity of the 
amplified PCR products was confirmed. either by restriction digest 
• or by sequencing. The 18S ribosomal RNA was measured using 
the Pre-Developed TaqMan Assay Reagents Endogenous control 
kit from Applied Biosy stems (Foster City, CA). 



TABLE I - SEQUENCE OF PRIMERS AND T^Ntow PROBES. USED FOR RT-PCR STUDIES 



Gene and accession 
number 


Posilioo 


MMP-2 FP 


1740F 


MMP-2-RP . 


1964R 


NM_004530 . 




MMP-9-FP 


1592F 


MMP-^RP 


1800R 


J05070 




MMP-14-FP 


1288F 


MMP-14-RP 


I508R 


NMJXM995 




T1MPI-FP 


78F 


T1MPJ-RP 


245R 


M 12670 




TIMP-2-FP 


78F 


7TMP-2-RP 


245R 


NM 003255 




VEGF-FP 


1208F 


VEGF-RP 


1687R 


AH0OJ553 




28S rRNA-RP 


12403F 


28S iRNA RP 


I2614R 


U1336? 




VEGFR1-FP 


243SF 


VEGFR1-RP 


2516R 


VEGFR1 Probe 


2469 


AF063657 




sVEGFRl-FP 


2209F 


sVEGFRI-RP 


2388R 


sVEGFRl Probe 


2257 


U0II34 




VEGFR2-FP 


79JF 


VEGFR2-RP 


946R 


VEGFR2 Probe 


820 


AF063658 




NRPI-FP 


J831F 


NRP1-RP 


1942R 


NRPI Probe 


1883 


XM 034725 





Sequence 



Size 



Cycle* 



5'-agatcttcttcttcaaggaccggtt-3' 
5vggctggtcagtggcjtggggta-3' 

5' -gcg g ag attggga acc agctgt a-3 ' 
5'-gacgcgccrgtgtacacccaca-3' 

5'-gg ata ccca atgccc attggcc a-3' 
5'- ccattgggcatcc ag a a gaga gc-3 ' 

5 - catcctgttgttgctgtggctg at-3 ' 
5'-gtcatcttg;atctcataacgctgg-3' 

5'-ctcgctggacgttggagcaaagaa-3' 
5'-agcccatctggtacctgtggttca-3' 

5 '- cctggtgg ac atctt cc agg a gt a-3 * 
5'-ctcaccgcctcggcttgtcaca-3' 



5'-GTTCACCCACTAATAGGGAACGTGA-3' 
5'- GATTCTG ACTTAGA GGCGTTCAGT-3 ' 

5'-TCCCTTATG ATGCCAGC A AGT- 3 ' 

5'-CCAAAAGCCCCTCTTCCAA-3' 

5'-CCGGGAGAGACTTAAACTGGGCAAATCA-3' 

5'-ACAATCAGAGGTGAGCACTGCAA-3' 
5'-TCCGAGCCTGAAAGTTAGCAA-3' 

_5'-TCCAAATTTAAAAGCACAAGGAATGA TTGTACCAC-3' 

5'-CTTCGAAGCATCAGCATAAGAAACT-3' 

5 ' -TGGTC ATC AGCCC ACTGG AT-3 ' 

5 A ACCGAG ACCTA A A A A CCC AGTCTGG G A GT- 3 ' 

5'-CACAGTGGAACAGGTGATGACTTC-3' 

5'-AACCATATGTTGGAAACTCTGATTGT-3'. 

5'CCACAGAAAAGCCCACGGTCATAGACA-3' 



225 bp 
208 bp 
221 bp 
168 bp 
155 bp 



479 bp 
407 bp 
347 bp 
275 bp 

212 bp 



79 bp 



180 bp 



156 bp 



112 bp 



33 

37 

32 . 

33 

30 

33 

19 
40 

40 

40 

40 



850 



6000 
5600 
5000 
4500 



3 3500 



< 



. 2000 



VEGF VEGFR1 sVEGFRI VEGFR2 NRP 
to'SE l^?^ WEG ^ eptOIS mRNA ^^ficarion: scat- 
senis we mean of 3 separate experiment;. * 

™£T I^""'" , " V,; PCR f° r Mri-MMP. MMP-2, MMP-9 
TJMP-2 mRNA and. VEGF mRNA informs 

fr. MT} ^%' MMP ' 2 ' MMP!> > ""M^ VEGF mRNA iso- 
forms rVEGF lw , VEGF lt5 , VEGF M5 and VEGF.,0 were met 

Muga, Japan) and 5 pmol of each primers (Table ]). The iherma 
cychng commons included 2 min a, 95°C for denaturationTd 

«•£ K VEGF isoforms) with a final incubation 2 „^ a 
. ;Jr?J ,odaas wc,e '"olved od 2% Nusieve 31 aeam.se 
ge is <B"*tauta. Rockland, MD) and analyzed using a FW-S 
Multdmage, (B.o-Rad, Hercules. CA) af.J cU.id.um brornlde 
stammg. Specfic mRNA levels were „pr«sed as .£ SLtf 

.vcrr^T,,^ 1 ^ RTPCR a^ses for VEGFR-1 
sVEGFR-1 VECFR-2. NRPI mRNAs and J8S ,RNA were car' 
ned ou. us.no ,be AB1 PRISM 7700 Sequence ^TctXtcm 

OI the PCR pnmer pa,rs and fluorogenic probes thai were used for 
itjT T?,° WD ] " T3bk 1 A curve was generaTed by 

»SoO 80 T° ° f P,a " nla CDNA *° C0W ,h < of 
Si Wa ? ™ d "P«ca.e during every experiLn" 
K» each e X per.me„.al sample., .he amouni of targe, was 

o A™. " S,a " dard CUrV '- Thf «P™io™ieve 
of Ae .arge. gene was normabzed agains. 18S .RNA .o compen 
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SNA T i^f >i0 V m A u C T 3, '' iy ° fmA "* lhc am °™ of input 
-f Ustr Bult,^ 2V ^ maa ? f ™™ PB A PP««< Biosystems 
, LSr D 2) - ^ *"* camed ou' with the TaoMan Uni 

400 nM pnroers .n a 25 p.1 final action imxfure. After a 2 mm 
.DCv.bat.on at 50°C to allow for UNG cleavage, ArnphTao Grfd 
w^acjrvated by an incubation for lOrnm a, ^5^^ 
FCR cycks coasted of 15 sec of denaturation at 95°C and 
hybr.d.zatH>n of probe and primers for I min at 60T. 

To confinn amplification specificity, the PCR products were 
abp exanuned by subsequent 2% agarose gel etetrcXrZ 
Expenments were repeated at least 3 times in duplicate. 

Immunohistochemisiry for VEGF and MT]-MMP 

*' Ck) Were cn ' flora f^alin-fixed. paraffin 
embedded hunoui dssue. TTiey were hydrated through graded 
alcohols and incubated in (03% 15 min). Secrions^-ere 

^alfn^ «"? a ' 'rl 6 ° C in citra "= 0 O fTer PH6 for^,^ 
^T^-P*"*"®- Fot ^ble irnrnunostaiDi^ 
M^ n A ^ C M ^ m0noc)oml Ab anti-ltfr? 

rAlhlh^k ^ 10 °. (?»co g ene Research Products, San Diego, 
CA) followed by peroxidase-conjugated Envision (Dako) Inunu- 
= t.v,ry was visualised wid, S^diarrnncW^KT 

s ass's f^^rib ^ 

Fas Red^" Immunor " c, " v "y for VEGF was visualized with 
Fast Red chromogemc substrate (Dako). Single imrnunostainine 

2^2^™ ' °" ^ SeC ' i ° nS USi " g ^primary aSS^ 
alone w.th the conespondrng enzyme-chromogene combination 
_ Negauve controls were obtained by omitting the prinJ^Sl 

Gelatin zymography assay 

MMP-2 and MMP-9 acnviHes were quantified by gelatin zy- ' 
mography on 2 normal brains and 18 GBMs. Ten cryosectjons (10 
MM) were homogenized in buffer (0.1 M Tris-HCI pH 8.1, 0.4% 
Tr.ionX')00) and centrifuged for 20 min at 5,00Of. The pellets 
were d.scarded. 25 u.g of total protein from hornogcLc ~J£Z 
i£ e / e wuh non reduci "S sample buffer (62.5 mM 

SSTh J? V 2% S ? S: ,0% 0 ) * brlophenol 

SS*^SSlS tare, ? , . d,?ta:ll!jr °" J0% SDS-polyacrylamide 
gels (SDS-PAGE) conta.n.ng 0.1% gelatin (w/v).« After elecrro- 

mEP. 8 v Tnn W .^ ed fM 1 hr a ' I0om 'cmperatuie in a 2% 
K'™. X * so ' u ' ,on '» "=move SDS, transfened to a buffer 
(50 mM Tris-HCI, pH 7.6, containing 10 mM CaCU and incu- 
ba*d for 18 hr at 37'C Gels were stained for 30 rn^witb 01% 
(w/v) Coomass.e bnlhan, blue G250in 45% (v/v) me.hanol/10% 
'i^ aC nt' C r• aC ." , a " d deS ' 3ined in J0% < v/v > acelic *«iiW0» (v/v) 
d 9 Vn S D We .'? l M,yMd Wi ' h ° uan,i, y O" 1 software (ver- 
£ ■ , k Labora, 0" e5 , Hercules, CA) after densitomet- 
ry scannmg of the gels us.ng a Fluor-S Multimager (BioRad). 

Western blot 

MTl-MMP p,o.ein Jeveh were analyzed in 2 normal brains and 
buffer [0.25 M Tns (pH6.8), J0% SDS (w/v), 4% sucose (v/v) T 

fJa k'Ti a r Pt0e , m3n01 (WV) and 0 ,25% b ^opheno) blue (w/v)j 
and boled for 5 min. They were sep aia ted on 10% SDS^PAGE 
gels and transferred to a PV D F filler (NEN. Boslon. MA). After 
blocking wr,h 5% {w /v), 0.1% iween 20 (w/v) in PBS for 2 
T. a K ™ n J l , ( 5 m P« aru "- membranes were exposed ,o the primary 
annbody (10 jtg/ml clone 113-5B7, Oncogene Research 

Produus, San Drego, CA) a. 4X overnight foJlowed I by incubation 
w,ih a horseradish peroxidase-conj U o ale d rabbil anti-mouse ami- 
J ' 3 K ^/ml DaJco, Glosrmp, Denmark). Signals were dc- 
■S W M A ^ enh ? Ced chemolu ™n«cence (ECL) kit (NEN, 
Boston. MA) The relative in.ensiues of the immunoreaciive bands 
were analyzed w,ih Quaniiry One sofrw 3I e (version 4.7 2 Bio-Rad 
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Figure 2 - MMPs and T7MP-2 mRNA quantifica- 
tion, (a) Representative 2% agarose gels of RT-PCR 
products for MTJ-MMP, MMP-2, T1MP-2 and 
MMP-9 in 2 norma] brains (NB) and 10 GBMs. {b) 
Scatter plots (as described in Fig. I). Experiment was 
repeated at least 3 times in duplicate. 
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Laboratories) after densitometry scanning of the X-ray films using 
a Fluor-S Mullimager (Bio-Rad). 

Statistics 

. VEGF, VEGFRs, MMPs and TJMP-2 expression, values in 
GBMs were correlated using Spearman's test. Correlation was 
considered significant for 2-taiJed p~ value < 0.05. Statistical anal- 
ysis was carried out using tbe Prism 3.0 software (GraphPad, San 
Diego, CA). 



RESULTS 

Expression of VEGF and VEGF receptors 

VEGF mRNA was present in normal brains (295-375, aibiirary 
units; mean = 322) and in all GBM samples (217-5,1 12; mean = 
L774) as reported previously (Fig. I).*' In most GBMs, VEGF 
mRNA levels were raised 2- 1 5- fold above normal brain values. 
Tbe most abundant isoform in all cases was VEGF )65 , followed by 
VEGF J2|J VEGF 1B? and VEGF,.,, (data not shown). VEGFR- 1 
expression was found at similar levels in GBMs (89-357: mean = 
182) and normal controls (154-198: mean = 1 8 J). There was no 
correlation between VEGFR- 1 and VEGF mRNA levels {p - 
0.35) in GBMs. VEGFR- 2 was expressed in all GBMs (48-582: 



mean — 210) and in 8720 cases at least twice nprraal values 
(87-rlll; mean = 103). VEGF and VEGFR-2 expressions were 
correlated significantly ip = 6.0035) in GBMs. NRP1 expression 
varied broadly between GBMs (75-3,260; mean ~ 1,061) con- 
trasting with a constant baseline expression in normal controls 
(383-397; mean - 390). In rumours, NRP1 correlated with 
VEGFR-2 ip = 0.0119) but not with VEGF ip = 0.084X dot 
VEGFR- 1 [p = 0.066). sVEGFR-I was expressed at low levels 
both in normal brains (84-92; mean - 87) and GBMs (25-208; 
mean = 101). s VEGFR- 1, however, was found to correlate with 
VEGFR- 1 ip = 0.0289), VEGFR-2 ip = 0.0029), and NRP1 ip - 
0.0027) but not with VEGF ip = 0.053). 

Expression of MMPs and TJMP-2 

MT1-MMP, MMP-2 and MMP-9 were expressed in both nor- 
mal brains and GBMs but at much higher levels in the latter (Fig. 
2a). MT1-MMP mRNA levels were "constantly higher in GBMs 
(34-202; mean = 106) than in normal controls (3^29; mean = 
15). MMP-2 and MMP-9 mRNA levels were higher than controls 
in 18/20 and 14/20 cases respectively (Fig. 2b) and correlated with 
each other {p = 0.0187). MTI -MMP.mRNA levels correlated with 
MMP-2 ip = 0.0008) and MMP-9 ip = 0.005). TIMP-2 had a 
non-discriminative distribution in relation io the controls. TJMP-2 
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used as a positive control (+) *mjK&£^? * S ¥^> Werc 
norma! brain (N) and GBMs. ? f>n)tem » detec '«* in 



MTI-MMP ■ MM?-2 



VEGF 

VEGFR-] 

VEGFR-2 

NRPj 

sVEGFR-1 



0.0250 
0.0073 
<0.0001 
0.0053 
0X3)3 



Gfclaiin Jjrmograpby 



0.0245 

0.0710 
0.0168 
0.1334 
0.0469 



0.0053 
<0.000] 
0.0004 
0.2457 
0.0194 



Nl 

m. 

GB1 

GB2 

GB3 

GB4 

GB5 

GB7 

GB8 

GB9 

GB12 

GB13 

GBM 

GBI5 

GBJ6 

GBI7 

GB18 

GB20 

GB21 

GB22 



— — _. . Western bk* 

.^oMM^ MMP-2 MTJ-MMP 



71MP-2 

0.0094 
0.4542 
0.0J53 
0_5480 
0.2563 



0.41 

0.28 
9.00 
4.28 
1.67 
1.40 
0.47 
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5.75 
5:02 
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8.00 
5.55 
6.61 
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J6.52 
20.66 . 
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0.83 
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2.86 
4.33 
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5.11 
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and '. t^.^'"" '."V-MWpmn an* acn mud MMP-2 
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" S-'o'S^! 1 ^^'fcrived activated MMP-2 levels 
<p - 0 0226) but not with activated MMP-9 levels Z -n£n 

script inS:" 8 "^ fW a "^"y ^ - 

Corretatim Wren VEGF network and MMPs 

.heK!rr noreac,ii ; i,y was sbown in «»* >»™<» and e»dc- 



DISCUSSION 

GBMs are highly maligna, lumoore ; fc nosjs ^ 

V^GI^-'%EG C F 0 - kaSC11,ar prolifera,ioD and express WgMevelsof 
Dromoiin. " 3 ?" 0 " e " )i,0Een ' M «"<»o.heli a l cells thereby 

VhGJ- also strmolaies mmour cell invasioD, migraiion a^l survival 

soteequent amplication of cell prolifera.jon and protection 

melanoma and breast carcinoma cells. MTl-MA^P upreeulates 
VEGF e,press,on whereas T1MP-2 reduces i-JSS 
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Figure 5 ~ Imreunohisiocbcniis- 
try (scale bar = 50 um> (o^) VEGF 
positive tumour and endothelial cells 
(plain anows) show granular red 
staining of the cytoplasm. Negative 
cells (empty arrow) are seen in their 
close vicinity, (c) MTI-MMP posi- 
tive cells show a strong brown cyto- 
plasmic staining: They include tu- 
mours cells (plain arrow) and * 
endothelial cells (empty arrow), (d) 
MTI-MMP positive rumour cells 
(plain arrow) are mixed with nega- 
tive cells (empty arrow) (ej) Double 
staining with VEGF (red) and MTI- 
MMP I (brown). Double positive tu- 
mour cells (plain arrows) contrast 
with negative or single weakly pos- 
itive cell (empty arrow). 



pericellular proteolysis mediated by -MTI-MMP in GBMs could 
also induce an autocrine loop resulting in enhanced VEGF expres- 
sion. In rum, VEGF could act as a paracrine factor on endothelial 
cells to stimulate angiogenesis. or possibly as an autocrine factor 
promoting glioblastoma cells survival-migration and invasion as 
demonstrated recently in the various rumour cell culture models. 

We compared the expression of VEGF and its receptors with 
MTI-MMP; MMP-2 and MMP-9 in 20 GBMS and 5 norma] brains. 
The expression of these MMPs was markedly increased in most 
GBMs with excellent correlation between mRNA and protein levels. 
MTI-MMP expression has been shown previously to correlate with 
glioma aggressiveness and its transfecrion in different tumour cell 
lines triggers an angiogenic phenotype and promotes rumour 
growih. 2 ^ 8 -^- 36 -^ A majority D f GBMs ( 17/20) also expressed high 
levels of VEGF, as previously reported, with a strong correlation 
between VEGF and MTJ-MMP gene expression levels. Double in> 
munostainiDg studies showed co-expression of VEGF and MTI- 
MMP by the same rumour cells. These data suggest that the transcrip- 




tional control of VEGF by MTI-MMP could be operative doi only hi 
vitro but also in vivo in human GBMs. 

MTI-MMP could also promote the growth of GBMs by its 
ability to activate MMP-2 in the presence of Jow concentration of 
T1MP-2. J0 Pro-MMP-2 activation occurs after the formation of a 
ternary complex that contains pro- MMP-2 linked to cell surface 
MTI-MMP via a TJMP-2 bridge. In accordance with this hypoth- 
esis, we found that MMP-2 activation occurred in 8/ J 8 of our 
GEMs 20 - 4 ' among which 7/7 tested for MTI-MMP showed high 
contents of this protease. 

Activated MMP-9 was arso found in 7/18 of our GBMs. This is 
an interesting finding as active MMP-9 is able to mobilize VEGF 
from its ECM reservoir. 39 Therefore, MMPs could promote 
VEGF-mediaied angiogenesis in GBMs by both transcriptional 
(MTI-MMP) and post translationa] (MMP-9) mechanisms. 

VEGF binding to VEGFR-2 triggers the proliferation and mi- 
gration of endothelial cells whereas its binding to VEGFR-) has 
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mKNA levels were correlated with VEGFR-2 but nol VEHFR I 

dnrin? wS? r ° mPleX J Panem ° f VEGF «=«P'o«, tans. 
, 2 VE GFs,gnaJmg toward cell proliferation and migration 

J^Tl^L 00 ' ^ addS to "* ™<knce for an interplay 
between, roetaDoprolemases and VEGF in human GBMs as „c2 

to acDvale MMP-2 and to up-regulate VEGF, MTI-MMP mi«ht be of 
sent an interesting target fox anti-cancer treatments. 
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angtogenesis. Cell 1993;72:835-46. 



mi- 

.- 




INTERNATIONAL 



rial 



HEALTH SCIENCES LIBRAR> 
UNIVERSITY CF WISCONSIN 

SEP 1 0 2003 

1305 Linden Drive 
Madison, VW 53705 




International Union Against Cancer 



J. Sefhun, President (USA) 

i_J. Denis* Treasurer (Belgium) 

L Batty, Chair, Finance Committee (USA) 

G. Brien (Australia) 
R.C. Burton (Australia) 
M. Daube (Australia) 
KJV. DiwsHAW (India) 
L. ElovaJNIO (Finland) 
M.K- GosPOaMtowicc (Canada) 
R.T. Hudson (Ireland) 
T. Kjtacawa (Japan) 
A. Pumdauk Kuwcure (India) 
RJE. Lenhard (USA) 
• A. Uombart- Bosch (Italy) 



COUNCIL 



MXuwJA (Indonesia) 
C Mallinson (UK) 
IJL Marchesi (Brazil) 
. H.F. Mickelsow (USA) 
K. Nilsson (Sweden) 
S. Omar (Egypt) 
T. Phiut (France) 
E. Robinson, Past President (Israel) 
Y. Saloojee (South Africa) 
H. Sancho-Garhier (France) 



RJ. SorvoTZER (USA) 
O. Sckeide (Norway) 
K. Tajjma (Japan) 
W. Weber (Switzerland) 

' S. Wilkinson (UK) 
D. Zaocs (USA) 
D. Zaridze (Russia) 
Y. Hut Zhang (China) 
_M. Zrv (Israel) 

. M 2UR Hausen (Germany) 



U1CC, URL; http^/www.uicc.org 
U ICC, E-maih mfo@uicc.org 



Particular emphasis is pbeed oo professional and pub)»c ethiotfjon. ... M „, . w,w,„™ 

FW^inl933. .be UKTC is a noo-govemintMal. indepeodeM association of 0,01c than 290 member organizations i» over 80 countnes. Members « vohmury 
eancej »ea S oes and societies, cancer research ami/or ueairoart ceirta^ and in some coimtnw fn»r..slnes of health. 

The UJCC is S ovcn>cd by iu member, which meet in General Assembly ever, 4 years. I«'ere«ed Council and E*ec»me Con*nmec « response for Prolan. 



SUBSCRIPTION INFORMATION 



o 2003 WBej-JUss, Inc., a Wiley Company AJ1 rights reserved. No part of this publication may 
be reproduced in any form or by anj means, except as permitted under section 107 or 108 ol 
the 1976 United Stales Copyright' Act, without either the prior written permission of the 
publisher or authorization through the Copyright Clearance Center, 222 Bosewood Drive, 
Danvers,MA 01923. telephone: (978) 750-8400, fax: (978)750-4470. Bequests tothe pnbhsher 
for permission should be addressed to the Permissions Department., do, John Wey & bons 
IncTni Hire* SU. Hoboken; NJ 07030. Faa: (201) 748-6008; Tel.: (201) 748-6011; http:// 

WT^.wiley.conygo/permissions, 

International Journal of Cancer (Print ISSN 0020-7136; Online ISSN 1097-0215) is 
published 30 times a year. seroi-rooDthly with extra issues in January, March, May, JuJy, 
September, and November, by Wiley-Liss. Inc., through Wiley Subscription Services, Inc a 
Wiley Company Send subscription inquiries in care of John Wiley & Sons, Inc.,, Attn. 
Journals Admin Dept VK> 111 River St, Hoboken, NJ 07030. (201) 748-6645 
Advertising inquiries should be addressed to Advertising Department, do John Wiley & 
Sons. Inc., HI River St., Hoboken, N J 07030. Telephone: (201) 748-6921. . 
Offprint sales and inquiries should be directed to the Customer Service Department m 
cart of John Wiley & Sons. Ill River St.. Hoboken, NJ 07030. Telephone: (201) '48-8776. 
Subscription price: Volumes 103-)07 T 2003 (30 issues). Print only: $2,495 worldwide. 
Electronic only: $2,495 Worldwide. A combination price of $2,620 worldwide includes the 
subscription in both electronic and print formats. All subscriptions containing a print ^ ele- 
ment, shipped outside the H.S., v»ll be sent by air. Payment must be made in U.S. dollars 
drawn on VS. bank. Periodicals postage paid at Hoboken, NJ and at additional mailing 
offices. Postmaster send address changes to INTERNATIONAL JOURNAL OF CANCER, 
Subscription Distribution, do John Wiley & Sons. Inc.. Ill River St., Hoboken, NJ 07030. 
Change of address: Please forward to the subscriptions address listed above 6 weeks pnor 
to move; enclose present mailing label with change of address. Claims for missing issues? 
Claims for undelivered copies will be accepted only after the following issue has been 
received. Please enclose a mailing label or cite your subscriber reference number. Missmg 
copies will be supplied when losses have been sustained in transit and where reserve stock 
permits. Send claims in care of John Wiley & Sons. Inc., Attn: Journals Admin Dept UK, 111 
River St. Hoboken, NJ 07030. . 
■ Indexed by: EMBASEflEacerpta Medic a * Current Contents/Life Sciences • Science Citation 
Index * SdseaTch * BIOSIS Data Base • lndes Medicus • Cambridge Scientific Abstracts • 
Chemical Abstracts * Reference Update • Smoking and Health Database. 

This journal is printed on acid-free paper. 



INTERNATIONAL JOURNAL 
OF CANCER 
2003 Wiley-Liss, Inc. 

The International Journal of Cancer IS 
published for the International Union 
Against Cancer by Wiley-Liss, Inc., a di- 
vision of John Wiley & Sons. Inc. Five 
volumes are issued annually, each consist- 
ing of six numbers. 

Abstracting and olher journals may re- 
print the summaries of articles without re- 
questing authorization. Authors alone are 
responsible for views expressed in signed 
articles. The mention of specific compa- 
nies or of certain manufacturers* products 
does not imply that they are endorsed or 
recommended by (he International Union 
Againsl Cancer. 



References; When quoting from the 
International Journal of Cancer, 
please use the official abbreviation: 
Int. J. Cancer 



DNA hypermethylatioD is a mechanism for loss of expression of the HLA 
class I genes m human esophageal squamous cell carcinomas. 

NjeY, YangG, SjmgY, ZhaoX ? SoC, U M J,W M£ LD,Y 2£s£ S. 

Laboratory for Cancer Research, College of Pharmacy, Rutgers-The State University of 
New Jersey, } 64 Frelinghuysen Road, Piscataway, NJ 08854-8020, USA CTSlty ° f 

TTie three human leukocyte antigen (HLA) class I antigens) HLA-A, HLA-B and HLA-C 
Play ,mportantroIes :fc .the elimination of t-n^edddlsly^^T^b F ^±S 
^ss of express,on of these antigens at the cell surface has been observed in many E 
™' r° US rf 31,18,1,5 f ° r ^-^ripfonal regu.ation have been proposed^ 
bvlm T 1 7 l r l * m f ianisms for transcriptional regu.ation are not cLr We slw 
by immunohurfochermstry that the HLA class I antigens are absent in 26 of 29 (89%) 
samp es of human esophageal squamous cell carcinomas (ESCC). Eleven of the 26 ESCC 
samples lost MA expression for* least one of the HLA. genes', as showf by RT-PCR 

r!£S P T f f . ESCG " nd nei S hbo ™S normal epithelium were examined for 

CpG .sland hypermethylaaon, homozygous deletion, microsatellite instability (MSR and 

SS, J t ^ S ¥i L ° U) - ° NA fr ° m epithdial tissues had ™ ^^table 

JSSSSL ^ ? land r 0{my of t,lese gcne loci ° f 29 ESCC 

i k12 ^, methxlatl ° n ° f ° ne or more of the *«* "LA loci and six samples 21%) 
exlub,ted methy ation of all three loci. The HLA-B gene locus was m ost fr^Wy } 
methylated 38%). HLA-B mRNA expression in an ESCC cel. line, where HLA B was 
hypermethylated and d,d not express mRNA, was activated after treatment with 5 Z™?- 
deoxveyhdme. Homozygous deletion of these three gene loci was not observed 

D6S30 V 6 nrT,S^oS H W6re ° taervBd f0r die microsatellite markers 

D6S306 D6S258, D6S273 and D6SI666, close to the HLA-A, -B and -C loci although a 

* 3 nCarby ' 0CUS (repreSented ^ the -£s D6S1 of, 
andD6S1560 where the tumor suppressor gene P 2I(Wafl) resides. A strong correlation 
between genejc alterations and mRNA inactivation was observed in the ESCC sample? 

EScS h St ^ - C,3SS 1 ^ CXpreSS10n WaS ^™«y ^wn-re^l aS b' 
ret^o^ leVdS that '^^ethylation of the promoter 

SivaUon 3 8eneS 15 3 maj ° r mechanisin of transcriptional 
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Comment in: 

* H um Pathol. 200.1 Ji.l-ijQ ^j^g, 
| __ h< i '-^ h ^ [i ^ aiho! J gy _ j 

a n ti-HER2/„eu polydon ^STsTx T?? T lmmUnohist ^en,i Stly using an 
establish basal £J, 0 t£S£ "mRNA HErS S P ecimcns ««* to 
nonnal tissues and SSs Fonrnf n transcnpts were detected in all 

leve,s above ^ 

level (P = 0 01) Incrls^ HfS ox^ ^ ^ Pr ° tem ex P res *°n and mRNA 
PMID: 12874758 [PubMed - indexed for MEDLINE). 
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Molecular and Immunohistochemica! 
Sarcoma 0 ' HER2/neu Oncogene in Synovial 



Amplification ibd/or ovexfmesdoD of HER2/ n „i k 

^1 pc ^ Dal ""^^ su »—» ™L 

of im™JL ng „ CT ^"->»-l*-*P-™ 

.5 ooserved, ud a 5Bong correfeuoD was 

^though traditionally considered o be fhi^^ 
^fl»cnce prognosis, including morpho- 
«he plo.dy s,atus, a,,d 0,e apoptouc mdcx ' * * 

such Pm K Cn ' ° f """P^fc advances 

prL^i^ hll PeClfiC J?^** of »«Jefuhr aherations" 
present ,„ human malignances. haj b 



Pini, Milan |J, y . , nr , .„, a ™ OK, CT. Orthopedic l mu , ult Gaelano 

x kt, !S5 a„ d ;^— ■ 

D, Kmhni 8. 20)4?. M,bno ] la Jy ' fV>, °' Vw A " 

, //03/3-j07-O00?i30.00/0 
do ':J0- 10] 0/S00-i<VS 177(03^0238-7 • 



found benvcen p^tcfa, «pr«sio D and mRNA l«J /*> - 0 „.\ 
viU, mRNA levels. O^^Zm^fl I""*™ .'««»-« 

tumor features Oia, are prediclive ofresponse to therapy 
One or the most exclusively slud ied molecular 

HER?/ " HER - 2 /" eu P'o.oo„co 6 e„e. 

J he HER-2/„eu oncogene (also kr, OW n as c-crbB-2) 

? Chlomosome »q2L » » membe, of the 
ryrosme Jonase receptor family and encodes for a 185- 
kilodalton proton tha, shows 50% homology with the 

phficd.and/or over expressed in"20% ,o 30% of breast 
carc3„ om as^ „ d m various ^ ^ «^ 

IS 0 '"^ ,U ?.° r a ^ e «»"^» poor prog 7 

h,I f Pr ! djC ' U,C re5 P° nsc lo chemotherapy in 

bteas. cancer, and the use of recombinant hum anized 
anuboCcs to HER-2/„e u pro.ein (Tras^ul t)Tn Ure 

The role of HER-2/neu activa.ion in sol. tissue . 

Z ZLTr POOl)y Und " St °^ »<« ^arce molec- 
ular data bacXmg .mmunohis.oche.nkal studies have 
been d . HER . 2/neu n « >,ave 

; »..noh»,ocbemKaHy Mudiec. i„ 204 s^comas. includ- 

',»IL, , OV "™P>^>°» «s absent in all ,hese 
mabgnani mesench>Tnal neopl.vms 10 

RecenUy, HER-2/„e» aj.e.abons have be,-,, de- 
scribed u, osieosarcoma, «th a high incidence of pro- 
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TABLE 1. Clinicopathologic Features of 13 Patients With SS 



No. 


Age(y)/C 


Site 


Subtype 


Surgery 


Chemo/Radio 


Rec 


Mets 


HER2 


FU(roo) 


1 


68/F 


Kl 


MF 


RR 


No 


Yes 


Yes 


L 


108 


2 


7l/F 


KE 


. BF 


WA 


No 


No 


No 


L 


12 


3 


27/M 


KJ 


. BF 


ME 


Yes 


Yes 


Yes 


L 


24 




39/M 


T 


MF 


WE. 


Y« 


No 


No 


H 


48 


5 


15/M 


N 


MF^D 


1VE ■ 


Yes 


No 


No • 


H 


48 


6 


41/F 


F 


MF,PD 


WA 


Yes 


No 


• No 


H 


48. 


. 7 


57/F 


Kl 


MF 


v/a > 


No 


N/A 


N/A , 


L 


N/A 


8 


48/M . 


F 


BF,PD 


ME 


Yes 


Yes 


No 


L 


36 


9 


29/F 


T 


PD 


WE 


Yes 


Yes 


Yes 


L 


36 


id 


27/M 


KE 


BF 


WE 


No 


No 


No ' 


H 


24 


H 


62/F 


A . 


MF 


WE 


No 


Yes 


No. 


L 


24 


12 


62/F 


F 


MF 


WA 


No 


No 


No 


L 


2 


13 


34/F 


L 


MF 


RR 


No 


No . 


No 


L' 


2 



Abbreviations: Age, age at diagnosis; G,. gender; M, male: F. female; Site, anatomic location; Kl, knee, intraarticular; KE;*knee, 
extra-articular; T, thigh; N. neck; F. foot; A, arm; U leg; Subtype, histological subtype; MF, roonophasic fibrous; BF, triphasic; MF-PB, 
roonophasic fibrous with, poorly differentiated areas; BF-PD, biphastc with pooih; differentiated areas; PD, poorly differentiated;' Surgery* 
primary surgical therapy; RR, radical en bloc resection; WA. wide through-bone amputation; ME, roarginal en bloc excision; WF^wide en bloc 
exrision; Chemo/Radio, adjuvant postoperative chemotherapy and/or radiotherapy; Rec, recurrence; Mctt, presence of metastasis; HER2, 
HER2 mRNA expression; L, low expression; H, high expression; FU, follow-up status; N/A, not avaible. 



tein expression, ranging from 42% to 61 %} JI5 Indeed, 
despite limited information on Her2/neu in ibis type 
of malignancy, based mostly on immunohistochemical 
findings, 2 clinical trials of Trastuzumab have been 
initialed for recurrent and metastatic osteosarcoma pa- 
tients (http://wiviv.cancer.gov/cKmcaJ_trials: MSKCC- 
99097/NC1-T98-O083 and COG-AOST0121). 

Therefore, we evaluated the mRNA expression and 
the gene product of HER-2/neu in 13 SS patients using 
real-time reverse transcription-polymerase chain reac- 
tion (RT-PCR) and immunohistochemistry. To the best 
of our knowledge, this is the first report documenting 
increased levels of HER-2/neu mRNA and protein 
in SS. 



MATERIALS AND METHODS 
Potient Populotion 

Thirteen patients with primary SSs, obtained from the 
files of ihc Depart mem of Pathology, Caetano Pini Orthope- 
dic Ins rj tut e, were included in this study. Cases were chosen 
based on the availability of frozen primary rumor. Patient age 
ranged from 15 to 71 years (mean, 44.6 years). Anatomic sites 
included the knee (5 patients, 3 intra-anicular). thigh (2 
patients), foot (3 patients), arm (1 patient), neck (1 patient), 
aod loner- )cg (1 patient). The histologics) subtj-pes »»ere 3 
biphasic (BF), 6 monophasie fibrous (MF), 2 monophasie 
fibrous with poorly differentia ted areas (MF-PD), 1 biphasic 
with poo try differentiated areas (BF-PD), and 1 poorly differ- 
entiated (PD). Clinical staging was 1)B for all ihc patients. 
Local surgical excision was performed in 9 patients, amputa- 
tion, in 'I patients. Recurrence was observed in 5 cases with 
subsequent amputation (3 patients) and loot excision (2 
paticnw). Metastasis occurred to lung (2 patients) and ingui- 
nal lymph nodes (1 patients). Five patients icccived adjuvant 
postopeiDtive chemotherapy, and 1 patient (case 4) was 
nested with chemotherapy associated with radiotherapy Fol 
)ow-up ranged from 2 to 108 months (mean, 34.3 months). 
Clinicopathologk data are summarized in Table 1. 



Pathologic Studies and Frozen Tissue 
Selection 

In all cases, the primary tumor was available' for study. 
Surgically resected" tumor tissues were par try snap-frozen in 
liquid nitrogen and stored ai -80°C foT RNA extraction, and . 
partly fixed in buffered formalin and embedded in paraffin 
blocks. Hematoxylin and eosin-stained sections were reeval- 
uated and graded according to FNCLCC gradmg system. 16 
Frozen tissue blocks were handled as follows: 4-jnhick frozen 
sections were cut and stained with hematoxylin and eosin to 
determine the percentage of tumor cells present in the spec- 
imen. We used tissue blocks with tumor cells comprising 
more than 80% of the specimen. About 30 20-fi-ibick sections 
were collected into Eppendorf tubes. Another 4-/i-thick fro- 
zen section was cut after the serial sections and examined by 
light microscopy to guarantee the percentage of. tumor cells 
collected. Trizol (life Technologies; Gibco BRL, Gaithers- 
burg, MD) was used for RNA extraction, according to the 
manufacturer's protocol. RNA was quantified spec trophoto- 
merrically. , 



cDNA Synthesis 

Total RNA (200 ng) was reverse-transcribed in a total 
volume of 50 /iL containing 1 X TaqMan buffer, 5.5 mmol 
MgO?, 1 inmol deoxynucleo tides, 2.5 umol random hexaw- 
ers,20 U RNase inhibitor, and 62.5 U MuLv reverse transcrip- 
tase. The samples were incubated at 25°C for TO minutes, 
48^ for 30 minutes and 95°C for 5 minutes. 



PCR Amplification 

Amplification reactions were performed \snih the Univer- 
sal TaqMan 2 X PCR master mix in a volume of 25 /jL con- 
taining 300 nmol of each primer, 100 nmol of probe, and 5 
uL of cDNA: Both 0-actin and flER2/neu amplification were 
done in duplicate for esch sample. 

The iheimal cycling conditions included 2 minutes at 
SOX and 30 minutes ai 95 °C, followed by '10 cycles of 95 = C for 
15 seconds and 60°C for 1 minute. AJ1 reagents used for 
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RT-PCR wrrc purchased from Applied Biosystems (Fosier 
Gry, CA). 

Prime's ond Probes 

Prime™ and probes for>actin and HER2/neu mRNA 
were chosen using die computer program Primer Express 
^osystems). Sequences of the forward primer for 
HER2/DCU mRNA (CenBank accession number X03363) 
were 5'-TCC TGT GTG CAC CTG GAT CAC3' and the 
reverse pnmer 5'-CCA AAG ACC ACC CCC AAG A-3'- the 
?rTT£>°^ P Iobc 5 *(EAM)-AGC AGA ATG 

CCA ACC ACC CCACA-(TAMRA)-3\ Sequences of the for- . 
ward pnra C r for £acrin mRNA {CenBank accession number. 
X00351) were 5TCC TTC CTG CCC ATC GAG-3* and the 
reverse primer 5-ACG ACC ACC AAT CAT. CTT GAT CTT- 

5£ VrT^ f * C Z a( ^ Man pTobc ^(FAM)-CCT GTC 
GCA TCC ACG AAA CTA CCT TG(TAMRA)3\ Probes were 
purchased from Applied Biosystems. 

Reol-Time RT-PCR 

To measure HER2/neu expression in these tumors we 
used a readme quantitative RT-PCR based on TaqMan meth- 
od o^y as previously described," with minor modifications 
Briefly, this technique allows, by means of fluorescence emis- 
sion, to find the cycling point when PCR product is detectable 
(U value or threshold cycle). As previously reported, the Ct 
value correlates to the starting quantity of the target mRNA. 13 
To normalize the amount of total RNA present in each reac- 
tion, we amplified the housekeeping gene ^actin, which is 
assumed to be constant in both norma) samples and rumor 
tissues. 

r « uI « 3 ^ expressed as relative levels of HER2/neu 
mRNA, referred to a sample, called a "calibrator/ chosen to 
represent IX expression of this gene. The calibrator was a 
breast cancer cellular line <MCF-7) ,! > that was analyzed on 
every assay pbte with the unknown samples. All of the ana- 
lysed tumors expressed n-fold HER2/neu mRNA relative to 
the calibrator. 

The amount of target, normalized to an endogenous 
reference (p-aci»o) and relative to .he calibrator, was defined- 
by the AACt method as described by Livak K (Sequence 
Detector User Bulletin 2; Applied Biosystrms). Specifically 
the lormula is applied as follows: 

target amount = 2 ^°' 

where AA Ct = ||Ct (HER2/neu sample) - Ct (/^ain 
sample)) - fC. <HER2/neu calibrator) - Ct (/^ctin calibra- 
tor))). 



Immunohistochemistry 



Forxnahn-fixed. paraffWmbedded tissue sections were 
dep a raffini 2 ed,iehydrnted t and exposed to the primary anti- 
body usmg the E»VisVon-f sys.em (Dako. Carpinteria, CA) 
Primary anu-HER-2/neu antibody (rabbit polyclonal ant^ 

f° i^nnn ° 8 AW85; D ^ o) ™ a PP Iied ™ 3 <>*>»™ 

of 1:2000 for 60 mmmes a, ,oom temperamre. Before expo- 
sure loihe pnmnry antibody, sections were rrnoowave-p, e - 
ueated m £DlA. pH S.O. to retrieve antigenicity, and incu- 
bated with endogenous peroxidase-blocking solnrion (or 10 
mmmes a» room .cmpeiamre. Positive conuol. constituted bv 

for HER2/„eu, as well ncgabve control. ,„ which thf 
primary annbody m om.ned, were stained in pamliel. 




nofmallfesue n=6 syrxrv^ stomas synovW siw«?ma» 
(k»i expression) (h«h «tf»ession> . 
n=9. - - „i4 

FIGURE 1. Distribution of HER2/neu mRNA levels" in normol 
^ Zl**^ ^ toW - md h{ 9^xpression sarcomas. 
™ °?»f 6XpfGSSed 05 rneon °^ slondord error erf the meon 

PCX ©OCR Qf OUp. 



All cases were examined for both cytoplasmic and mem- 
brane immunoreacrivity. Cytoplasmic staining was evaluated 
on a semiquantitative scale, according to Kirpatric* etal with 
minor modifications,^ and repbned as 0 (no staining or 
staging in <J0% of cells). 1+ (weak staining in >10% of 
cells), 2+ (moderate staining -in >j0% of celb), or 3+ 
(strong staining in >W% of cells). The presence of a mem- 
branous pattern of staining was recorded separately and 
scored as absent..(no staining or weak staining in <10% of 
cells) or present (complete and/or incomplete staining in 
>10% of cells). Tumors with a cytoplasmic score of 3+ were 
considered to have high HER-2/neu protein expression. 

Stotislicol Analysis 

Statistical differences were calculated by Fisher s exact 
test. The i-test method was used devaluate the differences 
between groups. Differences were considered statistically sig- 
nificant when Pwas <0.05. 



RESULTS 

HER2/neu mRNA Evoluolion 

All of the tissues analyzed contained delectable 
)eve)s of HER2~ncv mRNA. Sbt norma) tissue ssmptes 
(skeletal muscle) were used to establish basal level of 
HER2/neu mRNA. All the normal samples expressed 
very low levels of HER2/neu mRNA, ranging from 0.9 
to 1.9 n (mean, 1.4 n). Among the 13 tumor samples, 
HER2/neu levels varied greatly, ranging (torn 2.1 to 
2<1 n. Setting a cutoff level at 7.9 n (a value that repre- 
sents the mean value of expression distribution of the 
SSs), 9 cases (69%) ,had low HER2/neu expiession and 
1 cases (31%) had high HER2/neu expression (Fig 1; 
Table 1). The difference between the 2 groups (low 
and high HER2/neu tumors) was statistically sitmincant 
(P= 0.00O1). h 
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mSiKnL j^ n ^ ii,oc hemicol locoltzotion ol HER2/neu in SS. (A) Positive control (breast coneer) showing typical strona 

^^^SdS^^^ eptheho component or the some cose. <C> Cose la the eptthellol component ol this biphosic 
c^op^^poSi^v ^ ^ °' 5, ° tf " n9 (D) C05e A ° n e P i!he6oW <* eo » o monochoslc fibrous SS. showing Strong 



HER2/neu Protein Expression 

Staining with UER2/neu antibody revealed a vari- 
able cytoplasmic and membrane staining pattern. 
Fbree rumors (23%) showed strong staining involving 
both the cell membrane and the cytoplasm (cases 4, 5, 
and 10); weak to mode/aie, exclusively cytoplasmic 
staining was observed in 7 cases (cases J, 3, 6, 7, 8, 9, 
and J 1 ). No staining was detected in 2 tumors (cases' 12 
and 13). In ) case (case 2), a cluster of glandular 
structures representing <5% of the tumor showed 
weak cytoplasmic and very focal membrane staining, 
the latter limited to a ..ingle gland. The epithelial/' 
epithelioid component* exhibited stronger cvtoplasmic 
stainmg compared with the spindle<e|f component of 
the tumors. Membrane staining was predominantly in- 
complete and limited to the epitbelial/epitbelioid ar- 
eas. All 3 cases wiih high immunohistochemical expies- 
sron of HERV/ncu (cases <J, b, and 10) ivcre grade 1JJ 
sarcomas, including J .MP, ] BF, nnd 1 MF-PD SSs. 



ExampI es of HER2/n eu , cy Vbplasmic arid m embran e 
staining are depicted in Figure 2. 

Correlation of Molecular ond 
Immunohislocherntco! Results 

A strong, statistically significant association was 
present between protein expression, for both mem- 
brane and cytoplasmic staining, and HER2/neu mRNA 
levels (P = 0.03), although ] case (case 6) displayed 
discordant results. Interestingly, this neoplasm showed 
high HER2/neu mRNA levels, whereas only weak stain- 
ing, limited to the cytoplasm of a minority of tumor 
cells, was detected by immunohistocbenm al analysis. 

HER2/heu Expression ond Clinicopothofogic 
Porometets 

Both HER2/neu protein expression and mKNA 
levels were evaluated to establish the relationships to 
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TABIE Z Correlation Between Cfinfcopathotogfc 
Features and HER2/neu Expression as Delected by 
1HC and RT-PCR 



HER2/ n 



IHC 



PGR 



Variable 


L 


H 




L 


H 


lvalue 


Age (years) 














<40 


3 


3 




3 


3 




>40 
Sex 


7 


0 


. NS 


6 


1 


NS 


Female " 


8 


0 




7 


1 

3 




Male 


2 


3 


0.03 


2 


NS 


Tumor size (cm) 












<5 


3 


2 




3 


2 




>5 


7 


1 


NS 


6 


2 


NS" 


Histological grade 












n 

Histological type 


3 
7 


0 

3 


NS 


3 
6 


0 
4 


NS . 


MF 
BF 


5 
2 


1 
J 




5 
2 


I 
) 

2 




PD 

Cbem o/Radjotherapy* 


3 


1 


NS 


2 


NS 


Yes 


4 


2 




3 


3 




No - 

Recurrence^ 


5. 


J 


NS 


5 


] 


NS 


Ye* 


5 


0 




5 


0 . 




No 

Metastasis^ 


2 


3 


NS 


1 


4 


0.02 


Yes 

No 


3 
4 


0 

3 


• NS 


3 
3 


0 
4 


NS 



«| mi fi^7S? 0ni: U l r^ rcs5i ° n - H - hi * h lesion; NS. not 
«En,ficant; MF, monophas.c fibrou*; BF, bJphasic; PD, poorly differ- 
entiated (mcfudin ? MF and BF *ith pocrfy difTerenuatcTarL). 

Information not available for case 7. 
1 Cases 7, 12, and 13 were cxclwlcd from ibe analysis. 

ciinicopathologic features, including local recurrence 
and metastatic disease. Two cases (cases ] 2 and j 3) with 
follow-up jess than 12 months and 1 case (case 7) for 
which clinical information was not available were ex- 
cluded from the analysis of recurrences and metastatic 
behavior. 

•No correlation was observed between HER2/neu 
mRNA expression and age, sex, tumor size, tumor 
grade histotype, and metastasis. A correlation between 
sex of the patients and HER2/neu protein expression 

^ mi ^ f n ° ne oF lhc fcmaIe P aljcnts shov ^ d 
nigh HER2/nen protein expression \P = 0 03) Pa- 
tients *rilh high Her2/neu mRNA levels had a lower 
ris* of recurrence- than those with fow Her2/neu 
rnRNA levels [P = 0.02). None of the cases with high 
HER2/ncu mRNA levels developed metastatic foci tl- 
though the small number of observations precluded 
reaching statistical significance [P = 0.}). Results are 
detailed in Table Z 

DISCUSSION 

The present work provides the first combined mo- 
lecular by real-ume RT-PCR and irrmumohistochemical 
ev.dence that HER2/neu over expression occurs in SSs 



Our results indicate that this parameter may provide 
prognostic information and suggest that a specific ther- 
apy with humanized monoclonal antibodies against 
HER2/neu may be considered in a significant number 
of SSs. 

The HER2/neu oncogene has been extensively 
investigated as.a prognostic factor arid more recently as 
a predictor of response I o therapy. It has been demon- 
strated m breast cancer, where HER2/neu overexpres- . 
sion is usually associated with gene amplification 2 ^and 
in other epithelial tumors, mcluding ovarian, gastric 
lung, and urinary bladder car cinomas. 

HER-2/ neu amplification/ overexpression appears ■ 
to be an early event in oncogenic transformation by 
interacting with other members of the HER famuy 3 In 
breast cancer, it is involved in cell cycle and apoptoric 
pathways through the antiapoptotic effects mediated by 
p53 and p21 deregulation.* 2 '* 5 . 

Whether HER2/neu overexpression plays an im- 
portant role in mesenchymal neoplasms remains con- 
troversial. An immunohistochemical study of sarcomas 
using a monoclonal anubody, reported no evidence of 
rmmunoreactivity for HER-2/ neu in 6 SSs as well as in 
other 197 mesenchymal tumors, with cytoplasmic reac- 
trvny observed only in J case of peripheral neuroepi- 
thelioma. A recent investigation reported gene ex- 
pression profiles of 41 soft tissue tumors with cJDNA 
microarray analysis. Among these sarcomas, 6 
monophasic SSs were characterized by a unique expres- 
sion pattern of a cluster of 104 genes, including the 
epidermal growth factor receptor, which shows 50% 
homology with the HER2/ncu gene. 24 These data also 
suggest that the erb-B receptor family plays a significant 
role in SS. It has been demonstrated that a variable 
number of osteosarcomas overexpress HER2/neu 11 15 
However, more recent studies 20 '^ 5 ' 26 were unable to 



- ' -~ "tJi. UJI.IUJC tu 

detect any HER2/ neu gene amplification and/or over- 
expression using fluorescence in situ hybridization 
(FISH), RT-PCR, and immun ©histochemistry. 

Differences in the techniques used may play an 
important role and explain (at least in part) , these 
discrepancies. HER2/ neu alterations can be evaluated 
using different techniques irrcluding immunohbto- 
chemistry, FISH, Southern hybridization, Northern 
blot, and competitive, differential, or real-time PCR. 27 
Immunohistochemistry is the most common method 
for detection of HER2/neu overexpression, but it is 
srgmficandy affected by the sensitivity and specificity of 
the antibodies used, the type, of tissue (frozen versus 
formalin-fixed), and the various interpretative criteria 
and scoring systems used to evaluate cases. Indeed, 
most studies of MER2/neu expression in osteosarcoma 
used immunohistochemical techniques, with different 
monoclonal oi polyclonal antibodies. The discrepancy 
in results may stem from the use of different antibodies, 
as well as a lack of standardized evaluation. 

For these reasons, to evaluate HER2/neu irnrnu- 
noreactiviry in our study, we used a polyclonal antibody 
(Dako, Carpinteria, CA), arguably the most diffuse and 
thoroughly tested antibody for HER2/neu assessment. 
Furthermore, we investigated >lER2/neu mRNA ex- 
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pression with real-time RT : PCR, because it has been 
demonstrated that mRNA levels correlates tightly with 
. protein expression. 28 At present, real-time RT-PCR 
- probably represents the most powerful tool for quanti- 
tative analysis, because it allows better internal control 
and reduction of sample contamination, and provides 
more objective results. 18 

We analyzed HER2/neu gene expression at the 
mRNA and protein level in 13 cases of SS. HER2/neu 
expression was found in all of the cases investigated, 
and mRNA content in the tumors varied from 2.1 to 
24 n. 

The variability of mRNA levels in SSs is reflected 
on heterogeneity of protein expression pattern as de- 
tected by immunohistochemistry. We found that 
HER2/neu immunoreactivity correlates strongly with 
mRNA levels. A convincing cytoplasmic immunoreac- 
tivity was documented with the polyclonal antibody in 
10 of 13 sarcomas. Distinct membranous staining was 
observed in 3. cases, although.it was never comparable 
to the positive breast control. Jt was predominantly 
incomplete and identified in the epithelial/epithelioid . 
component of SS. Interestingly, all of the cases with 
strong cytoplasmic staining also exhibited a membrane- 
staining pattern." 

In breast cancers, a membranous pattern, of stain- 
ing is thought to be specific for HER2/neu protein 
expression and correlated with gene amplification, 
wbereascrtopiasmic staining is usually considered non- 
specific. However, cytoplasmic positivity for HER2/ 
neu has been reported to be prognostic ally significant 
m other tumor types; including bladder, colon, pan- 
creas, thyroid, and nasopharyngeal carcinomas, and 
even in breast cancer. 3 * 3 * 

Patients with high HER2/neu mRNA expression 
had a significantly lower risk of recurrence. Similarly 
all of the cases wiih high HER2/neu expression did not 
metastasize, although this correlation did not reach 
statistical significance due to the small number of cases 
studied. These data suggest that HER2/neu plays a roJe 
in the biology of SS and that HER2/neu over ex pression 
may be linked to a less-aggressive clinical behavior. 
Indeed, unlike many cancers where HER2/neu overex- 
pression has been shown to correlate with poor prog- 
nosis, HER2/neu levels are linked to a more favorable 
clinical course in other malignant rumors, such as thy- 
roid carcinoma and osteosarcoma. 15 34 

The molecular mechanisms responsible for the 
action of HER-2/neu in SSs are unknown. A possjble 
interaction between HEJR-2/neu and the other mem- 
bers of HER family could be important in tumorigen- 
esis. Derangements of other oncogenes, tumor suppres- 
sors, and apoptosis regubtors have been described in 
SSs. For instance, many SSs have been shown to be 
diffusely positive for bd-2 family proteins (bcl-2, bax, 
bcl-x, and bak). These members of the bcl-2 family are 
involved in the regulation of apoptosis in SS. 36 This 
jaises the hypothesis that complex alterations in apop- 
losts-conndlling mechanisms ore present in these neo- 
plasms, iviih HER-2/neu interacting with Bcl-2 family 



members. Further studies, are needed to clarify the 
mechanisms of apoptosis in SS. 

Dependingon the size and location, the therapy of 
choice for SS is radical local excision or amputation.. 
Whenever radical surgery cannot be performed, radio- 
therapy in concert with local excision is suggested in an 
attempt to avoid the need for amputation. Only re- 
cently has a study evaluated thepossible roJe of chemo- 
therapy in the treatment of SS. 

The role of molecular markers in predicting treat* 
ment responsiveness is currently the focus of extensive 
investigation. Breast cancer patients with high HER2/ 
neu expression appeared to benefit from high-dose 
CAF (cyclophophamide, adriamycin, and 5-fluoroura- 
cil) therapy. In our study, 3 of 4 patients with high 
HER2/neu expression received adjuvant chemother- 
apy with ifosfamide; these patients had a favorable clin- 
ical outcome. These data raise the possibility that 
HER2/neu may have value in predicting which patients 
are likely to respond to specific adjuvant chemotherapy 
regimens. Whether the favorable significance of 
HER2/neu expression depends on predicting clinical 
recurrence, response to chemotherapy, or both re- 
mains to be fully elucidated in SS patients. • 

To the best or our knowledge, this is the first 
report that shows expression of HER2/neu in primary 
SS by real-time RT-PCR. Elevated levels of HER2/neu 
mRNA and protein are found in a significant group of 
SS patients, and these levels appear to correlate with 
features of good prognosis. Furthermore, our results 
suggest that this mechanism of disease in SS may be the 
target of specific inhibitory therapies based on human- 
ized monoclonal antibodies. Considering the small 
number of patients examined, further investigation is 
needed to confirm these preliminary findings. 
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Gain and loss of chromosomal material is characteristic of bladder cancer as well as 
malignant transformation in general. The consequences of these changes at both the 
ranscnption and translation levels is at present unknown partly because of technical 
• {mutations. Here we have attempted to address this question in pairs of non-invasive and 
invasive human bladder tumors using a combination of technology that included 
comparative genomic hybridization, high density oligonucleotide array-based monitoring 
of transcnpt levels (5600 genes), and high resolution two-dimensional gel m ° mt ° nne 
electrophoresis. The results showed that there is a gene dosage effect that in some cases 
superimposes on other regulatory mechanisms. This effect depended (p < 0 0 1 5) on the 
magnitude of the comparative genomic hybridization change. In general (18 of 23 case<rt 
chromosomal areas with more than 2-fold gain of DNA showed a corresponding increase 
m mRNA transcnpts. Areas with loss of DNA, on the other hand, showed eithef rZcrt 
or unaltered transcnpt levels. Because most proteins resolved by two-dimensional gels 
are unknown .t was only possible to compare mRNA and protein alterations in relatively 
tew cases of well focused abundant proteins. With few exceptions we found a good 
correlauon (p < 0.005) between transcript alterations and protein levels. The implications ■ 
as well as limitations, of the approach are discussed. impncauons, 
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Genome-wide Study of Gene Copy Numbers, 
Transcripts, and Protein Levels in Pairs of 
Non-invasive and Invasive Human Transitional 
Cell Carcinomas* 

Torben F. 0rntoft4:§, Thomas ThykjaerU, Frederic M; Wa!dman||, Hans Wolf**, 
and Julio E. Celisti 



Gain and loss of chromosomal material is characteristic 
of bladder cancer, as well as malignant transformation in 
general. The consequences of these changes at both the 
transcription and translation levels is at present unknown 
. partly because of technical limitations. Here we have at- 
tempted to address this question in pairs of non-invasive 
and invasive human bladder tumors using a combination 
of technology that included comparative genomic hybrid- 
ization, high density oligonucleotide array-based monitor- 
ing of transcript levels (5600 genes), and high resolution 
two-dimensional gel electrophoresis. The results showed 
that there is a gene dosage effect that in some cases 
superimposes ori other regulatory mechanisms. This ef-: 
feet depended (p < 0.015) on the magnitude of the com- 
parative genomic hybridization change. In' general (18 of 
23 cases), chromosomal areas with more than 2-fold gain 
of DNA showed a corresponding increase in mRNA tran- 
scripts. Areas with loss of DNA, on the other hand, 
showed either reduced or unaltered transcript levels. Be- 
cause most proteins resolved by two-dimensional gels 
are unknown it was only possible to compare mRNA and 
protein alterations in relatively few cases of well focused 
abundant proteins. With few exceptions we found a good 
correlation (p < 0.005) between transcript alterations and 
protein levels. The implications, as well as limitations, 
of the approach are discussed. Molecular & Cellular 
Proteomics 1:37-45, 2002. 



Aneuploidy is a common feature of most human cancers 
(1), but little is known about the genome-wide effect of this 
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phenomenon at both the transcription and translation levels. 
High throughput array studies of the breast cancer cell line 
BT474 has suggested that there is a correlation between 
DNA copy numbers and gene expression in highly amplified 
areas (2), and studies of individual genes in solid tumors 
have revealed a good correlation between gene dose and 
mRNA or protein levels in the case of c-erb-B2, cyc//n d1, 
ems 1, and N-myc (3^5). However, a high cyclin Dt protein 
expression has been observed without simultaneous am- 
plification (4), and a low level of c-myc copy number in- 
crease was observed without concomitant c-myc protein" 
ovefexpression (6). 

In human bladder tumors, karyotyping, fluorescent in situ 
hybridization, and comparative genomic hybridization (CGH) 1 
have revealed chromosomal aberrations that seem to be 
characteristic of certain stages of disease progression. In the 
case of non-invasive pTa transitional cell carcinomas (TCCs), 
this includes loss of chromosome 9 or parts of it, as well as 
loss of Y in males. In minimally invasive pT1 TCCs, the fol- 
lowing alterations have been reported: 2q-, 11p-, 1q+, 
11q13+, 17q+, and 20q+ (7-12). It has been suggested that 
these regions harbor tumor suppressor genes and onco- 
genes; however, the large chromosomal areas involved often 
contain many genes, making meaningful predictions of the 
functional consequences of losses and gains very difficult. 

In this investigation we have combined genome-wide tech- 
nology for detecting genomic gains and losses (CGH) with, 
gene expression profiling techniques (microarrays and pro- 
teomics) to determine the effect of gene copy number on 
transcript and protein levels in pairs of non- invasive and in- 
vasive human bladder TCCs. 

EXPERIMENTAL PROCEDURES 
Material— Bladder tumor biopsies were sampled after informed 
consent was obtained and alter removal of tissue lor routine pathol- 
ogy examination. By light microscopy tumors 335 and 532 were 
staged by an experienced pathologist as pTa (superficial papillary); 

1 The abbreviations used are; CGH. comparative genomic hybrid- 
ization; TCC, transitional cell carcinoma; LOH, loss of heterozygosity; 
PA-FABP, psoriasis-associated fatty acid-binding protein; 2D." 
two-dimensional. 
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Fig. 1. DNA copy number and mRNA expression level. Shown from left to right are chromosome (Chr\ CGH profiles, gene location and 
expression level of specific genes, and overall expression level along the chromosome. A, expression of mRNA in invasive tumor 733 as 
compared with the non-invasive counterpart tumor 335. 0, expression ol mRNA in invasive tumor 827 compared with the non-invasive 
counterpart tumor 532. The average fluorescent signal ratio between tumor DNA and normal DNA is shown along the length ol the chromosome 
(teft)_ The bold curve in the ratio profile represents a mean of lour chromosomes and is surrounded by thin curves indicating one standard 
deviation. The central vertical line (bro/cen) indicates a ratio value of 1 (no change), and the vertical lines next to it (doited) indicate a ratio of 
0.5 (/eft) and 2.0 {fight). In chromosomes where the non-invasive tumor 335 used lor comparison showed alterations in DNA content, .the ratio 
prpfile of that chromosome is shown to the right. ol the invasive tumor profile. The colored bars represents one gene each^ identified by the 
running numbers above the oars (the name of the gene can be seen at www.MDL.DK/s data. html). The bars indicate the purported location of 
the gene, and the colors indicate the expression level ol the gene in the invasive tumor compared with the non-invasive counterpart; >2-fold 
increase (black), >2-fold decrease (b/ue). no significant change (oranoe). The bar to the far right, entitled Expression shows the resulting change . 
in expression along the chromosome; the colors indicate that at least half of the genes were up-regulated (Wade), at least hall of the genes 
down-regulated (b/ue). or more than half of the genes are unchanged (orange). II a gene was absent in one ol the samples and present in 
another, it was regarded as more than a 2-foJd change. A 2-fold level was chosen as this corresponded to one standard deviation in a double 
determination ol -1600 genes. Centromeres and heterochromatic regions were excluded from data analysis. 



grade I and II, respectively, tumors 733 and 827 were staged as pTT 
(invasive into submucosa), 733 was staged as solid, and 827 was 
staged as papillary, both grade HI. 

mRNA Preparation- Tissue biopsies, obtained fresh from surgery,, 
were embedded immediately in a sodium- guanidinium thiocyanate 
solution and stored at -80"C. Total RNA was isolated using the 
.RNAzol B RNA isolation method (WAK-Chemie Medical GMBH). 
poly(A)^ RNA was isolated by an oligo(dT) selection step (Oligotex 
. mRNA kit; Oiagen). 

cRNA Preparation- 1 M g ol mRNA was used as starting material. 
The .first and second strand cDNA synthesis was performed using the 
Superscript® choice system (Invitrogen) according lo the manufac- 
turer's instructions but using an ofigo(dT) primer containing a T7 RNA 
polymerase binding site. Labeled cRNA was prepared using the ME- 
GAscrip® in vitro transcription kil {Ambion). Biot in- labeled CTP and 



UTP (Bnzo) was used, together with unlabeled IMTPs in the reaction. 
Following the in vitro transcription reaction, the unincorporated nu- 
cleotides were removed using RNeasy columns (Oiagen). 

Array Hybridization and Scanning— Array hybridization and scan- 
ning was modified Irom a previous method (13). 10 ^g of cRNA was ■ 
fragmented at 94 *C lor 35 min in butler containing 40 mM Tris 
acetate, pH 8.1, lOOmM KOAc. 30 mM MgOAc. Prior to hybridization, 
ihe fragmented cRNA in a 6x SSPE-T hybridization buffer (1 m NaCl, 
10 mM Tris, pH 7.6, 0.005% Triton), was heated to 95 "C lor 5 min, 
subsequently cooled to 40 and loaded onto the Asymetrix 1 probe 
array cartridge. The probe array was then incubated tor 16 h at 40 e C 
at constant rotation (SO rpm). The probe array was exposed to 10 
washes in 6x SSPE-T at 25 *C followed by A washes in 0.5 x SSPE-T 
at 50 'C. The biotinylated cRNA was stained wilh a streptavidin- 
phycoerythrin conjugate, 10 ix§fm\ (Molecular Probes) in 6x SSPE-T 
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Fkj. 1 — continued 



lor 30 mm at 25 °C followed by 1 0 washes in 6.x SSPE-T at 25 °C. The 
probe arrays were scanr>ed at 560 nm using a confocal laser scanning 
microscope (made lor Asymetrix by Hewlett-Packard). The readings 
Irom the quantitative scanning were analyzed by Asymetrix gene 
expression analysis software. 

Microsatellite Analysis— Micros a telltte Analysis was performed as 
described* previously (14). Microsatetliles were selected by use ol 
www.ncbi.nlm.nih.gov/genemap98, and primer sequences were ob- 
tained from the genome data base at www.gcib.org. DNA was extracted 
from tumor and blood and amplified by PCR in a volume ol 20 for 35 
cycles. The ampficons were denatured and eleclrophoresed for 3 h in an 
ABI Prism 377. Data were collected in the Gene Scan program for 
fragment analysis. Loss ol heterozygosity was defined as less than 33% 
of one allele delected in tumor amplicons compared with blood. 

Proteomic Analysis— ICCs were minced into small pieces and 
homogenized in a small glass bomogenizer in 0.5 ml of lysis solution. 
Samples were stored at -20 °C until use. The procedure lor 2D gel 
electrophoresis has been described in detail elsewhere (15, 16). Gels 
were stained with silver nilrale and/or Coomassie Brilliant Blue. Pro- 
teins were identilied by a combination of procedures that included 
microsequencing, mass spectrometry, two- dimensional gel Western 
immonoblotling, and comparison with the master two-dimensional gel 
image ol human keralinocyte proteins; see biobase.dk/cgi-bin/ce5s. 

CGH— Hybridization ol deferentially labeled tumor and normal DNA 
. to normal metaphase chromosomes was performed as described 
previously (10). Fluorescein- labeled tumor DNA (200 ng), Texas Red- 



labeled reference DNA (200 ng), and human Cot-1 DNA (20 jig) were 
denatured al 37 °C for 5 min and applied to denatured normal met- 
aphase slides. Hybridization was al 37 "C for 2 days. Afler washing, 
the slides were count erst ained with 0.15 jig/ml 4,6-diamidino-2-phe- 
nylindole in an anti-fade solution. A second hybridization was per- 
formed for all tumor samples using fluorescein-labeled reference DNA 
and Texas Red-labeled tumor DNA (inverse labeling) to confirm the 
aberrations detected during the initial hybridization. Each CGH ex- 
periment also included a normal control hybridization using fluores- 
cein- and Texas Red-labeled normal DNA. Digital image analysis was 
used to identify chromosomal regions with abnormal fluorescence 
ratios, indicating regions of DNA gains and losses. The average 
greenrred fluorescence intensity ratio profiles were calculated using 
four images of each chromosome (eight chromosomes total) with 
normalization of the green:red Huorescence intensity ratio for the 
entire metaphase and background correction. Chromosome identifi- 
cation was performed based on 4,6-diamidino-2-phenylindole band- 
ing patterns. Only images showing uniform high intensity fluores- 
cence with minimal background staining were analyzed. All 
centromeres, p arms ol acrocentric chromosomes, and heterochro- 
rnatic regions were excluded from the analysis. 

RESULTS 

Comparative Genomic Hybridization —The CGH analysis 
identified a number ol chromosomal gains and losses in the 
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Table I 

Correlation between alterations detected by CGH and by expression monitoring 

Top, CGH used as independent variable (il CGH alteration - what expression ratio was found); bottom, altered expression used as 
independent variable frf expression alteration — what CGH deviation was found). 



CGH alterations 



Tumor 733 vs. 335 
Expression change clusters 



Concordance 



CGH alterations 



Tumor 827 vs. 532 
Expression change clusters 



Concordance 



13 Gain 



10 Loss 



10 Up-regulation 

0 Down- regulation 

3 No change 

1 Up-regulation 

5 Down- regulation 

4 No change 



77% 



50% 



10 Gain 



12 Loss 



8 Up-regulation 
0 Down-regulation 

2 No change 

3 Up-regulation 

2 Down regulation 
7 No change 



80% 



17% 



Expression change clusters 


Tumor 733 vs. 335 


Concordance 


Expression change clusters 


Tumor 827 vs. 532 


Concordance 


CGH alterations 


CGH alterations 


• - 16 Up-regulation 


11 Gain 


69% 


17 Up-regulation 


10 Gain 


59% 




2 Loss 






5 Loss . 






3 No change 






2 No change 




21 Down- regulation 


1 Gain 


38% 


9 Down-regulation 


0 Gain 


33% 




8 Loss 






3 Loss 






12 No change 






6 No change 




15 No change 


3 Gain 


60% 


21 No change 


1 Gain 


81% 




3 Loss 






3 Loss 






9 No change 






17 No change 





two invasive tumors (stage pTl, TCCs 733 and 827), whereas 
. the two non-invasive papillomas (stage pTa, TCCs 335 and 
532) showed only 9p-, 9q22-q33-, and X-, and 7+ ; 9q-, 
and Y-, respectively. Both invasive tumors showed changes 
(1q22-24 + , 2q14.1-qter-, 3q12-q13.3-, 6q12-q22-. 
9q34+, 11q12-q13+, 17 + , and 20q11.2-q12+) that are typ- 
ical for their disease stage, as well as additional alterations, 
some of which are shown in Fig. 1. Areas with gains and 
losses deviated from the normal copy number to some extent, 
and the average numerical deviation from normal was 0.4-fold 
in the case of TCC 733 and 0.3-fold for TCC 827. The largest 
changes, amounting to at least a doubling of chromosomal 
content, were observed at 1q23 in TCC 733 (Fig. 1 A) and 
20q12inTCC827(Fig. 18). 

mRNA Expression in Relation to DNA Copy Number—The 
mRNA levels from the two invasive tumors (TCCs 827 and 
733) were compared wilh the two non-invasive counterparts 
fTCCs 532 and 335). This was done in two separate experi- 
ments in which we compared TCCs 733 to 335 and 827 to 
532, respectively, using two different scaling settings for the 
arrays to rule out scaling as a confounding parameter. Ap- 
proximately 1,800 genes lhat yielded a signal on the arrays 
were searched in the Unigene and Genemap data bases for 
chromosomal location, and those with a known location 
(1096) were plotted as bars covering their purported locus. In 
that way it was possible to construct a graphic presentation of 
DNA copy number and relative mRNA levels along the indi- 
vidual chromosomes (Fig. 1). 

For each mRNA a ratio was calculated between the level in 
the invasive versus the non- invasive counterpart. Bars, which 
represent chromosomal location of a gene, were color-coded 
according to the expression ratio, and only differences larger 



than 2-fbld were regarded as informative (Fig. 1). The density 
of genes along the chromosomes varied, and areas contain- 
ing only one gene were excluded from the calculations. The 
resolution of the CGH method is very low, and some of the 
outlier data may be because of the fact that the boundaries of 
the chromosomal aberrations are not known at high resolution. 

Two sets of calculations were made from the data. For the 
first set we used CGH alterations as the independent variable 
and estimated the frequency of expression alterations in these 
chromosomal areas. In general, areas with a strong gain of 
. chromosomal material contained a cluster of genes having 
increased mRNA expression. For example, both chromo- 
somes 1q21-q25, 2p and 9q, showed a relative gain of more " 
than 100% in DNA copy number that was accompanied by 
increased mRNA expression levels in the two tumor pairs (Fig. 
1). In most cases, chromosomal gains detected by CGH were 
accompanied by an increased level of transcripts in both 
TCCs 733 (77%) and 827 (80%) (Table I, fop). Chromosomal 
losses, on the other hand, were not accompanied by de- 
creased expression in several cases, and were often regis- 
tered as having unaltered RNA levels (Table I, fop). The inabil- 
ity to detect RNA expression changes in these cases was not 
because of fewer genes mapping to the lost regions (data not 
shown). 

. In the second set of calculations we selected expression 
alterations above 2-fold as the independent variable and es- 
timated the frequency of CGH alterations in these areas. As 
above, we found that increased transcript expression corre- 
lated with gain of chromosomal material (TCC 733, 69% and 
TCC 827, 59%), whereas reduced expression was often de- 
tected in areas with unaltered CGH ratios (Table I, bottom): 
Furthermore, as a control we looked at areas with no alter- 
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5 



Expression changes 
detected 



Expression changes 
not detected 



Expression changes 
detected 



Expression changes 
. not detected 



Tumor 827 versus 532 Tumor 733 versus 335 

FK3..2. Correlation between maximum CGH aberration and Ibe ability to detect expression change by oligonucleotide array 
monitoring. The aberration is shown as a numerical -fold change in ratio between invasive tumors 827 (A) and 733 (♦ ) and their non-invasive 
counterparts 532 and 335. The expression change was taken from the Expression tine to the right in Fig, 1, which depicts the resulting 
expression change lor a given chromosomal region. At least half of the mRNAs from a given region have to be either up- or down- regulated 
to be scored as an expression change. All chromosomal arms in which the CGH ratio plus or minus one standard deviation was outside the 
ratio value of one were included. 



ation in expression. No alteration was detected by CGH in 
most of these areas (TCC 733, 60% and TCC 827, 81%; see 
Table I, bottom)^ Because the ability to observe reduced or 
increased mRNA expression clustering to a certain chromo- 
somal area clearly reflected the extent of copy number 
changes, we plotted the maximum CGH aberrations in the 
regions showing CGH changes against the ability to detect a 
change in mRNA expression as monitored by the oligonucleo- 
tide arrays (Fig. 2). For both tumors TCC 733 (p.< 0.015).and 
TCC 827 (p < 0.00003) a highly significant correlation was 
observed between the level of CGH ratio change (reflecting 
the DNA copy number) and alterations detected by the array 
based technology (Fig. 2). Similar data were obtained when 
areas with altered expression were used as independent vari- 
ables. These areas correlated best with CGH when the CGH 
ratio deviated 1.6- to 2.0-fold (Table I, bottom) but mostly did 
not at lower CGH deviations. These data probably reflect that 
loss of an allele may only lead to a 50% reduction in expres- 
sion level, which is at the cut-off point for detection of expres- 
sion alterations. Gain of chromosomal material can occur to a 
much larger extent. 

Microsatellite- based Detection of Minor Areas of Loss : 
es— In TCC 733, several chromosomal areas exhibiting DNA 
amplification were preceded or followed by areas with a nor- 
mal CGH but reduced mRNA expression (see Fig. 1, TCC 733 
chromosome 1q32, 2p21, and 7q21*and q32, 9q34, and 
10q22). To determine whether these results were because of 
undetected loss of chromosomal material in these regions or 



because of other non-structural mechanisms regulating tran- 
scription, we examined two microsatellites positioned at chro- 
mosome 1q25-32 and two at chromosome 2p22. Loss of 
heterozygosity (LOH) was found at both 1q25 and at 2p22 
indicating that minor deleted areas were not detected with the 
resolution of CGH (Fig. 3). Additionally, chromosome 2p in 
TCC 733 showed a CGH pattern of gain/no change/gain of 
DNA that correlated with transcript increase/decrease/in- 
crease. Thus, for the areas showing increased expression 
there was a correlation with the DNA copy number alterations 
(Fig. 1 A). As indicated above, the mRNA decrease observed in 
the middle of the chromosomal gain was because of LOH, 
implying that one of the mechanisms for mRNA down-regu- 
lation may be regions that have undergone smaller losses of 
chromosomal material. However, this cannot be detected with 
the resolution of the CGH method. 

In both TCC 733 and TCC 827, the telomeric end of chro- 
mosome 11p showed a normal ratio in the CGH analysis; 
however, clusters of five and three genes, respectively, lost 
their expression. Two microsatellites (D11S1760, D11S922) 
positioned close to MUC2, IGF2, and cathepsin D indicated 
LOH as the most likely mechanism behind the loss of expres- 
sion (data not shown). 

A reduced expression o! mRNA observed in TCC 733 at 
chromosomes 3q24. I1p11, 12p12.2, 12q21.1, and 16q24 
and in TCC 827 at chromosome 11 pi 5.5, 12p11,.15q11.2, 
and I8ql2 was also examined for chromosomal losses using 
microsatellites positioned as close as possible to the gene loci 
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FrG. 3- Microsatellite analysis of loss of heterozygosity- Tumor 
733 showing loss of heterozygosity at chromosome 1q25, detected 
(a) by D1S215 close to Hu class I histocompatibility antigen (gene 
number 38 in Fig. 1), (6) by D1S2735 close to cathepsin E (gene 
number 41 "m Fig. 1), and (c) at chromosome 2p23 by D2S2251 close, 
to general 6-speclrin (gene number 1 1 on fig. 1) and of [d) tumor 827 
showing loss of heterozygosity at chromosome 18q12 by S18S1118 
close to mitochondrial 3-oxoacyt-coenzyme A thiolase (gene number 
12 in Fig. 1). The upper curves show the electropherogram obtained 
from normal DNA from leukocytes (A/), and the tower curves show the 
electropherogram Irom tumor DNA (7). In all cases one allele is 
partially lost in the tumor amplicon. 

showing reduced mRNA transcripts. Only the microsatellite 
positioned at 18q12 showed LOH (Fig. 3), suggesting that 
transcriptional down- regulation of genes in the other regions 
may be controlled by other mechanisms. 

Relation between Changes in mRNA and Protein Levels — 
2D-PAGE analysis, in combination with Coomassie Brilliant 
Blue and/or silver staining, was carried out on all four tumors 
using fresh biopsy material. 40 well resolved abundant known 
proteins migrating in areas away from the edges of the pH 
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Fig. 4. . Correlation between protein levels as judged by 20- 
PAGE and transcript ratio. For comparison proteins were divided in 
three groups, unaltered in level or up- or down-regulated horizontal 
axis). The mRNA ratio. as determined by oligonucleotide arrays was 
plotted for each gene [vertical axis). A, mRNAs that were scored as 
present in both tumors used for the ratio calculation; mRNAs that 
were scored as absent in the invasive tumors (along horizontal axis) or 
as absent in non-invasive reference (fop of figure), two different 
scalings were used to exclude scaling as a confounder. TCCs 827 
and 532 (AA) were scaled with background suppression, and TCCs 
733 and 335 (•O) were scaled without suppression. Both compari- 
sons showed highly significant (p < 0.005) differences in mRNA ratios 
' between the groups. Proteins shown were as follows: Group A [Irom 
left), phosphoglucomulase 1 . glutathione transferase class ^ number 
4, fatty acid-binding protein homologue,. cylokeratin 15, and cylo- 
keratin 13; B [irom left), fatty acid-binding protein homologue, 28-kDa 
heat shock protein, cylokeratin 13, and calcyclin; C (from left), a-eno- 
lase, hnRNP B1 r 28-kDa heat shock protein. 14-3-3-*, . and 
pre-mRNA splicing factor; O, mesolhelial keratin K7 (type II); E (from 
fop), glutathione S-transf erase- n and mesothelial keratin K7.(type II); 
F(from top arid left), adenyfyl cyclase-associated protein, E-cadherin, 
keratin 19, calgizzarin, phosphogfycerate mutase, annexin IV, cy- 
loskeletal 7-aclih, hnRNP A1, integral membrane protein calnexin 
(IP90), hnRNP H, brain- type clathrin light chain- a, hnRNP F, 70-kDa 
heat shock protein, heterogeneous nuclear ribonucleoprotein A/B, 
translationalry controlled tumor protein, liver glyceraldehyde-3-phos- 
phate dehydrogenase, keratin 8, aldehyde reductase, and Na,K- 
ATPase j3-1 subunit; G, (from top and leit), TCP20, calgizzarin, 70- 
kDa heal shock protein, calnexin, hnRNP H, cylokeratin 15, ATP 
synthase, keratin 19, iriosephosphate isomerase, hnRNP F, liver gryc- 
eraldehyde-3-phosphalase dehydrogenase, glutathione S-transfer- 
ase-u, and keratin 8; H (from left), plasma gelsolin, autoantrgen cal- 
reticulin. tbioredoxin, and NAD+ ^dependent 1 5 hyoVoxyprostaglandin 
dehydrogenase; / (from fop), prolyl 4-hydroxytase p- subunit, cylo- 
keratin 20. cylokeratin. 17, prohibition, and fructose 1,6-biphos- 
phatase; J annexin H; K. annexin IV; L (from fop and left), 90-kDa heat 
shock protein, prolyl 4 hydroxylase 0-subunit, o-enolase. GRP 78, 
cyclophilin, and coftlin. 

gradient, and having a known, chromosomal location, were 
selected for analysis in theTCC pair 827/532. Proteins were 
identified by a combination of methods (see "Experimental 
Procedures"). In general there was a highly significant corre- 
lation (p < 0.005) between mRNA and protein alterations (Fig. 
4). Only one gene showed disagreement between transcript 
alteration and protein alteration. Except (or a group ol cylo- 
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keratins encoded by genes on chromosome 17 (Fig. 5) the 
analyzed proteins did not belong to a particular family. 26 well 
focused proteins whose genes had a know chromosomal 
location were detected in TCCs 733 and 335, and of these 19 
correlated (p < 0-005) with the mRNA changes detected using 
the arrays (Fig. 4). For example, PA-FABP was highly ex- 
pressed in the non-invasive TCC 335 but lost in the invasive 
counterpart (TCC 733;. see Fig. 5). The smaller number of 
proteins detected in both 733 and 335 was because of the 
smaller size of the biopsies that were available. 

1 1 chromosomal regions where CGH showed aberrations 
that corresponded to the changes in transcript levels also 
showed corresponding changes in the protein level (Table II). 
These regions included genes that encode proteins that are 
found to be frequently altered in bladder cancer, namely 
cytokeratins 17 and 20, annexins II and IV, and the fatty 
acid-binding proteins PA-FABP and FBP1. Four of these pro- 
teins were encoded by genes in chromosome 17q, a fre- 
quently amplified chromosomal area in invasive bladder 
cancers. 

DISCUSSION 

Most human cancers have abnormal DNA content, having 
lost some chromosomal parts and gained others. The present * 
study provides some evidence as to the effect of these gains 
and losses on gene, expression in two pairs of non-invasive 
and invasive TCCs using high throughput expression arrays 
and proteomics, in combination with CGH. In general, the 
results showed that there is a clear individual regulation of the 
mRNA expression of single genes, which in some cases was 
superimposed by a DNA copy number effect. In most cases, 
genes located in chromosomal areas with gains often exhib- 
ited increased mRNA expression, whereas areas showing 
losses showed either no change or a reduced mRNA expres- 
sion. The latter might be because of the fact that losses most 
often are restricted to loss of one allele, and the cut-off point 
for detection of expression alterations was a 2-fold change, 
thus being at the border of detection. In several cases, how- 

Table » 

Proteins whose expression level correlates with both mRNA and gene dose changes 



Protein Chromosomal location Tumor TCC . CGH alteration Transcript alteration* Protein alteration 



Annexin tl 


1q21 


733 


Gain 


Abs to Pres* 


, Increase 


Annexin IV 


2p13 


733 


Gain 


3-9-Fold up 


Increase 


Cytokeratin 17 


17q12-q21 


827 


Gain 


3. 8- Fold up 


Increase 


Cytokeratin 20 


I7q21.1 


827 


Gain 


5.6-Fold up 


Increase 


(PA-)FABP 


8q21.2 


827 


Loss 


10- Fold down ' 


Decrease 


FBP1 


9q22 


827 


Gain 


2.3-Fold up 


Increase 


Plasma getsolin 


9q31 


827 


Gain 


Abs to Pres 


Increase 


Heat shock protein 28 


15q12-q13 


827 


Loss 


2.5-Fold up 


Decrease 


Prohibilin 


17q2t 


827/733 


Gain 


3.7-/2.5-Fold up b 


Increase 


Prolyl- 4 -hydroxy! 


I7q25 


827/733 


Gain 


5.7-/1.6 : Fold up 


Increase 


hnRNPBI . 


7pt5 


827 


Loss 


2.5- Fold down 


Decrease 



° Abs, absent: Pres, present. 

* In cases where the corresponding alteralions were lound in both TCCs 827 and 733 these are shown as 827/733. 




Fig. 5. Comparison ol protein and transcript levels in invasive 
and non-invasive TCCs. The upper part ol the figure shows a 2D gel 
(feft) and the oligonucleotide array fight) ol TCC 532. The red rectan- 
gles on the upper gel highlight the areas that are compared below. 
Identical areas of 2D gels of TCCs 532 and 827 are shown below.' 
Clearly, cytokeratins 13 and 15 are strongly down- regulated in TCC 
827. (ret/ annotation). The tile on the array containing probes lor 
cytokeratin 15 is enlarged betotv the array {red arrow) from TCC 532 
and is compared wilh TCC 827. The upper row ol squares in each tile 
corresponds to peiiect match probes; the lower row corresponds to 
mismatch probes containing a mutation {used lor correction for un- 
specific binding). Absence of signal is depicted as- black, and. the 
higher the signal the lighter the color. A high transcript level was 
detected in TCC. 532 (6151 units) whereas a much lower level was 
detected in TCC 827 (absence ol signals). For cytokeratin 13, a high 
transcript level was also present in TCC 532 (15659 units), and a 
much lower level was present in TCC 827 (623 units). The 2D gels at 
the bottom ol the figure (/eft) show levels of PA-FABP and adipocyte- 
FABP in TCCs 335 and 733 (invasive), respectively. Both proteins are 
down-regulated in the invasive tumor. To the right we show the array 
tiles lor the PA-FABP transcript. A medium transcript level was de- 
tec led "m the case of TCC 335 (1277 units) whereas very low levels 
were detected in TCC 733 (166 units). IEF, isoelectric focusing. 
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ever, an increase or decrease in DNA copy number was 
associated with de novo occurrence or complete loss of tran- 
script, respectively. Some of these. transcripts could not be 
detected in the non-invasive tumor but were present at rela- 
tively high levels in areas with DNA amplifications in the inva- 
sive tumors [e.g. in TCC 733 transcript from cellular ligand of 
annexin n gene (chromosome 1q21) from absent to 2670 
arbitrary units; in TCC 827 transcript from small proline-rich 
protein 1 gene (chromosome 1q12-q21.1) from absent to 
1326 arbitrary units). It may be anticipated from these data 
that significant clustering of genes with an increased expres- 
sion to a certain chromosomal area indicates an increased 
likelihood of gain of chromosomal material in this area. 

Considering the many possible regulatory mechanisms act- 
ing at the level of transcription, it seems striking that the gene 
dose effects were so clearly detectable in gained areas. One 
hypothetical explanation may lie in the loss of controlled 
methylation in tumor cells (17-19); Thus, it may be possible 
. that in chromosomes with increased DNA copy numbers two 
' or more alleles could be demethylated simultaneously leading 
to a higher transcription level, whereas in chromosomes with 
losses the remaining allele could be partly methylated, turning 
off the process (20, 21). A recent report has. documented a 
ploidy regulation of gene expression in yeast, but in this case all 
the genes were present in the same ratio (22), a situation that is 
not analogous to that of cancer cells, which show marked 
chromosomal aberrations, as well as gene dosage effects. 
. Several CGH studies of bladder cancer have shown that 
some chromosomal aberrations are common at certain 
stages of disease progression, often occurring in more than 1 
of 3 tumors. In pTa tumors, these include 9p- ; 9q-, 1q+", Y- 
(2, 6), and in pT1 tumors, 2q-,11p-, 11q-, 1q+, 5p+, 8q+, 
17q+, and 20q+ (2-4, 6, 7). The pTa tumors studied here 
showed similar aberrations such as 9p- and 9q22-q33- and 
9q~ and Y-, respectively. Likewise, the two minimal invasive 
pTl tumors showed aberrations that are commonly seen at 
that stage, and TCC 827 had a remarkable resemblance to the 
commonly seen pattern of losses and gains, such as 1q22-24 
amplification (seen in both tumors), 1 1q14-q22 loss, the latter 
often linked to 17 q+ (both tumors), and 1q+ and 9p-, often 
linked to 20q+ and 11 q1 3+ (both tumors) (7-9). These ob- 
servations indicate that the pairs of tumors used in this study 
exhibit chromosomal changes observed in many tumors, and 
therefore the findings could be of general importance for 
bladder cancer. 

Considering that the mapping resolution of CGH is of about 
20 mega bases it is only possible to get a crude picture of 
chromosomal instability using this technique. Occasionally, 
we observed reduced transcript levels close to or inside re- 
gions with increased copy numbers. Analysis of these regions 
by positioning heterozygous micros at ellites as close as pos- 
sible to the locus showing reduced gene expression revealed 
loss of heterozygosity in several cases, fl seems likely that 
multiple and different events occur along each chromosomal 



arm and that the use of cDNA microarrays for analysis of DNA 
copy number changes will reach a resolution that can resolve 
these changes, as has recently been proposed (2). The outlier 
data were not more frequent at the boundaries of the CGH 
aberrations. At present we do not know the mechanism be- 
hind chromosomal aneuploidy and cannot predict whether ■ 
chromosomal gains will be transcribed to a larger extent than 
the two native alleles. A mechanism as genetic imprinting has 
an impact on the expression level in normal cells and is often 
reduced in tumors. However, the relation between imprinting 
and gain of chromosomal material is not known. 

We regard it as a strength of this investigation that we were 
able to compare invasive tumors to benign tumors rather than 
to normal urothelium, as the tumors studied were biologically 
very close and probably may represent successive steps in 
the progression of bladder cancer. Despite the limited amount 
of fresh tissue available it was possible to apply three different 
state of the art methods. The observed correlation between 
DNA copy number and mRNA expression is remarkable when 
one considers that different pieces of the tumor biopsies were 
used tor the different sets of experiments. This indicate that 
bladder tumors are relatively homogenous, a notion recently 
supported by CGH and LOH data that showed a remarkable 
similarity even between tumors and distant metastasis (10, 23). 

In the few cases analyzed, mRNA and protein levels 
showed a striking correspondence although in some cases • 
we found discrepancies that may be attributed to translational 
regulation, post-translational processing, protein degrada- 
tion, or a combination of these. Some transcripts belong to 
undertrahslated mRNA pools, which are associated with few 
translationally inactive ribosomes; these pools, however, 
seem to be rare (24). Protein degradation, for example, may 
be very important in the case of polypeptides with a short 
half-life (e.g. signaling proteins). A poor correlation between 
mRNA and protein levels was found in liver cells as deter- 
mined by arrays and 2D- PAGE (25), and a moderate correla- 
tion was recently reported by Ideker ef a/. (26) in yeast. 

Interestingly, our study revealed a much better correlation 
between gained chromosomal areas and increased mRNA 
levels than between loss of. chromosomal areas and reduced 
mRNA levels. In general, the level of CGH change determined 
the ability to detect a change in transcript. One possible 
explanation could be that by losing one allele the change in 
mRNA level is not so dramatic as compared with gain of 
material, which can be rather unlimited and may lead to a 
severalfold increase in gene copy number resulting in a much 
higher impact on transcript level. The latter would be much 
easier to detect on the expression arrays as the cut-off point 
was placed at a 2-fold level so as not to be biased by noise on 
the array. Construction of arrays with a better signal to noise 
ratio may in the future allow detection of lesser than 2- fold 
alterations in transcript levels, a feature that may facilitate. the 
analysis of the effect of loss ol chromosomal areas on tran- 
script levels. 
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In eleven cases we found a significant correlation between 
DNA copy number, mRNA expression, and protein level. Four 
of these proteins were encoded by genes located at a fre- 
quently amplified area in chromosome 17q. Whether DNA 
copy number is one of the mechanisms behind alteration of 
these eleven proteins is at present unknown and will have to 
be proved by other methods using a larger number of sam- 
ples. One factor making such studies complicated is the large 
extent of protein modification that occurs after translation, 
requiring immunoidentification and/or mass spectrometry to 
correctly identify the proteins in the gels. 

In conclusion, the results presented in this study exemplify 
the large body of knowledge that may be possible to gather in 
the future by combining state of the art techniques that follow 
the pathway from DNA to protein (26). Here, we used a tradi- 
tional chromosomal CGH method, but in the future high reso- 
lution GGH based on microarrays with many thousand radiation 
hybrid- mapped genes wiil increase the resolution and informa- 
tion derived from these types of experiments (2). Combined with 
expression arrays analyzing transcripts derived, from genes with 
known locations, and 2D gel analysis. to obtain information at 
the post-translational level, a clearer and more developed un- 
derstanding of the tumor genome will be forthcoming. 
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Expressiou of bcr-abl mRNA in individual chronic myelogenous leukaemia 
cells as determined by in situ amplification. 

Pachmanu IC , ZhaoS, SchcnkT, Kantariian H . El-Naggar MC Siciliano MX Guo 
JQ. Arhnghnus RB Audreeff M . 

The University of Texas M.D. Anderson Cancer Center, Department of Molecular 
Haematologyand Therapy, 1515 Holcombe Boulevard, Houston, TX 77030, USA. 

We present the results of a novel method developed for evaluation of in situ 
amplification, a molecular genetic method at the cellular level. Reverse transcription 
polymerase chain reaction (RT-PCR) was used to study bcr-abl transcript levels in 
individual cells from patients with chronic myelogenous leukaemia (CML). After . 
hybridizing a fluorochrome-labelled probe to the cell-bound RT-PCR product, bcr-abl 
raRNA-positive cells were determined using image analysis, A dilution series'of bcr-abl- 
positive BV173 into normal cells showed a good correlation between expected and actual 
values, hi 25 CML samples, the percentage of in situ PCR-positive cells showed an 
excellent correlation with cytogenetic results (r = 0.94, P < 0:000 1), interphase 
fluorescence in situ hybridization (FISH) (r = 0.95, P = 0.00 1 ) and hypermetaphase FISH 
(r -.0.81, P < 0.001). The fluorescence intensity was higher in residual CML cells after 
interferon (f FN) treatment than in newly diagnosed patients (P = 0.004), and was highest . 
in late-stage CML resistant to IFN tlierapy and lowest in CML blast crisis (P = 0.001). 
Mean fluorescence values correlated with bcr-abl protein levels; as determined by 
Western blot analysis (r - 0.62). Laser scanning cytometry allowing automated. analysis 
of large numbers of cells confirmed the results. Thus, fluorescence in situ PCR provides a 
novel and quantitative approach for monitoring tumour load and bcr-abl transcript levels 
. in CML. . 
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Cell localization and regulation of expression of cytochrome P450 1A1 and 

Ivbr Z£ t T g " " mdUCti ° n W " h 3 - m ^y«choIanthrene using mRNA 
hybridization and immunohistochemistry. 6 

INSERM Unite 139, Hopital Henri Mondor, Creteil, France. 

In order tp characterize the response of various pulmonary cell types to polycyclic 
aromatic hydrocarbons the expression of cytochrome M50 (Cy7) 1A1 and^ i noRNA 
m the lung of rats, w.th or without induction by S-methylchoLJene (3M C fw^ 
analyzed by m s,tu hybndization using appropriate 35S-labeIed riboprobes TrT 
express.on o the corresponding proteins was investigated inuBunoWstocSmSlly 
Following induction with3MC, the kinetics of mRNA expression differed SLbly 
between Clara cells and type II pneumoCyte* and venous endothelial cells. hSi 
mRNA expression was detected as early as 1 h after induction, peaked between Xd 4 h 
and was completely undetectable at 14 h. In contrast, venous^dothelial^aud ^e if 
pneumocytes exhibited permanent mRNA expression of CYP 1 Al in ZfcZXiZ 
rau. These kinetic results explain the striking absence of correlation betwelrSA and 

Z ? ^ C,ara CdlS 24 h after e «^f the induction p^ocol^ 
ftese^llsexh*^ agood 

correlation was observed for mRNA and protein expression of CYP 2B1 wiuSlar 

cells. This study clearly distinguished the regulation of CYP I Al expression in the rat 
hmg s from hat described in the liver. The differences observed ^ 
*pes whatever the post-transcriptional mechanisms involved, emphasize thatSs 
must be^performed at the cellular level in order to understand he specific refuse to 
xenob.ot.es, not only of this organ as a whole but also of its variouLatoS^res. 
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Correlative tmmunouistocheiuical and reverse transcriptase polymerase 
chain reaction analysis of somatostatin receptor type 2 in neuroendocrine 
tumors of the lung. 

Papotri ML CroceS, Macri L , Funaro A , Pccchioni C Scliindler ML Bussolati G . 

Department of Biomedical Sciences and Oncology, University of Turin, Italy. 

Somatostatin receptors type 2 (sst2) have been frequently detected in neuroendocrine 
tumors and bind somatostatin analogues, such as octreotide, with high affinity. Receptor 
autoradiography, specific mRNA detection and, more recently, antisst2 polyclonal 
antibodies are currently employed to reveal sst2. The aim of the present study was to 
investigate by three different techniques the presence of sst2 in a series of 26 
neuroendocrine tumors of the lung in which fresh frozen tissue and paraffin sections were 
available, ft was possible, therefore, to compare, in individual cases, RNA analysis 
studied by reverse transcriptase polymerase chain reaction (RT-PCR), in situ 
hybridization (ISH), and immunohistochemistry. A series of 20 nonneuroendocrine lung 
carcinoma samples served as controls. RT-PCR was positive for sst2 in 22 of 26 samples, 
including 15 of 15 typical carcinoids, 5 of 6 atypical carcinoids, and 2 of 5 small-cell 
carcinomas. The sst2 mRNA signal obtained by RT-PCR was strong in the majority 
(87%) of typical carcinoids and of variable intensity in atypical carcinoids and small-cell 
carcinomas. A weakly positive signal was observed in 5 of 20 control samples. In 
immunohistochemistry, two different antibodies (ariti-sst2) were employed, including a 
monoclonal antibody, generated in die Department of Pathology, University of Turin. In 
the majority of samples a good correlation between sst2 mRNA (as detected by RT-PCR) 
and sst2 protein expression (as detected by immunohistochemistry) was observed. 
However, one atypical carcinoid and one small-cell carcinoma had focal immunostaining 
but no RT-PCR signal. ISH performed in selected samples paralleled the results obtained 
with the other techniques. A low sst2 expression was associated with high grade 
neuroendocrine tumors and with aggressive behavior. It is concluded that 1) 
neuroendocrine tumors of the lung express sst2, and there is a correlation between the 
mRNA amount and the degree of differentiation; 2) immunohistochemistry and ISH are 
reliable tools to demonstrate sst2 in these tumors; and 3) sst2 identification in tissue 
sections may provide information on the diagnostic or therapeutic usefulness of 
somatostatin analogues in individual patients with neuroendocrine tumors. 
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Correlative Immunohistocheiiucal and Reverse 
Transcriptase Polymerase Chain Reaction Analysis of 
Somatostatin Receptor Type 2 in Neuroendocrine 
Tumors of the Lung 

" Aola Funaro, PbX>:, Carta Pecchioni, Marcus Schindler, M.V. y and 
Gianni Bussolati, M.D., RR.c.Patb. 



' Somatostatin receptors type 2 (sst2) have been frequently de- 
tected in neuroendocrine tumors and bind somatostatin ana- 
logues, such as octreotide, with high affinity. Receptor autora- 

' diography. specific mRNA detection and. more rccendy, ami- 
sst2 polyclonal antibodies arc currently, employed to reveal 

•* sst2rThe aim of the present study was to investigate by three 

' different techniques the presence of ssl2 in a series of 26 neu- 
ro endocrine rumors of the Jung in which fresh frozen tissue and 

' paraffiri sections were available. It was possible, therefore, to 
compare, in individual cases. RNA analysis studied by reverse 
transcriptase polymerase chain reaction (RT-PCR), in situ hy- 
bridization (ISH), and imrounorustocheJTUstry. A scries of 20 
Donneurbendocrine lung carcinoma samples served as controls. 

: RT-PCR was positive for sst2 in 22 of 26 samples, including 15 
of. 15 typical carcinoids, 5 of 6 atypical carcinoids, and 2 of 5 
small- cell carcinomas. Tbe sst2.mRNA signal obtained by RT- 
PCR was strong in the majority (87%) of typical carcinoids and 
of variable intensity in atypical carcinoids and small-cell car- 
cinomas. A weaiJy positive signal was observed in 5 of 20 
control samples. In immunohistochemistry, two different anti- 
bodies (anti-5st2) were employed, including a monoclonal an- 
tibody, generated in the Department of Pathology, University 
of Turin. Id the majority of samples a good con elation between 
sst2 mRNA (as detected by RT-PCK) and sst2 protein ezpres- ' 
sion (as detected by immunobjsiochemisoy) was observed. 
However, one atypical carcinoid and one small-cell carcinoma 
had focal bumunostaining but no RT-PCR signal. )SH pei- 
formed in selected samples paralleled the results obtained with 

the other techniques. A low sst2 expression was associated with 
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high grade neuroendocrine tumors and with aggressive bchav- 
iOT- IMs concluded that 1) neurc^ndocrine rumors of the lung 
express sst2„ and there is a correlation between the mRNA 
amount and the degree of differentiation; 2) irnmunohistochem- 
istry and JSH are reliable tools to demonstrate sst2 in these 
tumors; arid 3) ssi2 identification in tissue sections may provide 
information on the diagnostic or therapeutic usefulness of so- 
matostatin analogues in individual patients wiih neuroendo- 
crine rumors. 

Key Words; Neuroendocrine— Lung— Tumors— Somato- 
statin receptors — lmmunohistc^hemistryv— Small cell carer- 
noma — Reverse transcriptase polymerase chain reaction. 

Diagn Mpl Pathol 9(1): 47-57, 7000. 



The somatostatin receptor family (sst) includes at least 
five isoforms that have been recently identified and char- 
acterized (18,32,41). The ssts are widely distributed in 
normal human tissues and in human tumors. Sst type 2 js 
more commonly detected in neuroendocrine rumors 
(32,37) and binds the somatostatin analogue octreotide 
with high affinity. 

Sst localization had originally been demonstrated by 
means of binding assays of radiolabeled somatostatin 
analogues (20,25,31). Subsequently, specific sst messen- 
ger RNA (rrtXNA) detection was obtained by means of in 
siru hybridization (ISH) and reverse transcriptase poly- 
merase chain reaction (RT-PCR) (14,32,37). Recently; 
polyclonal antibodies spec in c for different isoforms of 
sst were produced aod used in imniunohisiochemisD7 
(10,12,15,1830,35,36). Given the well-known heteroge- 
neity of neoplastic populations, in situ methods (bnmu- 
nohistochemisny and ISH) allow a more definite map- 
ping of the distribution of the receptor in such tissues. 
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This is potentially useful for piedicting the responsive- 
ness of a given neoplastic cell population to medical 
treatment with somatostatin analogues, which are used in 
the clinical setting for both diagnostic and therapeutic pur- 
poses with special reference to neuroendocrine tumors. 

The spectrum of neuroendocrine rumors of the lung 
includes well-differentiated neoplasms (so-called -typical 
. carcinoids) and poorly differentiated small-cell carcino- 
mas (SCCs). Intermediate forms sharing features of both 
the aforementioned types also belong to this spectrum 
. (so-called atypical carcinoids or well-differentiated neu- 
roendocrine carcinomas). Finally, large-cell neuroendo- 
— erine-carejnoma-ha&-beeD-4dentif)ed and- ineluded-in-tbis- 
tumor group (4,40). The tissue distribution of sst2 an 
neuroendocrine tumors of the lung has not been thor- 
oughly characterized, although individual samples of 
: bronchial carcinoids were found to express sst (30). 
SCCs (but riot non-small-cell types) were also shown to 
be sst2 positive by receptor binding assay (33). More- 
over, sst2 has been detected in in vitro cell cultures of 
human SSC of the lung (39,42). No study on a series of 
neuroendocrine rumors of the lung- including all neuro- 
endocrine lung tumor types has been reported to date. 

The aim of this study was therefore to investigate the 
presence of ssi2 mRNA and protein in a series of 26 
neuroendocrine tumors of the lung, employing different 
technical approaches, such as RT-PCR, JSH, and immu- 
nobistochemistry. To this purpose a monoclonal anti- 
body to sst2 (N-tenrnnal) was generated in the Depart- 
ment of Pathology, University of Turin. The results were 
then compared and related to the tumor grade and to 
other clinicopathologic parameters. 

MATERIALS AND METHODS 

Case Series and UNA Extraction 

Twenty-si* samples of neuroendocrine rumors of the 
lung, in which fresh frozen iissue was available, were 
retrieved from the surgical pathology file of the Univer- 
sity of Turin, Italy. All samples were reviewed applying 
currently accepted criteria of classification (4,40), and 
the neuroendocrine nature was confirmed by positive im- 
munostaining for chromogranin A (CgA) (with or with- 
out antigen retrieval) or synapiophysin, and by positive 
RTPCR for CgA rnJRNA. According to the classi- 
fications described here, these included 15 well- 
differentiated neuroendocrine tumors {typical carci- 
noids), 6 well-differentiated neuroendocrine carcinomas 
(atypical carcinoids), and 5 SCCs. 

A series of 20 non- small- cell Jung carcinomas (10 
squamous, 9 adenocarcinomas, arid 1 large- cell anaplas- 
tic) lacking neuroendocrine dijfer entiation, as demonstrated 
by negative inrmiunohistodienustry and RT-PCR for CgA 
(I), served as n conuoJ group. Clinicopathologic data, and 
follow-up infonwuioo wcie obtained for all patients. 



For hybridization analysis, total RNA was extracted, 
using the guanidine thiocyanate-cesiurn chloride method 
(5). The concentration of RNA was estimated by spec- 
trophotometry, and RNA degradation was assessed by 
agarose gel electrophoresis as previously reported (37). 

Reverse Transcriptase Polymerase Chain Reaction 
for sst2 and Chromogranm A 

Total RNA (2 p,g) was first digested, with 10 units of 
RNase-free DNase (Boehringer, Mannheim, Germany) 
in a 10- uX solution containing 20 mmol/L MgCl 2 , to 
. avoid DNA contamination. The solution was kept at 
room temperamre for^O minutesriben heated for 5 min- 
utes at 70 P C to inactivate the DNase molecules; 40 
pmol/L of oligodeoxy thymidine primers (oh*go-dT16) 
were added and the solution was heated again at 70°C for 
10 minutes, then chilled on ice to allow the primer hy- 
bridization, Toe resulting solution was reverse tran- 
scribed using 100 units of reverse transcriptase (Gibco 
BRL t Gaitersburg, MD). Complementary DNA (cDNA) 
was generated in a 50-uX final reaction volume contain- 
ing 50 mmol/L Tris-HCI pH 8.3, 75 mmol/L KO, 3 
mmol/L MgCl 2 > 30 mmol/L dithiotbreitol, 1 ntmol/L de- 
oxynucleotide triphosphates (dNTPs), and 20 units of 
RNasin (Promega, Madison, WI). The solution was 
heated at 37°C for 90 minutes. Finally, the enzymes were 
inactivated by heating to 70°C for 10 minutes. 

The efficiency of the reverse transcription was deter- 
mined by performing a PCR reaction having the ^ 2 - 
nu" croglobulin •'housekeeping gene'* as a target. PCR was 
carried out in a 10-u-L final reaction volume containing 
1 pL of cDNA template, 30 pmol of sense and antisense 
oligonucleotide primers, 67 mmol/L Tris-HCI pH 8.8, 16 
mmol/L (NH 4 )2S0 4 , 0.03 % polysorbate 20, 2 mmol/L 
dNTPs, 1 mmol/L MgCl 2 > and 0.5 units of Taq polymer- 
ase. 02-MiCTOglobulin, sst2, and CgA PCR reactions were 
performed using the same protocol at the following PCR 
conditions: 35 cycles, each cycle consisting of denaturation 
at 94 °C for 2 minutes, annealing at 55 D C for 1 minute for 
P r microglobulin, al 61 °C for sst2, and at 68°C for CgA; 
extension was performed at 72°Cfor 1 minute. The primers 
used for RT-PCR (9,1 1,23,37) are reported in Table J. 

The amplified fragments were run in a 1% agarose gel, 
containing .eihidium bromide. Strict precautions against 
contamination were undertaken (19) and negative con- 
trols (a no- template control and a no^reverse transcriptase 
control and distilled water to replace the RNA) were in- 
cluded. The RNA extracted from an H71 6 neuroendocrine 
colon carcinoma cell line, and from a neuroblast oma (37) 
served as positive controls for CgA and sst2 t respectively. 

Antibodies 

Two different antibodies specific for sst 2 were em- 
ployed: The fiist one was a monoclonal antibody raised 
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Size ol PCB 
product (bp) 



Position 



Study 



^croglobulm sense: 3 > 
li ^rrwcroQtobutin antisense: 5' ATC TTC AAA CCT CCA TGA 1G 3 

3 &2 sW: 5' CAG JCA TGA GCA TCG ACC GA 3' 

4 SSTR2 antisense: 5' GCA AAG ACA GAT GAT GGT GA 3* 
£ CoA sense: 5' GCT CCA AGA CCT CGC TCT CC £ 

% §R aXnse: 5' GACCGA^TCTCGCOT^^ 



120 
284 
5B3 



286-305 
385-408 
402-421 
665-684 
316-335 
B78-897 



Gussow et ah (B) 
Sestintet al. (37) 
Helma/iet a). (11) 



■ • PCBi polymerase chain reaction. 

"in the Department of Pathology (University of Turin) 
^cific for an N-teTiDi p^_se^nce,of ^e_sgt2isbarcd_ 
by both A and B receptor isoforms). The octapeptide 
EPYYDUTS* corresponding to amino acids 35 to 42 of 

„ the human receptor (and differing by one amino acid 

■ironi the mouse sequence), was synthesized, having. a 
added to the N-tcrminaL This sequence was similar 

- 4otbat used by other groups to produce polyclonal anti- 

• bodies (17,18,27). This .sequence was rather short but 
made it possible to avoid extensive homology with sstl. 

• In addition, according to a genbank search using FASTA 

• (28), this protein sequence is unique to human sst2 and 
'has a partial homology only with rat and human nuclear 
' receptor retinoid orphan nuclear- receptor-beta (a protein 
' having nuclear localization): Three Balb/c mice were im- 
munized with the peptide conjugated to keyhole limpets 
fcemccyanin (KLH) (Sigma, Sl Louis, MO) following 

' the standard procedure. After the first mtrasplenic injec- 
tion (100 ftg of protein) at time 0, the mice were inrra- 
periloneally injected six times with the peptide-KLH 
conjugate (J 50 p.g) in the presence of Freund adjuvant 
The reactivity, of the sera from each animal was evalu- 
ated using an enzyrrre-Jinied immunosorbent assay, us- 
ing the peptide coated onto the plastic. H>e hybridomas 
were produced by somatic fusion of immunized splenr> 
cytes with the mouse myeloma cell line Ag8.X 63.653, 
./piloting the si andajd .technique (21). The monoclonal 
antibodies of interest were selected on the basis of tbe 
reactivity with- the target peptide and with appropriate 
tissue sections: Tbe latter included formalin-feed and 
parafTm-erobedded sections of pituitary, gland and pan- 
creatic islets and weie analyzed by means of immuno- 
peroxidase staining. Parallel control experiments were 
also performed by staining serial sections of these tis- 
sues, emitting ihc primary antibody oi with the preim- 
rnune serum oi widi the antibody preads orbed with high 
concentrations (1 rog/mL) of the antigenic peptide. In 
addition, the selected monoclonal antibodies (coded 
10C6 and lOG4),.boib of IgM isorype, were further char- 
acterized by Western blotting. Membranes were prepared 
horn stable unnsfecied Chinese hamster ovary (CHO> 
Kl cells, individually- ex pies sing recombinant human so- 
maiosiatin receptors (sstl to sst5). Western blotting was 
performed as" previously described (36). The monoclonal 



antibody was used as culture supernatant at 1:3 dilution 
9/hnurs at room temperature,, in Tris-buffered saline 
(TBS), supplemented with 6.1% polysorbate 20. Blots 
were washed in TTBS and incubated with peroxidase- 
conjugated goat antimouse IgM, diluted 1:1.500 for 90 
niirjutes at room temperature. TTben, blots were washed in 
TTBS and immunocoroplexes were visualized using 
ECt following manufacturer's instructions (Amersbarn, 
Bucks, UK). * ■ 

A second polyclonal antibody was produced that bad 
been characterized previously (35,36). This antibody 
(coded 3C230) was raised in sheep and was specific for a 
sequence of the C- terminal portion of the sst2A (KSRL- 
J^TTETQRTLLXEDLQ, amino acids 347 to 366). 

Xmm iu) oh i st o ch e mis try 

Sections 4 or 5 p. thick, adjacent to those used for 
conventional histopathologic exarnination and imrirono- 
staining for neuroendocrine markers, were collected onto 
po)y-l>)ysine-coated slides. Tbt proliferative activity of 
the rumors was assessed by means of K567 immunosta- 
nining (clone MDB1, lmmunotech, Marseille, France), 
.diluted 1:10 after microwave -based antigen retrieval in 
citrate buffer). Tbe ascitic fluid of monoclonal antibody 
10G4 was used in this study and was applied to tissue 
sections with prior antigen retrieval (three 3-minuie pas- 
sages in a microwave oven at 800 V/ in citrate buffer pH 
6.0). at the dilution of 1:10,000 or 1:12,000 for 30 min- 
utes at room temperature, tbe antiserum coded K230- 
was applied overnight at a dilution of 1300 wjth no prior 
antigen retrieval- Tbe immune reactions were then re- 
vealed with the immunoperoxidase technique (13) using 
the streplavidin-peroxidase kit and diammobenzidine as 
cbroroogen. A weak nuclear coimterstain or no countei- 
stain was used in p^ja}}c) sections. Control stirrings for 
both antibodies included immunoperoxidase of serial 
sections using preimroune serum oi antibody pre ad- 
sorbed with the antigen or buffer instead of the primary 
antibody. 

In Siru Hybridization 

Selected tumors (12 samples) were also analyzed for" 
sst2 mRNA expression by means" of a oonjadioacuve, 
ryramide deposition- based 3SH technique. Tbe proce- 
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dure of amplification was modified from procedures re- 
ported by Kerstens el al. (16), Speel e l ah (38), arid the 
GenPoirjt (biorinyMynamide) manufacturer (DaJco, Glos- 
trup, Denmark). Briefly, 5-pjn-thick parafliri sections 
were collected onto silane-coated slides- and deparaf- 
linized through xylene and graded alcohols to phosphate 
buffer saline (PBS) ; The slides were then incubated for 5 
minutes in a microwave oven at 800 W in citrate buffer 
pH 6.0. After washing in PfiS, tbey were digested with 
proteinase K (1 pg/mL) for 10 minutes at 23°C Endog- 
enous peroxidase activity was blocked with 3% hydro- 
gen peroxide and endogenous biorih was Mocked using 
"^virJhrtJoc3dng~reagenrfor— 15 jninutes--folle\vefM>y — 



washing in PBS and biotin-blocking reagent for 15 roin- 
utes (3). Sections were then prebybridized for 1 hour at 
room temperature in a mixture composed of 4x SSG, 
50% formamide, Denbardt's 1*, dextrane sulfate 5x > 500 
/xg/mL salmon sperm DNA, and 250 pg/mL tRNA. Hy- 
bridization took place overnight at 42°C in a solution 
containing the specific probe at a concentration of 1 
pmoVmL The probe was a djgoxigeninOabeled 4 8- base 
oligonucleotide (32), complementary to positions 91 io 
139 of the human sst2 gene (41). After hybridization,, 
excess hybridization buffer and coverslips were removed 
. by a rapid wash in 4x SSC followed by stringent washing 
in O.lx SSC for 10 minutes at 42°C. The hybrids were 
revealed by the following incubation steps: peroxidase- 
labeled antidigoxigenin (diluted 1:100 in PBS) for 30 
minutes at room temperature, biotiny]ated tyramide (di- 
luted 1:5 in PBS) for 15 minutes at room temperature, 
and peroxidase-labeled streptavidin for 15 minutes at 
room temperature. Diaminobenzidme was used as chro- 
mogen. Controls for 1SH included staining of serial sec- 
tions with sense probe, an unrelated piobe (EBER-1 of 
the Epstein-Barr virus), and omission of the probe in the 
hybridization mixture, with all other experimental con- 
ditions identical to the procedure described here. 



RESULTS 

.Reverse Transcriptase Polymerase Chain Reaction 

All neuroendocrine tumors, but no noiineuroendocrine 
lung carcinomas, were positive for CgA mRNA (Fig- 1). 
Sst2 mRNA was amplified in 22 of 26 samples of neu- 
roendc>crine tumor. The signals had variable intensities 
(JFng. 2) and were weak in moderately or poorly differ- 
ent ated rumors (mostly in SCCs). No amplification was 
obtained in no- template or no-reverse transcriptase ex- 
periments. Control samples (noiineuroendocrine lung 
carcinomas proven by negative CgA RT-PCR) were 
weakly positive foj sst2 in 5 of 20 samples only. (includ- 
ing 3 adenocarcinomas, 1 squamous,, and the large-cell 
anaplastic carcinoma) (Fig. 3). Tliese. differences were 
statistically significant (P < 0.01) by X 2 test 

Characterization of Monoclonal Antibodies to sst2. 

Several clones were identified having a positive bind- 
ing by enzyme -linked immunosorbent assay and a par- 
allel immun ore activity 1 on formalin-fixed paraffin- 
embedded human endocrine tissues (pituitary ar>d pan- 
creatic islets). In Western blotting experiments, two 
clones (coded 10C6 and 10G4) specifically developed a 
band at approximately 70 kJD. When the antibodies were 
used against CHO-transfecied cells expressing recombi- 
nant somatostatin receptors 1 through 5, a specific band 
corresponding to sst2 (at approximately 70 kD) was re- 
vealed by the monoclonal antibody 10G4. Monoclonal 
antibody 10C6 developed a strong band wiLh sst2 but 
displayed a weaker reactivity also with sstl, 3, and 5, al 
least in the present experimental conditions (Fig. 4 A,B). 
The same antibodies were also tested by means of im- 
rounoperoxidase staining, on formalin- fixed, paraffin- 
embedded samples of norma) human pituitary gland and 
pancreas. Monoclonal antibody }QG4 gave good results 
in immujjohjstochernistry and was used at increasing di- 




583 bp 



J 2 3 4 5 6 7 8 9 10 11 I 32 33 14 35 36 17 38 19 20 23 22 23 24 25 26 C C* M 
FtG. 1. Reverse transcriptase polymerase chain reaction for cbromograniri A (CGA) mRNA 
in 26 samples ol neuroendocrine tumor ot the lung. Numbers in each lane correspond to 
sample numbers in Tabfe 2. CGA mRNA is amplified at 429 bp. G and C stand for positive 
(neuroendocrine colon carcinoma cell line, B716) and negative (distilled water) controls, 
respectively. The !asl column to the right repiesenis the molecular weight marker. All samples 
are positive wilh a variable intensity ot the amplification band. 
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284 bp 
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4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 C C* M 

FIG. 2. Reverse transcriptase polymerase chain reaction for sst2 mRNA in 26 samples of 
"neuroendocrine tumor of the lung. Numbers in each lane correspond to sample numbers in 
Table 2. sst2 mRNA is amplified at 284 bp. C and C* stand for positive, (a neuroblastoma) 
and negative (distilled water) controls, respectively. The last column to the right represents 
. the molecular weight marker. Twenty-two of 26 samples are positive with a variable Intensity 
of the amplification band. 



lutions (up to J:)5 > 000) with specific staining. Using thin 
secpons (approximately 4 pjn), a strong membrane- 
bound and peripheral cytoplasmic imrnunoreactivity was 
found in an adenohypophyseal cell population (corre- 
sponding to growth hormone-secreting cells, as con- 
firmed by double imrnunohistochernica) analyses) and in 
pancreatic islets (Fig. 4 CJD). In the latter, the staining 
was apparently not restricted to a specific hormone- 
producing cell type and had a peripheral cytoplasmic or 
membrane distribution. Exocrine pancreatic cells (both 
acinar and ductal) were only occasionally immuno : 
stained. bnmunobisic>cbeniistry performed on serial con- 
trol sections, either omitting the primary, antibody or us- 
ing the preimrnune serum or antibodies pieabsorbed with 
the synthetic peptide, was negative in both tissues. 
Monoclonal antibody I0C6 had a relatively higher back- 
ground staining at similar dilutions. 

Imjnunobistocbemistry* " 

The antibodies tossi2 (monoclonal antibody J0G4 and 
polyclonal K230) gave slightly different imrnunoieac- 
lions in 25 samples, and staining was not done in j 
sample because of Jack of residua) paraffin blocks. The 
monoclonal antibody 10G4 stained 21 of 25 samples, the 



negative samples being 1 atypical carcinoid and 3 SCCs 
(Fig. 5). The rumors had 5% to 25% of the neoplastic 
cells imrnunoreacDve. The staining was at the periphery 
of tbe cytoplasm, and omitting the counterslain its mera- 
branevbound distribution was better outlined in most 
samples (Fig. 6). One sample of atypical carcmoid {no. 
2]) was locally immunoreactive for sst2„ despite 1 " nega- 
tive RT-PCR findings. Conversely, sample no. 26 was 
immunohistoehemistry negative and RT-PCR positive. 
The antiserum anti-sst2A (code K230) gave positive sig- 
nal in 19 of 25 samples, in 5% to 60% of the neoplastic 
cell population (Fig. 7). The location of the staining was 
at the membrane level associated with a weak cytoplas- 
mic reactivity. The same partem was seen in positive 
controls, e.g., pancreatic islets (Fig. 7, inset). Two 
samples (nos. 19 and 26) were negative in spite of a 
positive RT-PCR signal. Two other tumors (nos. 21 and 
•22), apparently devoid of sst2 mRNA, showed a small 
percentage of immunoreactive cells. Incidentally, one of 
these latter samples (no. 21) was also immunoreactive 
with monoclonal antibody 10G4 (Table 2). 

The five control samples positive 'by RT-PCR were 
also reactive with the antibodies: The ryp e of immuno- 
. cytochemical location of ssi2 receptors was similar to 
that described here, being a peripheral cyioplasmic stain- 




M I 2 3 4 5 6 7 8 9 10 11 12 13 H 15 16 17 18 19 20 C C* M 

FiG. 3. Reverse Iransaiptase polymerase chain reaction for ss\2 rhRNA in 20 control 
samples bl nonneuroendocrine lung carcinoma. Five of 20 samples show a weak band at 
284 bp corresponding to ss!2 rnRNA. Control columns [C and C) are identical to those in 
FiQ. 2. 
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FIG. 4. Western blot analysis of monoclonal antibody 
clones 10G4 and 10C6 against sst2 in Chinese hamster 
ovary cells transfected with recombinant sst 1 through 5 
(numbers of each column "correspond to receptor type). 
Monoclonal antibody 10G4 shows a specific band at ap- 
proximately 70 kD for sst2 only (A) as opposed to mono- 
clonal antibody 10C6, which strongly reacts with sst2 but 
also has some degrees of cross- reactivity with sst 1, 3, 
and 5 (B). The lower figures show control formalin-fixed 
pa raff in- embedded pancreatic islets immunostained with 
monoclonal antibody 10G4 without (C) and with (D) 
preadsorption with the peptide antigen, respectively. The 
majority of endocrine cells show a membrane-bound im- 
munoreactrvjty (C) (immunoperoxidase). Bar; 90 um. 



ing present in 40% to 70% of neoplastic ceUs. A weak 
and focal staining was also observed in five of the re- 
maining RT-PCK-negative samples, when the antibody 
K230 was used (but not when the monoclonal was em- 
ployed). 

Several celJs in periiumora) tissues were occasionally 
stained Ciliated cells of bronchial mucosa had a periph- 
eral staining at the cijia border. Mucous glands were 
jiegative. Kaie chondrocytes had a membrane staining. 
The wall of peril umoraj as well as of occasional distant 
vessels was stained at the endothelium level and in oc- 
casional smooth muscle cells. 

. The reactivity "of both antibodies was abolished in se- 
rial seciions when the reacts were preabsorbed with 
the respective synthetic peptides, but not when an unre- 
lated peptide was used. The perirurooial bronchial mu- 
cosa had a focal staining of ciliated cells with both an- 
tibodies. This reactivity disappeared- when die prcab- 
s or bed ami body was applied. 



In Situ Hybridization 

Eight of 12 samples stained by 1SH were positive for 
sst2 mRNA. The ntRNA was present in a percentage of 
cells (ranging from 10% to 40%) and gave a weak signal 
(Fig. 8), despite the amplification provided by the tyra- 
roide-based procedure. The background level was mini-, 
mal using diluted biotinylaled tyramide. Control sections 
stained with sense probe or an unrelated probe, or omit- 
ting the probe, were consistently negative. 

Clinical Data 



Qinicopathologic data are suinmarized in Table 2. At 
follow-up, the majority of patients with typical carci- 
noids arc free from disease 1 to 1 1 years after surgery. 
Two patients are alive with stable metastatic disease. 
Patients affected by atypical carcinoids had disease pro- 
gression in one third of samples. Finally, patients with SCC 
had f2tal outcomes within 1 year from diagnosis (except the 
receDt sample). Eight patients bad preoperative octreotide 
scintigraphy performed at tbe time of diagnosis. All patients 
had positive octreoscan findings, 'and, in these patients, 
also the tumor was positive by RT-PCR and immu- 




FIG, 5. sample no. 25 (small cell carcinoma). Absence of 
immunoreactivify lor sst2 with the monoclonal 10G4. This 
sample was also negative by reverse transcriptase poly- 
merase chain reaction and in situ hybridization. (Immuno- 
peroxidase in a formalin- fixed paraffin-embedded sample. 
Nuclei slightly counterstained with hernalum.) Bar: 45 pm. 



Dsozn Vol rr.thcl. i'c/ 9. A'o. 7. 7OP0 



SOMATOSTATIN RECEPTOR TYPE 2 JN NEUROENDOCRINE LUNG TUMORS 



53 



nohistocbemistiy or ISH. In addition, three of these pa- 
rents received octreotide therapy administered at the 
time of tumor recurrence or metastatic spread. Stable 
disease is recorded at follow-up more than 5 years after 
diagnosis. 

'Correlations 

Overall, complete overlapping (i.e., RT-PCR, 1SH, 
and immunohistochernistry with two antibodies) be- 
tween sst2 gene and pjotein expression was obtained in 
2*1 of 25 samples (84%) and between RT-PCR results 
ohistochemical findingS-^im..ai-leasLope-xi^- 



the antibodies in 24 of 25 samples (96%). Tbe monoclo- 
nal antibody 10G4 looked highly sensitive, being able to 
stain all but one sample (do. 26) (95%) positive for sst2 
mRNA by RT-PCR. Sst2 expression, at mRNA as well 
as at protein levels* was reduced in high grade rumors, 
with SCCs being weakly positive in ODly two of five 
samples. Decreasing expression of sst2 appears to cor- 




HG* 6. sample no. 16 (typical carcinoid). Immunohisto- 
chemical detection of sst2 by means ol monoclonal anti- 
body 10G4. The neoplastic cells have a peripheral cyto- 
plasmic staining and membrane positivity in some .cells, 
whereas the peribronchial gland adjacent to the tumor is 
uweactrye. (Imrnunoperoxidase in a foimalin- fixed paraf- 
iin-embedded sample. Nuclei slightly counterstained with 
hemalum.) Bar: 45 pm. The membrane-bound distribution 
of the immunostaining is better outlined in a parallel sec- 
tion stained for monoclonal antibody 10G4 omitting* 
nuclear counterstain (inset). 




FIG. 7. Same sample as in Fig. 6. Immunohislocbemical 
detection of sst2 by means of the polyclonal antibody 
K230. The immunostaining is more intense at the cell bor- 
der (arrows), as observed with the monoclonal antibody. 
In the inset.- a pancreatic Islet, used as positive control, 
shws a predominant membraner bound immunostaining of 
many neuroendocrine cells. (Immunoperoxldase in a for- 
malin- fixed paraffin- embedded sample. Nuclei slightly 
counterstained with hemalum.) Bar 45 pm. 

relate with high tumor grade and elevated proliferative 
activity, but not with other parameters such sex, age, or 
tumor size. 



DISCUSSION 

In this study, the presence of sst2 mRNA has been 
demonstrated in a series of resected neuroendocrine ru- 
mors of the lung by.means of RT-PCR and confimied by 
a sensitive nonradioactive tyramide-based ISH procedure 
and by immunohistochernistry with anti-sst2 antibodies. 
Samples of both carcinoid tumors and SCCs were ssi2 
positive, although a reduced, or absent signal was ob- 
served in poorly differentiated (small^cll) carcinomas. 
This is the first study of sst2 expression in a relatively 
larse series of neuroendocrine tumors of the lung. Single 
samples of human carcinoids and SCCs (including cell 
lines of ihe latter) had previously, been analyzed and 
found to express ss(2 (7,15,30,32,33,39,42). Several 
mediods have been used to detect these receptors and 
parti ally overlapping result severe obtained. 

In the present study, The expression of high amounts of 
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TABLE 2. Oinicopathofogic data and somatostatin receptor type 2 (sst2) expression in 26 cases of neuroendocrine 

■ tiing tumors 
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A WD, alive with disease; CgA, chromogranin A; DOD, died of disease; -* F, focal: positive Hi <5% of cette; IHC; Immunohistochemistfy; 
Mab, monoclonal antibody; NECa, Neuroendocrine carcinoma; NED, no evidence of disease; NET, neuroendocrine tumor; NT. not tested; 
RT*PCR. reverse transcriptase polymerase chain reaction; SCC, SmalKcefl lung carcinoma; SYP, synaptophysin; Wp, well differentiated 

" Ki67 IHC: vaJues correspond to percentage of positive nuctef of neoplastic cells. 

t Patients who had preoperative octreoscan performed. 

t Patients who had octreoscan performed and octreotide treatment. 



sst2 mRNA was confirmed in well to moderately differ- 
ent) a ted neuroendocrine tumors, in agreement with the 
results obtained by Reubi et al. (32) by means of radio- 
active 1SH. The presence of sst2 mRNA in SCC had 
never been reported in human specimens, except for two 
samples included in. Reubi et aJ.'s series (32). Although 
the data on cell lines support the observation that'SCCs 
contain sst2 (42), slightly discrepant results were found 
in some of samples described here. Unfortunately, SCCs 
are rarely operated on, and therefore it is difficult to 
collect a large number of surgical specimens. The frve 
samples studied in the current series by means of RT- 
PCR had a Jow amount (two samples) or absent (three 
samples) sst2 mRNA. This could be the result of the 
extensive necrosis commonly present in such rumor 
types. However, because care was taken to freeze frag- 
ments ihat weje.maooscopically devoid of necrotic ar- 
eas, a more likely hypothesis is that sst2 expression is 
reduced in poorly differentiated tumors. Recently, Reis- 
inger et al. (29) showed that the uptalce of somatosiaUn 
analogues in patients with SCC undergoing chemo- 
therapy is 5ignif)cant)y lower, and therapeutic exremal 
faciofiTrriay afjec! the reeeptoi status of individual ru- 
mors. In addition, the uptake of somatostatin analogues 



in metastatic deposits of SCC has been shown to be Jow 
or absent (29,2). The present finctings suggest that the 
-sst2 mRNA content is related to the degree of tumor 
differentiation. These data must be cojrfirrrjed in larger 
series of noBneurbeDdocxine tumors to ascertain whether 
the observed loss or decrease of sst2 expression in neu- 
roendocrine rumors is- a common- event linked to neo- 
plastic dedifferentiauon. In addition, further studies are 
. needed to assess the functionality of such receptors, by 
comparing the profile of sst2 expression in tumor tissues 
with binding assays employing labeled somatostatin and 
with the clinical response to diagnostic and therapeutic 
administrarJon of somatostatin analogues. 

To this purpose, several investigators have demon- 
strated a correlation between clinical imaging or re- 
sponse to somatostatin analogue treatment, and sst2 
mRNA content in single . samples of carcinoid rumors 
(15,22). Northern blotting and 1SH weie the techniques 
used for sst2 mRNA identification. This kind of corre- 
lation is useful for selecting patients for somatostatin 
analogue treatment, although the demonstration of recep 1 . 
tor mRNA in a cell does not imply per se that the recep- ' 
tor is hilly funcuonal. 

The present study relied on a highly sensitive tech- 
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— FIG..B» sample no.- -14- (typical carcinoid!). In-sHtnhytw- 
ization (ISH) for s$t2 mRNA shows a weak cytoplasmatlc 
stainbg (A) in most lumor cells.. An ISH performed with an 
unrelated, probe was negative in a serial section of the 
same tumor (B). This sample was strongly positive by 
reverse transcriptase polymerase chain reaction for sst2 
mRNA and by immunohistochemislry. ( Nonradioactive - 
ISH revealed by peroxidase and diaminobenzidine, as 
substrate- Nuclei counterstained with hemaJum.) Bar: 
75 pm. " . 



* : nigue, RT-PCR, to identify all samples bearing even 
small amounts of sst2 noRNA^ Indeed, in a previous 
study, single samples exhibiting octreotide-binding sites 
- had no demonstrable sst2 mRNA by means of ISH, pos- 
sibly duero the low sensitivity of the JSH procedure* (34). 
The RT-PCR bas shown sst2 mRNA transcripts in the 
majority of samples here studied. Only four samples 
were negative, all belonging lo poorly differentiated high 
grade tumors, which usually follow an aggressive course. 
A decrease of sst2 mRNA expression in association with 
neuroendocrine minor dedifferentiation had also been re- 
ported in neuroblasiomas (37). In tbe above report, as 
well as in the cunenr study, samples having an unfavor- 
able prognosis were found to contain a relatively low . 
amount of sst2 mRNA, as compared with well- 
differentiated rumors. 

In the current sample series, eight samples were in- 
vestigated before $w°ery with radiolabeled octreotide. 
Despite the low figures, all tbe samples positive at the 
diagnostic procedure had a strong RT-PCR signal for 
sst2 mRNA. Three of eight patients were also responsi ve 
io octreotide treatment adrmmstered at the time of re- 
lapse or metastatic spread, More extensive correlative 
chmcopaihoJogjc studies on the ssi status are needed to 
better define the tissue distribution, of somatostatin bind- 
sites and their potential chnica] role in the Deatmenr 
of patients. 

Sst2 evaluation by means of ISH (14,32) or RT-PCR 
<?6,37) is a highly sensitive and reliable procedure. Un- 
fortunately, Oiese itxhmques have limitations because 
^ozen tissue is needed for some of ihem, and radioac live — 
ma,ejjal or COMlv and time-consuming methods are nec- 



essary for others. Irrrmujiobislocbcrnical analysis of sst2 
by means of specific antibodies represents an idea], 
cheap, and rapid alternative, easily applicable to archival 
material. For these reasons, several investigators have 
raised polyclonal antibodies specific for sst (8,10,15,17, 
18). Li the current study, tested tumor fragments adjacent 
to those snap frozen for RT-PCR analysis were tested 
with a polyclonal antibody against a C-terminal portion 
of the sst2A splice variant (35,36). In addition, .a mono- 
clonal antibody was produced in the Department of Pa- 
thology (University of Turin) against an N-terminal se- 
quence of the bnman sst2. This antibody was tbe jfirst 
monoclo naj^e^loped against sst2 and was shown to be 
highly specific for sst2 in Western Wot and jmmunohis- 
tocbemical analysis. Both the monoclonal and tbe poly- 
clonal antibodies specifically reacted with all samples 
also positive by RT-PCR (with minor, discrepancies in 
two samples, likely due to rumor heterogeneity). The 
observed correlation between RT-PCR and immunobis- - 
locbemistry indicates that the latter may be a reliable 
diagnostic tool and may allow unmunob'strcbernical in- 
vestigation for sst2 even in small biopsy samples. This in 
rum may enable a rapid screening of sst^-positivc .tumors 
for medical treatment with somatostatin analogues. 

Having confirmed in a relatively large series that the 
vast majority of. neuroendocrine rumors of tbe lung con- 
tain variable amounts of sst2 mRNA, a final comment is 
deserved for sst2 expression in nonneuroendocrine Jung 
carcinomas. No data bave been reported thus far in tbe 
literature concerning normal human lung, although in the 
present study some bronchial cells of peritumoral paren- 
chyma were positive for sst2 when immunohi si ©chemi- 
cal analysis was performed with either antibody. Tbe 
staining was specific because it was abolished using pre- 
absorbed antibodies. Therefore, it is likely that normal 
human lung tissue contains sst2. This might .be con- 
firmed by alternative techniques (e.g.,' Western blot, RT- 
PCR). However, in situ morphologic procedures, such as 
those employed here, have definite advantages. In fact, 
the lung is rich in vessels, and in several tissues (either iri- 
tumoral or in mflammatory- reactive conditions) the ves- 
sels were recently shown to contain sst (6). 

A low expression of sst2 was found in 25% of lung 
carcinomas of nonneuroendocrine type investigated in 
the present study by means of RT-PCR. Therefore, sst 
type 2, at least, does not appear to be extensively ex- 
pressed in nonneuroendocrine carcinomas of the lung. 
However, because two tumors in the control group (a 
squamous carcinoma and an adenocarcinoma, respec- • 
lively) had positive octreotide scintigraphy, but no sst2 
mRNA, it is plausible that a heterogeneous disnibuiion 
of sst occurs in nomieuroendocrine lung rumors. Other 
receptor types may be expressed in these rumors and may 
be responsible for the positive results in diagnostic test- 
ing. Because sst5. is also known to bind somatostatin 
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analogues, such as octreotide, with high affinity (24), the 
expression of this jeceptoj type will be investigated in 
future studies. q 

AcfaiowJcdgroeBts: Tbt authors arc grateful to Dr. PJL 
Filosso (Turin) for clinical data, to Mrs. Jvj. Ccrrato and JvJiss S. 
SoJero for skilful technica) help, and to Mr. A. Grua for ihe 
photographs. 
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Expression of somatostatin receptor types 1-5 in 81 cases of 
gastrointestinal and pancreatic endocriue tumors. A correlative 
immunohistochemical and reverse-transcriptase polymerase chain 
reaction analysis. 

Papotti M, Bon^ovanniM, Volante M . Allia £ . LandolfiS , Heiboe L , Schindler M 
Cole SL , Bussolati G . 

Department of Biomedical Sciences and Oncology, University of Turin, Via Santena 7, 
10126 Turin, Italy, mauro.papotti@unito.it 

Somatostatin receptors (SSTRs) have been extensively mapped in human tumors by 
means of autoradiography, reverse-transcriptase polymerase chain reaction (RT-PCR), in 
situ hybridization (ISH) and immunohistochemistry (IHC). We analyzed the SSTR type 
1-5 expression by means of RT-PCR and/or IHC in a series of 81 functioning and non- 
functioning gastroenteropancreatic (GEP) endocrine rumors and related norma! tissues. 
Moreover, we compared the results with clinical, pathological and hormonal features. 
Forty-six cases (13 intestinal and 33 pancreatic) were studied for SSTR 1-5 expression 
using RT-PCR, IHC with antibodies .to SSTR types 2, 3, 5 and ISH for SSTR2 rriRNA. 
The vast majority of tumors expressed SSTR types 1, 2, 3 and 5, while SSTR4 was 
detected in a small minority. Due to the good correlation between RT-PCR and IHC data 
on SSTR types 2, 3, and 5, thirty-five additional GEP endocrine rumors were studied with 
IHC alone. Pancreatic insulinomas had an heterogeneous SSTR expression, while 100% 
of sornatostatinornas expressed SSTR5 and 100% gastrinomas and glucagonomas 
expressed SSTR2. Pre-operative biopsy material showed an overlapping 
immunoreactivity with that of surgical specimens, suggesting that the SSTR status can be 
detected in the diagnostic work-up. It is concluded that SSTRs 1-5 are heterogeneously 
expressed in GEP endocrine tumors and that IHC is a reliable tool to detect SSTR types 
2, 3 and 5 in surgical and biopsy specimens. 
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P-cadheriaovei expression is an indicator of clinical outcome in invasive 
breast carcinomas and is associated with CDH3 promoter 
hypomcthylation. 

Paredes J , Albergaria A, Olivcii a JT . Jcronimo C . Milanezi F . Sclnnitt PC . 

[institute of Pathology and Molecular Immunology of Porto University (IPATIMUP) 
Braga, Portugal, jparedes@ipatimup.pt ' . 

PURPOSE: P-cadherin overexpression has been reported in breast carcinomas where it 
was assocated with proliferative high-grade histological tumors. This study aimed to 
analyze P-cadherin expression in invasive breast cancer and to correlate it with tumor 
markers, pathologic features, and patient survival. Another purpose was to evaluate the P- 
cadherm promoter methylation pattern as the molecular mechanism underlying this gene 
regulation. EXPERIMENTAL DESIGN: Using a series of invasive breast carcinomas P- 
cadhenn expression was evaluated and correlated with histologic grade, estrogen 
receptor, MIB-1, and P 53 and c-erbB-2 expression. In order to assess whether P-cadherin 
expression was associated with changes in CDH3 promoter methylation, we studied the 
methylation status of a gene S'-flanking region in these same carcinomas. This analysis 
was also done for normal tissue and for a breast cancer cell line treated with a 
demethylating agent. RESULTS: P-cadherin expression showed a strong correlation with 
high histologic grade, increased proliferation, c-erbB-2 and p53 expression, lack of 
estrogen receptor, and poor patient survival. This overexpression can be regulated by 
gene promoter methylation because the 5-Aza-2'-deoxycytidine treatment of MCF-7/AZ 
cells increased P-cadherin mRNA and protein levels. Additionally, we found that 71% of 
P-cadhenn-negative cases showed promoter methylation, whereas 65% of positive ones 
were unmethylated (P = 0.005). The normal P-cadherin-negative breast epithelial cells 
showed consistent CDH3 promoter methylation: CONCLUSIONS: P-cadherin expression 
was strongly associated with tumor aggressiveness, being a good indicator of clinical 
outcome. Moreover, the aberrant expression of P-cadherin in breast cancer might be 
regulated by gene promoter hypomethylation. 
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Ma m ma ry-de rived growth inhibitor protein a ad messenger ribonucleic 
acid concentrations in different physiological states of the gland. 

PpKh's f, Gorcvvtt RC MuUer T, Grossc R . 

Department of Animal Science, Cornell University, Ithaca 14853, 

Expression of mammary-derived growth inhibitor in tissue from Iactating and involuting 
bovine mammary glands was investigated. Seventeen Iactating, pregnant (220 to 272 d in 
gestation) cows were divided in two groups of 8 and 9 cows each. Cows of the first group 
were slaughtered while in lactation. Cows of the second group (9 involuting cows) were 
slaughtered at 13 to 52 d following sudden cessation of milking. High concentrations of 
mammary-derived growth inhibitor (.63% of the total protein) were detected in mammary 
tissue of Iactating cows. Mammary-derived growth inhibitor (less than .10% of the total 
protein) was dramatically reduced during most of the involution period (1 3 to 45 d 
following cessation of milking). Mammary-derived growth inhibitor was again detected * 
(.28% of the total protein) during the last stage of the involution (46 to 53 d after 
cessation of milking), which coincided with colostrum formation. When steady state 
concentrations of mammary-derived growth inhibitor mRNA were examined, the results 
obtained mirrored those obtained at the protein concentration. These data suggest that 
regulation of mammary-derived growth inhibitor occurs via modulation of the steady 
?tate concentration of its mRNA. Furthermore, there is a strong correlation between 
mammary-derived growth inhibitor expression and lactation in dairy cows. 
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T-cell receptor V beta-family usage in primary cutaneous and primary 
nodal T-cell non-Hodgkin's lymphomas. 

Wiche a DF , van Vlocen WA de Weper RA ~~ -iS™ 

Department of Pathology, University Hospital Utrecht, The Netheriands. 

To evaluate whether the expression of T-cell receptor (TCR) V beta families in eight 
c^f mahgnantT-cc.1 lymphomas took place in a preferential manner, we^ed 
ZuZ „l m JM S fU ? 8 ° ldeS <MF)- the ™* common form of primary cutaneous T- 

nt \Z 5?? '? 'r^ 011135 f ° Ur ~ of P rim ^ nodal T-cell NHL. The 

usage of V beta famines m T-cell populations was investigated on mRNA that was 
uansenbed to cDNA using a C beta primer and reverse transcriptase. Subsequently the 
specific usage of the families was analyzed by polymerase chain reaction (PCR) using 
comb nations of the selected C beta-oligonucleotide primer and one of the family-sp^nc 

P "T erS ' , Penpheral blood lymphocytes, from four healthy volunteers and I 
Tn T Zlr P ?u SCrVed 33 3 COntr °' and e *P tes * e d all 20 V beta families tested for 
hx T-cell hncs, with restricted V beta expression, and in three patients with advanced MF 
only one or two V beta families were expressed at the mRNA level. In an early MF idn 
fliis monoclonal expresston was absent several Vbeta families were expressed with a 
weak mtens.ty This may indicate either a polyclonal origin of MF, or that too few 
monoclonal neoplastic cells were present in the tissue specimen. In the four nodal T-cell 
NHL, only one famdy could be clearly distinguished, whereas some of the other V beta 
lamthes showed only a weak expression. These latter families represent the reactive T- 
cell component m the nodal T-cell NHL. Both in nodal T-cell NHL and in MF there was . 

3et PCR hTTJ 1 ° f 3 PartiCU,af V bctafami, y- ™« was a good correlation 
between PCR data and the expression of V beta-family protein products observed by 

ca^or MF 7!^ ° n sections of ^ T-cell lymphomas. All T-cell lines, three 
cases of MF, and three cases of nodal T-cell NHL showed a rearrangement of the TCR 
beta chain on DN A level. 
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MatriIia-3 in human articular cartilage: increased expression iu 
osteoarthritis. 

Putlig O, Wcscloh G , Kirtt AR. Wagener R . Swoboda B . 

Division of Orthopaedic Rheumatology, Department of Orthopaedics University of 
Erlangen-Nuremberg, Rathsberger Str. 57, D-9 1 054 Erlangen, Germany. 
ohver.pullig@med.uni-erlangen.de 

OBJECTIVE: Matrilin-3 is a member of the recently described matrilin family of 
extracellular matrix proteins containing von Willebrand factor A-like domains. The 
matrilin-3 subunit can form homo-tetramers as well as hetero-oligomers together with 
subunits of matrilin- 1 (cartilage matrix protein). It has a restricted tissue distribution and 
is strongly expressed in growing skeletal tissues. Detailed information on expression and 
distribution of extracellular matrix proteins is important to understand cartilage function 
. in health and in disease like osteoarthritis (OA). METHODS: Normal and osteoarthritic 
cartilage were systematically analysed for matrilin-3 expression, using 
inimunohistpchemistry, Western blot analysis, in situ hybridization, and quantitative 
PCR. RESULTS: Our results indicate that matrilin-3 is a mandatory component of 
mature articular cartilage with its expression being restricted to chondrocytes from the 
tangential zone and the upper middle cartilage zone. Osteoarthritic cartilage samples with 
only moderate morphological osteoarthritic degenerations have elevated levels of 
matrihn-3 mRNA. In parallel, we found an increased deposition of matrilin-3 protein in 
the cartilage matrix. Matrilin-3 staining was diffusely distributed in the cartilage matrix 
with no cellular staining being detectable. In cartilage samples with minor osteoarthritic' 
lesions, matnlin-3 deposition was restricted to the middle zone and to the upper deep 
zone. A strong correlation was found between enhanced rnarrilin-3 gene and protein 
expression and the extent of tissue damage. Sections with severe osteoarthritic 
degeneration showed the highest amount of matrilin-3 mRNA, strong signals in in situ 
hybridization, and prominent protein deposition in the middle and deep cartilage zone. 
CONCLUSION: We conclude that matrilin-3 is an integral component of human articular 
cartilage matrix and that the enhanced expression of matrilin-3 in OA may be a cellular 
response to the modified microenvironment in the disease. Copyright 2002 OsteoArthritis 
Research Society International. 
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Up-regulation of mitochondrial peripheral benzodiazepine receptor 
expression by tumor necrosis factor alpha in testicular leydig cells. 
Possible involvement in cell survival; 

Rcy C , Maudiat'C , Nauretls O , Benahmcd M, Louisot P, Gasnier F . 
INSERM U. 189, Faculte de Medecine Lyon-Sud, BP12, 69921 cedex, Oullins, France- 
Porcine Leydig cells in primary cultures are resistant to tumor necrosis factor alpha 
(TNFalpha) cytotoxicity. Here we report that these cells can be rendered sensitive to 
TNFalpha killing by treatment with the translational inhibitor cycloheximide, suggesting 
the existence of proteins that can suppress the death stimulus induced by the cytokine. In 
search of these cytoprotective proteins, we focused on the constituents of the 
mitochondrial permeability transition pore (PT pore), whose opening has been shown to 
play a critical role in the TNFalpha-mediated death pathway. We found that TNFalpha 
up -regulated mRNA and protein expression of the mitochondrial peripheral 
benzodiazepine receptor (PBR), an outer membrane-derived constituent of the pore. A 
strong correlation was established between the resistance of the cells to TNFalpha killing 
and the density of PBR-binding sites. Concomitantly, TNFalpha down-regulated Bcl-2 
mRNA and protein expression. As Bcl-2 has been shown to be an endogenous inhibitor 
of the PT pore, we hypothesize that the TNFalpha-induced up-regulation of PBR 
expression may compensate for the decrease in Bcl-2 levels to prevent the opening of the 
PT pore. 
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GLUTl messenger RNA and protein Induction relates to the malignant 
transformation of cervical cancer. 

Rudlovvski C Becker AJ. Schroder W, Rath W , Buttncr R , Moser M . 

Dept of Gynecology and Obstetrics, University Hospital Heidelberg, Vossstr 7-9, D- 
69 1 1 5 Heidelberg, .Germany. 

We studied whether induction of glucose transporters (GLUTs) I to 4 correlates with 
human papillomavirus (HP V)-dependent malignant transformation of cervical epithelium. 
Tissue samples of cervical intraepithelial neoplasia (CIN; grades 1 to 3), invasive 
carcinomas, and lymph node metastasis were examined. HPV typing was performed. 
Tissue sections were immunostained with GLUTl to GLUT4 antibodies. Messenger 
RNA (mRNA) in situ hybridization confirmed GLUTl protein expression. Weak 
expression of GLUTl was found in nondysplastic HPV-positive and HPV-negative 
epithelium; significant expression was observed in preneoplastic lesions, correlating with 
the degree of dysplasia. In CIN 3 high-risk HPV lesions, cervical cancer, and metastasis, 
GLUTl was expressed at highest levels with a strong correlation of GLUTl mRNA and 
protein expression. Irnmunostains for GLUT2 to GLUT4 were negative. Cervical tumor 
cells respond to enhanced glucose utilization by up-regulation of GLUTl . The strong 
induction of GLUTl mRNA and protein in HPV-positive CTN 3 lesions suggests GLUTl 
overexpression as an early event in cervical neoplasia. GLUTl is potentially relevant as a 
diagnostic tool and glucose metabolism as a therapeutic target in cervical cancer. 
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ExpressioD aad distributiou of laminin alphal and alpha2 chains in 
embryonic and adult mouse tissues: an immunochemical approach, 

Sasaki T, Giltay R , Taks U, Timpl R Tales JF . 

Max-Planck-Institute for Biochemistry, Martinsried, D-82152, Germany. 

Protem levels, mRNA expression, and localization of laminin alphal and alpha2 cliains in 
development and in adult mice were examined. Recombinant fragments were used to 
obtain high-titer-specific polyclonal antibodies for establishing quantitative 
radioimmuno-inhibition assays: This often demonstrated an abundance of alpha2 chain, 
but also distinct amounts of alpha I chain for adult tissues. The highest amounts of alphal 
were found in placenta, kidney, testis, and liver and exceeded those of alpha2. All other 
tissue extracts showed a higher content of alpha2, which was particularly high in heart 
and muscle when compared to alphal . Content of gammal chain, shared by most 
laminins, was also analyzed. This demonstrated gammal chain levels being equal to or 
. moderately exceeding die sum of alphal and alpha2 chains, indicating that these iso forms 
represent the major kno wn laminin iso forms in most adul t mouse tissues so far exarnine<L 
Moreover, we found good correlation between radioinxmuno-mhibition data and mRNA 
levels of adult tissues as measured by quantitative real-time reverse transcriptase-PCR. 
Embryonic tissues were also analyzed by radioimmuno-inhibition assays. This 
demonstrated for day 1 1 embryos comparable amounts of alphal and gammal and a 
more than 25-fold lower content of alpha2. This content increased to about 10% of 
alphal in day 13 embryos. The day 1 8 embryo showed in heart, kidney, and liver, but not 
yet in brain and lung, alphal /alpha2 chain ratios comparable to those in adult tissues. 
Inununostaining demonstrated alphal in Reichert's membrane (day 7.5), while alpha2 
could not be detected before day 11.5. These data were compared with 
immunohistochemical localization results on several more embryonic and adult tissue 
sections. Our results regarding localization are consistent with those of earlier work with 
some notable exceptions. This was in part due to epitope masking for monoclonal 
antibodies commonly used in previous studies in esophagus, intestine, stomach, liver, 
kidney, and spleen. 
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Discordant regulatiou of granzyme H and granzyme B expression in 
human lymphocytes. 

Sedelies KA, Sayers TJ, Edwards KM , Chen W. PeUicci DG . Godfrey PI. Tr aE ani 

J A . 

Cancer Immunology Laboratory, Peter MacCallum Cancer Centre, Locked Bag 1 
A'Beckett Street, East Melbourne, 8006, Australia. 

We analyzed the expression of granzyme H in human blood leukocytes, using a novel 
monoclonal antibody raised against recombinant granzyme H. 33-kDa granzyme H was 
easily detected in un fractionated peripheral blood mononuclear cells, due to its high 
constitutive expression in CD3(-)CD56(+) natural killer (NK) cells, whereas granzyme B 
was Iessabundant. The NK lymphoma cell lines, YT and Lopez, also expressed high 
granzyme H levels. Unstimulated CD4(+) and particularly CD8(+) T cells expressed far 
lower levels of granzyme U than NK cells, and various agents that classically induce T 
cell activation, proliferation, and enhanced granzyme B expression failed to induce 
granzyme H expression in T cells. Also, granzyme H was not detected in NK T cells, 
monocytes, or neutrophils. There was a good correlation between mRNA and protein 
expression in cells that synthesize both granzyrnes B and H, suggesting that gzmH gene 
transcription is regulated similarly to gzmB. Overall, our data indicate that although the 
gzmB and gzmH genes are tightly linked, expression of the proteins is quite discordant in 
T and NK cells. The finding that granzyme H is frequently more abundant than granzyme 
B in NK cells is consistent with a role for granzyme H in complementing the pro- 
apoptotic function of granzyme B in human NK cells. 
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BCL2 protein expression parallels its niRNA level in normal and 
malignant B cells. 

Shen Y , Iqbal J, Huang JZ, Zhou G , Chan WC. 

Department of Pathology and Microbiology, University of Nebraska Medical Center, 
Omaha, USA. 

The regulation of B-cell lymphoma 2 (BCL2) protein expression in germinal center (GC) 
B cells has been controversial. Previous reports have indicated posttranscriptional 
regulation plays a dominant role. However, a number of recent studies contradicted these 
reports. Using real-time polymerase chain reaction (PCR) and Standardized Reverse 
Transcriptase-PCR (SiaRT-PCR), we measured the level of mRNA expression in GC, 
mantle zone (MNZ), and marginal zone (MGZ) cells from laser capture microdissection. 
Both quantitative RT-PCR measurements of microdissected GC cells from tonsils 
showed that GC cells had low expression of BCL2 transcripts conxmensufate with the 
low protein expression level. These results are in agreement with microarray studies on 
fluorescence r activated cell sorter (FACS)-sorted cells and microdissected GC cells. We 
also examined BCL2 mRNA and protein expression on a series of 30 cases of diffuse 
large B-cell lymphoma (DLBCL) and found, in general, a good correlation. The results 
suggested that BCL2 protein expression is regulated at the transcriptional level in normal 
B cells and in the neoplastic cells in most B-cell lyrnphoproliferative disorders. 
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Quantitative determinations of the steady state transcript levels of 
hexokinase isozymes and glucose transporter isoforms in normal rat 
tissues and the malignant tumor ceil Line AH130. 

Shinohara Y , Yamamoto K . InooJK, Yamazaki N. Tcrada H . 

Faculty of Pharmaceutical Sciences, University of Tokushima, Japan. 
yasuo@ph.tokushima-u.ac.jp 

The steady state transcript levels of the four hexokinase (HK) isozymes and four glucose 
transporter (GLUT) isoforms were determined quantitatively by Northern analysis of 
RNA samples from rat tissues using synthetic fragments of the RNAs encoding the HK 
isozymes and GLUT isoforms. Results showed that the levels of HK isozyme- transcripts 
were low in rat tissues, the level of that most highly expressed, the type I isozyme (HKl), 
in the brain being 0.025% of the total poly(A)+ RNA. A good correlation was found 
between the reported HK activities and the total amounts of transcripts encoding all HK 
isozymes in various tissues, showing that the HK activities in tissues can be estimated 
from the total amount of transcripts encoding HK isozymes. The proposed associated 
expressions of HK isozymes and GLUT isoforms in particular tissues were confirmed at 
their transcript levels. The steady state transcript levels of type U HK and the type I 
GLUT isoform in the malignant tumor cell line AH130 were also determined 
quantitatively. 
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UVA irradiation-induced activation of activator protein-l is correlated 
with induced expression of AP-1 family members in the human 
keratiuocytc cell line HaCaT. 

Silvers AL, Bowden GT . 

Department of Radiation Oncology, Arizona Cancer Center, The University of Arizona, 
Tucson 85724, USA. 

To determine whether the transcription factor activator protein- 1 (AP-1) could be 
modulated by ultraviolet A (UVA) exposure, we examined AP-l DNA-binding activity 
and transact! vation after exposure to UVA in the human immortalized keratinocyte cell , 
line HaCaT. Maximal AP-l transactivation was observed with 250 U/m2 UVA between 
3 and 4 h after irradiation; DNA binding of AP-1 to the target 12-0- 
tetradecanoylphorbol-13-acetate response element sequence was maximally induced 1-3 
h after irradiation. Both de novo transcription and translation contributed to the UVA- 
induced AP-1 DNA binding. c-Fos was implicated as a primary component of the AP-1 
DNA-binding complex. Other components of the complex included Fra-2, c-Jun, JunB 
and JunD. UVA irradiation induced protein expression of c-Fos, c-Jun, Fra-1 and Fra-2. 
Phosphorylated forms of these induced proteins were determined at specific time points. 
A strong correlation existed between UVA-anduced AP-1 activity and accumulation of c- 
Fos, c-Jun and Fra-i proteins. UVA irradiation also induced c-fos and c-jun mRNA 
' expression and transcriptional activation of the c-fos gene promoter. These results 
demonstrate that UVA irradiation activates AP-l and that c-fos induction may play a 
critical role in the response of these human keratinocytes to UVA irradiation. 
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Rat kidney glutamyl araiaopeptidase (aminopeptidase A): molecular 
identity and cellular localization. 

SongJL, Ye M, Troyanovskava M WHk E. Wilk S. Hcaly DP . 

Department of Pharmacology, Mount Sinai School of Medicine, City University of New 
York, New York 10029. 

Glutamyl aminopeptidase [aminopeptidase A (EAP), EC 3.4. 11 .7] is an ectoenzyme that 
selectively hydrolyzes acidic amino acid residues from the amino terminus of 
oligopeptides. EAP activity, is highest within the kidney and small intestine. The murine 
pre-B cell BP-1/6C3 and the human kidney glycoprotein gpl60 differentiation antigens 
have been reported to have biochemical properties indistinguishable from EAP. It is not 
known, however, if rat kidney EAP is a homologue of these antigens or molecularly 
distinct. Using the reverse transcription-polymerase chain reaction method with 
oligonucleotide primers based on the BP-1/6C3 nucleotide sequence, we isolated a 450- 
bp partial cDNA from rat kidney poly(A)+ RNA. The partial cDNA encoded a predicted 
protein diat was 92% and 86% identical to the murine BP-1/6C3 and human gp 160 
antigens, respectively; the amino acid sequence within the zinc-binding domain was 
completely conserved. Purification of EAP from rat kidney and microsequence analysis 
of a tryptic digest peptide fragment ( 1 8-mer) indicated that the fragment was highly 
similar to a region within the BP-1/6C3 and gpl60 proteins. Northern blot hybridization 
and immunoblot analyses were also consistent with labeling of products the same size as 
reported for the BP-1/6C3 and gpl60 antigens. There was a good correlation between the 
cellular distribution of EAP mRNA and EAP immiinoreactivity, with proximal tubules 
and glomerular mesangial cells having the highest densities. These results indicate that 
rat kidney EAP is a species homologue of the murine BP-1/6C3 and human gpl60 
antigens. Furthermore, on the basis of its cellular localization, rat kidney EAP is likely to 
be involved in degradation of oligopeptides within the glomerulus and die glomerular 
filtrate. Since cells that express EAP also express receptors for angiotensin If, an 
intrarenal vasoactive hormone that is a substrate for EAP, Uiese results further suggest 
that EAP may play a role in modulating the activity of intrarenal angiotensin II. 
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Tumor necrosis factor-alpha upregulatcs the prostaglandin E2 EP1 
receptor subtype and the cycIooxygcuasc-2 isoform in cultured amnion 
WISH cells. 

Spaziani EP , Ccnoit RR , Tsibrts JC , Gould SF. O'Brien WF . 

University of South Florida Health Science Center, Department of Obstetrics 8c 
Gynecology, Tampa 33612, USA. espazian@cornl.med.usf.edu 

Recent studies have demonstrated a strong correlation between infection and preterm 
labor. Preterm delivery is also associated with high levels of cytokines and prostaglandins 
in amniotic fluid. The purpose of this study was to investigate the effect of tumor necrosis 
factor-alpha (TNF-alpha) on die levels of cyclooxygenase, prostaglandin E2 production 
(PGE2), and expression of the PGE2 receptor subtype EP1 in amnion WISH cell culture. 
Amnion WISH cell cultures were incubated in increasing concentrations of TNF-alpha 
(0-50 ng/ml). Changes in cyclooxygenase and' EPl receptor proteins were evaluated by 
Western blot analysis. Changes in EPl mRNA were evaluated by Northern blot, and 
culture fluid concentrations of PGE2 were estimated by enzyme immunoassay (ElA). 
.EPl protein (p<0.01), EPl mRNA (p<0.05), cyclooxygenase-2 (COX-2) protein 
(pO.OOl), and PGE2 concentrations (p<0-01) all increased with increasing 
concentrations of TNF-alpha. Changes in COX- 1 protein were not observed following 
TNF-alpha-incubation. The results suggest that TNF-alpha may play a role in infection- 
induced preterm labor by its pleiotropic ability to simultaneously stimulate COX-2 
activity, PGE2 concentrations, and PGE2 EPl receptor levels in human amnion; . 
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Transcriptional activity of potent glucocorticoids: relevance of 
glucocorticoid receptor isoforms and drug metabolites. 

Spika I. Hammer S/KJeuser B , Korting HC t Sch a fer- Korting M 

Institut fur Pharmazie, Abteilung fur Pharmakologie und Toxikologie, Freie Universitat 
Berlin, Berlin, Germany. 

As compared to standard glucocorticoids (GC), prednicarbate (PC) is favorable in the 
treatment of eczema due to its high benefit/risk ratio. The remarkable anti-inflammatory 
effects of PC are in strong contrast to its reported low glucocorticoid receptor (GR) 
binding affinity. In transfected COS-7 cells we related the transcriptional potencies of 
PC, its metabolites and conventional GC to their receptor binding properties. Moreover, 
the expression pattern of the human GR isoform hGRalpha and its mutual dominant 
negative inhibitor hGRbeta in skin cells have been investigated as well as the influence of 
hGRbeta on receptor binding and transactivation. hGRalpha mRNA and protein was 
largely overexpressed in skin cells. hGRbeta showed no influence on hGRalpha binding 
and transactivation. Concentration response curves indicated the greater transactivation 
potency of betamethasone 17-valerate followed by dexamethasone and prednisolone 17- 
ethylcarbonate. Native PC appeared almost as potent as dexamethasone. With both a 
strong correlation was observed between transactivation and GR binding. Copyright 2003 
S. Karger AG, Basel 
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Specific inhibition of AQP1 water channels in isolated rat intrahepatic bile 
duct units by small interfering RNAs. 

Splinter PL . Masyuk AI , LaRns'so NF . 

Center for Basic Research in Digestive Diseases, Division of Gastroenterology and 
Hematology, Mayo Medical School, Clinic, and Foundation, Rochester, Minnesota 55905, 



Cholangiocytes express water channels (i.e. aquaporins (AQPs)), proteins that are 
increasingly recognized as important in water transport by biliary epithelia. However 
direct functional studies demonstrating AQP-mediated water transport in cholangiocytes 
are limited, in part because of the lack of specific AQP inhibitors. To address this issue, 
we designed, synthesized, and utilized small interfering RNAs (si RNAs) selective for 
AQP I and investigated their effectiveness in altering AQP I -mediated water transport in 
intrahepatic bile duct units (IBDUs) isolated from rat liver. Twenty-four hours after 
transfection of IBDUs with siRNAs targeting two different regions of the AQP 1 
transcript, both AQP I mRNA and protein expression were inhibited by 76.6-92.0 and 
57.9-79.4%, respectively. siRNAs containing the same percent of base pairs as the 
AQP 1 -siRNAs but in random sequence (i.e. scrambled siRNAs) had no effect. 
Suppression of AQP 1 expression in cholangiocytes resulted in a decrease in water 
transport by IBDUs in response to both an inward osmotic gradient (200 mosm) or a 
secretory agonist (forskolin), the osmotic water permeability coefficient (P(f)) decreasing 
up to 58.8% and net water secretion (J(v)) decreasing up to 87%. A strong correlation 
between AQPi protein expression and water transport in IBDUs transfected with AQP1- 
siRNAs was consistent with the decrease in water transport by IBDUs resulting from 
AQPI gene silencing by AQP 1 -siKN As. This study is the first to demonstrate the 
feasibility of utilizing siRNAs to specifically reduce the expression of AQPs in epithelial 
cells and provides direct evidence of the contribution of AQP 1 to water transport by 
biliary epithelia. 
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Type IV collagenase (M(r) 72,000) expression in human prostate: benign and 
malignant tissue. 

Stearns ME . Wan gM. 

Department.of Pathology, Medical College of Pennsylvania, Philadelphia 19129. 

The expression of type IV collagenase (M(r) 72,000) has been examined in tissues from patients 
with benign prostatic hyperplasia (6 patients) and varying Gleason grades of malignant prostate 
cancer (18 patients). Immunoperoxidase labeling indicated that expression of the type IV 
collagenase was weak or nonexistent in benign tissue but consistently strong in the glandular and 
ductal epithelial cells of prostate tumors diagnosed at Gleason grades 1-8. In moderate to 
advanced cancer (i.e., Gleason grades 2 to 8), invasive tumor foci in the stromal tissue produced 
relatively modest amounts of type IV collagenase. The normal stromal tissue (i.e., fibroblasts) 
uniformly failed to produce detectable levels of type IV collagenase in the 24 patients examined. 
Northern and quantitative slot blot hybridization assays demonstrated that collagenase type IV 
mRNA levels were low in benign tissue and high in malignant tumors. In contrast, the stromal 
cells did not express significant amounts of type IV collagenase mRNA. Enzyme-linked 
immunosorbent assays demonstrated that the amounts of type IV collagenase protein correlated 
directly with the mRNA levels in the tumor tissue. The studies suggest that type IV collagenase 
may be selectively overexpressed by malignant, preinvasive prostatic epithelial cells. 
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The decompensated detrusor HI: impact of bladder outlet obstruction on 
sarcoplasmic endoplasmic reticulum protein and gene expression. 

Stein Gong C» Hutches on JC Canning DA . Zderic SA . 

Division of Urology, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, 
USA- 
PURPOSE: Regulation of calcium ion homeostasis has a significant role in smooth 
muscle contractility. The sarcoplasmic endoplasmic reticulum, cafciunii magnesium, 
adenosine triphosphatase (SERCA) is a regulatory ion pump that may have a role in the 
functional outcome after outlet obstruction. We investigate what correlation if any existed 
between SERCA protein and gene expression, and the contractile properties in the same 
bladder. MATERIALS AND METHODS: Standardized partial bladder outlet 
obstructions were created in adult New Zealand white rabbits, which were divided into 
control, sham operated and obstructed groups. Muscle strip studies subcategorized the 
obstructed group into compensated (force greater than 50% of control) and 
decompensated (force less than 50% of control). Microsomal membrane and total RNA 
fractions were prepared from the same bladder tissue. Membrane proteins were used for 
Western blot analysis using a SERCA specific monoclonal antibody, and total RNA was 
assessed with Northern blot analysis. RESULTS: The relative intensities of signals for 
the Western and Northern blots demonstrated a strong correlation between protein and 
gene expression. Furthermore there was a strong association between the loss of SERCA 
messenger RNA and protein expression and loss of bladder function. CONCLUSIONS: 
Bladder contractility after outlet obstruction is influenced in part by smooth muscle cell 
ability to maintain calcium homeostasis via SERCA, The loss of SERCA protein 
expression is mediated by down-regulation in gene expression in the same bladder. These 
data suggest that smooth muscle ion pump gene expression is in part mechanically 
(pressure work) regulated. 
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TNF-alpha and IL-8 arc upregulated in the epidermis of normal human 
skin after UVB exposure; correlation with neutrophil accumulation and E- 
selectiu expression. 

Strickland t , Rhodes LE , Flanagan BF. Friedmann PS . 

Department of Dermatology, University of Liverpool, United Kingdom. 

The in vivo response to ultraviolet B (UVB) radiation in skin is characterized by the 
accumulation of both mononuclear and polymorphonuclear cells within the dermis and an 
induction of vascular endothelial adhesion molecules. Epidermal production of cytokines 
(IL-8 and TNF-alpha) has been strongly implicated in the development of UVB-induced . 
inflammation. In the current study, we examined the time course of IL-8 and TNF-alpha 
mRNA and protein expression in the epidermis over a 24-h period after in vivo UVB 
irradiation. Also, the induction of adhesion molecule expression and the accumulation of 
neutrophils within the dermis were followed. We found constitutive expression of both 
cytokines (mRNA and protein) in the epidermis of unirradiated skin. IL-8 was rapidly 
upregulated after irradiation and mRNA and protein increased at 4 h, reaching a 
maximum between 8 and 24 h. TNF-alpha mRNA and protein was rninimally increased 
by 8 h after UVB irradiation and reached a maximum by 24 h. No significant alteration in 
ICAM-l or VCAM-l expression was observed. E-selectin expression, which was absent 
from control samples, was increased from 4 h onward. and also reached a maximum at 24 
h, coinciding with peak neutrophil accumulation. A strong correlation (r = 0.96) was 
found between number of E-selectin-positive vessels and numbers of infiltrating 
neutrophils at this time. Moreover, because E-selectin expression was increased before 
any apparent increase in TNF-alpha protein (4 h), TNF-alpha does not appear to be 
involved in the early induction of the adhesion molecule, but cytokines such as TNF- 
alpha and IL-8 may act subsequently to augment the inflammatory response. 
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Basic fibroblast growth factor expression is increased in human renal 
fibrogenesis and may mediate autocrine fibroblast proliferation. 

Strutg F > Zeisber£]VJ, Mernmerlein B , SartlerB. Humme) K Becker V ; MuIJer GA . 

Department of Nephrology and Rheumatology, Georg-Augusl-University Gottingen, 
Germany. fstrutz@gwdg.de 

BACKGROUND: Interstitial fibroblasts play a critical role in renal fibrogenesis, and 
autocrine proliferation of these cells may account for continuous matrix synthesis. Basic 
fibroblast growth factor (FGF-2) is mitogenic for most cells and exerts intracrine, 
autocrine, and paracrine effects on epithelial and mesenchymal cells. The aims of the 
present studies were to localize and quantitate the expression of FGF-2 in normal and 
pathologic human kidneys and to study the in vitro effects of FGF-2 on proliferation, 
differentiation, and matrix production of isolated cortical kidney fibroblasts. METHODS: 
FGF-2 protein expression was localized by immunofluoresence double labelings in 
normal and fibrotic human kidneys. Subsequently, interstitial FGF-2 labeling was 
determined semiquantitative^ in 8 normal kidneys and 39 kidneys with variable degrees 
of interstitial fibrosis and was correlated with the morphometrically determined 
interstitial cortical volume, hi addition, FGF-2 expression was quantitated by immunoblot 
analysis in three normal and six fibrotic kidneys. FGF-2 mRNA was localized by in situ 
hybridizations. Seven primary cortical fibroblast lines were established, and expression of 
FGF-2 and FGF receptor- 1 (FGFR-1) were examined. The effects of FGF-2 on cell 
proliferation were determined by bromodeoxyuridine incorporation and cell counts, those 
on differentiation into myofibroblasts by staining for alpha-smooth muscle actin, and 
those on matrix synthesis by enzyme-linked immunosorbent assay for collagen type I and 
fibronectin. Finally, proliferative activity in vivo was evaluated by expression of MJB-1 
(Ki-67 antigen). RESULTS: In normal kidneys, FGF-2 expression was confined to 
glomerular, vascular, and a few tubular as well as interstitial fibroblast-Iike cells. The 
expression of FGF-2 protein was increased in human kidneys, with tubulointerstitiaj 
scarring correlating with (he degree of interstitial fibrosis (r = 0.84, P < 0.01). 
Immunoblot analyses confirmed a significant increase in FGF-2 protein expression in 
kidneys with interstitial scarring. Jn situ hybridization studies demonstrated low-level 
detection of FGF-2 mRNA in normal kidneys. However, FGF-2 mRNA expression was 
robustly up-regulated in interstitial. and tubular cells in end-stage kidneys, indicating that 
these cells are the source of excess FGF-2 protein. Primary cortical fibroblasts express 
FGF-2 and FGFR-! in vitro. FGF-2 induced a robust growth response in these cells that, 
could be blocked specifically by a neutralizing FGF-2 antibody Interestingly, the 
addition of the neutralizing antibody alone did reduce basal proliferation up to 31.5%. In 
addition, FGF-2 induced expression of alpha-smooth muscle actin up to 1.6-fold, but no 
significant effect was observed on the synthesis of collagen type 1 and fibronectin. 
Finally, staining for M1B-] revealed a good correlation of interstitial FGF-2 positivity 



with interstitial and tubular proliferative activity (r = 0.71 , P < 0.01 for interstitial 
proliferation, N - 30). CONCLUSIONS; Interstitial FGF-2 protein and mRNA 
expression correlate with interstitial scarring. FGF-2 is a strong mitogen for cortical 
kidney fibroblasts and may promote autocrine fibroblast growth. Expression of FGF-2 
correlates with interstitial and tubular proliferation in vivo. 
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Adiposity elevates plasma MCP-l levels leading to the increased CDi lb- 
positive mouocytes in mice, 

Takahashi K, MizuaraiS , Araki H Maskiko S , Ishihara A, Kanacani A Itadani B 
Kotani H . 7 ' 

Banyu Tsulcuba Research Institute in collaboration with Merck Research Laboratories 
Tsukuba, Ibaraki 300-261 1, Japan. 

Obesity is currently considered as an epidemic in the western world, and it represents a 
major risk factor for life-threatening diseases such as heart attack, stroke, diabetes, and 
cancer. Taking advantage of DNA microarray technology, we tried to identify the' 
molecules explaining the relationship between obesity and vascular disorders, comparing 
mRNA expression of about 12,000 genes in white adipose tissue between normal, high 
fat diet-induced obesity (DIO) and d-Trp34 neuropeptide Y-induced obesity in mice. 
Expression of monocyte chemoattractant protein-1 (MCP-I) mRNA displayed a 7.2-fold 
increase in obese mice as compared with normal mice, leading to substantially elevated 
MCP-1 protein levels in adipocytes. MCP-1 levels in plasma were also increased in DIO 
mice, and a strong correlation between plasma MCP-1 levels and body weight was 
identified. We also showed that elevated MCP-l protein levels in plasma increased the 
CDI lb-positive monocyte/macrophage population in DIO mice. Furthermore, infusion of 
MCP-1 into lean mice increased the CD 1 lb-positive monocyte population without 
inducing changes in body weight/Given the importance of MCP-l in activation of 
monocytes and subsequent atherosclerotic development, these results suggest a novel role 
of adiposity in the development of vascular disorders. 
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Augmented expression of neuronal nitric oxide synthase in the atria 
parasynipathcticaUy decreases heart rate during acute myocardial 
infarction in rats. 

Takmioro Y, Aoyama T , Tanaka K , Kcyamura R . YuiY . SasayaniaS . 

Department of Cardiovascular Medicine, Graduate School of Medicine, Kyoto 
University, Kyoto, Japan. 

BACKGROUND: Nitric oxide (NO) synthesized within sinoatrial cells recently has been 
shown to participate in the autonomic control of heart rate. We hypothesized that NO. in 
the neuronal cells in the heart was increased and parasympathetically regulated heart rate 
after myocardial infarction (MI). METHODS AND RESULTS: We examined heart rate 
dynamics and neuronal NQ synthase (nNOS) expression and activities in the atria of rats . 
with MI I, 3, 7, and 14 days after MI (n=7 to 22 for each group). Both the mRNA levels 
of nNOS in the atria determined by competitive reverse transcriptase-polymerase chain 
reaction and the protein levels determined by Western blotting were significantly 
increased compared with controls 1, 3, and 7 days after ML nNOS activity in the atria 1 
day after infarction was also increased in MI rats, nNOS immunoreactivity was observed . 
in nerve fibers in the atria. After infusion of a specific inhibitor of nNOS and iNOS, l-(2~ 
Irifluoromethylphenyl) imidazole (TRIM) (50 mg/kg IV), heart rate was. significantly 
(P<0.01) increased in MI rats compared with controls .1 , 3, and 7 days after ML The 
iNOS-specific inhibitor, 1400W (10 mg/kg SC), did not significantly affect the heart rate 
in rats with MI. The effect of TRIM was abolished by.pretreatment. with L-arginine (25 
mg/kg IV) or by parasympathetic blockade with atropine but not by propranolol. There 
was a strong correlation (r=0.837, P<0.0001) between the nNOS protein expression and 
heart rate change after TRIM infusion. CONCLUSIONS: These results indicate that 
increased nNOS parasympathetically decreased heart rate via the production of NO in * 
rats with acute ML 
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Differential upregulatiou of cellular adhesion molecules at the sites of 
oxidative stress in experimental acute pancreatitis. 

Tclek G y Ducroc R , Scoazcc JY . Pasquicr C , Feldmann G . RozeC. 

INSERM U 4 10, Universite Pads 7 Denis Diderot, 75870 Paris, France. 

BACKGROUND: Severe acute pancreatitis (AP)(2) is associated with exaggerated 
leukocyte adherence and activation. Endothelial cellular adhesion molecules (CAMs) can 
be induced by cytokines, but also directly by oxygen free radicals (OFRs), mediated by 
nuclear factor kappa-B (NF-kappa B). We investigated die behavior of inducible CAMs 
in relation to pancreatic oxidative stress. Our novel modification of cerium capture 
histochemistry (reaction of QFRs with cerium produces laser reflective Ce perhydroxide 
precipitates) combined with reflectance confocal laser scanning microscopy (CLSM) 
allows the histological (^demonstration of in vivo OFR production and immunolabeled 
CAMs, or NF-kappa B. METHODS: Taurocholate AP was induced in rats; sham 
operated and normal animals served as controls. To achieve in situ, in vivo reaction of 
cerium with OFRs, animals were perfused with CeCl(3) solution at different time points 
(I, 2, 8, 24 h) and then sacrificed; E-selectin, P-selectin, ICAM-1, VCAM, and NF-kappa 
B p65 were labeled by immunofluorescence (IF) on frozen sections of cerium perfused 1 
pancreata. IF and Ce perhydroxide reflectance were simultaneously detected by CLSM. 
Pancreatic gene expression of the same CAMs was quantified by competitive RT-PCR 
(MIMIC internal control). RESULTS: Control pancreata showed negligible reflectance 
and minimal CAM expression. Early (1, 2 h) AP samples were characterized by intense, 
heterogeneous acinar OFR production, strong P-selectin, and increasing [CAM 
expression, with nuclear translocation of P 65, histologically all colocalizing with the > 
areas of acinar oxidative stress. Adherent polymorphonuclear leukocytes (PMNs) 
displayed weak OFR formation. Later (8, 24 h), a slowly declining P-selectin, but 
persisting ICAM-1 expression, was paralleled by widespread adherence of PMNs 
producing surprisingly large amounts of OFRs. VCAM and E-selectin showed a mild 
increase at 24 h. CAM gene activation was in good correlation with the protein 
expression. CONCLUSIONS: The early acinar oxidative stress is colocalized with NF- 
kappa B activation, preferential P-selectin, and ICAM upregulation in this AP model. 
Subsequently, adherent, activated PMNs become the major source of OFRs, thereby 
contributing to tissue damage. Copyright 2001 Academic Press. 
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Myotonic dystrophy: an unstable CTG repeat in a protein kinase gene. 
Timchenko L . Moncktoa DG . Caskey CT . 

Department of Molecular and Human Genetics, Baylor College of Medicine, Texas 
Medical Center, Houston 77030, USA. 

Myotonic dystrophy (DM) is caused by the amplification of CTG repeats in die T 
untranslated region of a gene encoding a protein homologous to serine/threonine protein 
kinases. £n DM patients the CTG repeats are extremely unstable, varying in length from 
patient to patient and generally increasing in length in successive generations. There is a 
strong correlation between the size of the repeats and the age of onset and severity of the 
disease. The molecular basis of the effect of the CTG expansion on the development of 
the DM phenotype continues to be investigated. The first working hypothesis of the 
molecular mechanism of DM was a reduction in steady-state myotonin-proteinkinase 
(Mt-PK) mRNA and protein levels. However, although the consensus finding is diat die 
Mt PK mRNA and protein levels are decreased in DM patients, it is still not clear if this 
reduction leads directly to the DM phenotype. In this short review we discuss the 
molecular aspects of CTG instability and the expression of the myotonin-protein kinase 
gene in normal and DM populations. 
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Induction of class 3 aldehyde dehydrogenase in the mouse hepatoma cell 
line Hepa-1 by various chemicals. 

Tprro ncn R , Korlcalainen IVT . Karcnlampi SO . 

Department of Physiology, University of Kuopio, Finland. 

The mouse hepatoma cell line Hepa- 1 was shown to express an aldehyde dehydrogenase 
( ALDH) isozyme which was inducible by TCDD and carcinogenic polycyclic aromatic 
hydrocarbons. The induced activity could be detected with benzaldehyde as substrate and 
NADP as cofactor (B/NADP ALDH). As compared with rat liver and hepatoma cell 
ines, the response was moderate (maximally 5-fold). There was an apparent correlation 
between tins specific form of ALDH and aryl hydrocarbon hydroxylase (AHH) in the 
Hepa-1 wild-type cell line-in terms of inducibility by several chemicals. However the 
magnitude of the response was clearly smaller for ALDH than for AHH. Southern Wot 
. analysis showed that a homologous gene (class 3 ALDH) was present in the rat and 
mouse genome. The gene was also expressed in Hepa-1. and mere was a good correlation 
between the increase of class 3 ALDH-specific mRNA and B/NADP ALDH enzyme 
activity after exposure of the Hepa-I cells to TCDD. It is concluded that class 3 ALDH is 
inducible by certain chemicals in the mouse hepatoma cell line, although the respective 
enzyme is not inducible in mouse liver in vivo. 
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Relationship between cyclin Dl and p2l(Wafl/Cipl) during differentiation 
of Iiuman myeloid leukemia cell lines. 

Ullmannova V , Stockbaucr P . Hradcova M Soucck J , Haskovec C . 

Department of Molecular Genetics, Institute of Hematology and Blood Transfusion, U 
Nemocnice 1, 128 20 Prague 2, Czech Republic, ullman@ulikt.cz 

Expression of cell cycle-regulating genes was studied in human myeloid leukemia cell 
lines ML-1 , ML-2 and ML-3 during induction of differentiation in vitro. 
Myelomonocytic differentiation was induced by phorbol ester (12-o-Tetradecanoyl- 
phorbol- 13-acetate, TPA), tumor necrosis factor alpha (TNFalpha) or interferon gamma 
(INFgamma), or their combination. Differentiation (with the exception of TNFalpha 
alone) was accompanied by inhibition of DN A synthesis and cell cycle arrest, Inhibition 
of proliferation was associated with a decrease in the expression of cdc25A and cdc25B, 
cdk6 and Ki-67 genes, and with increased p21(Wafl/Cipl) gene expression, as measured 
by comparative RT-PCR. Expression of the following genes was not changed after 
induction of differentiation: cyclin Al, cyclin D3, cyclin El and p27(Kipl). Surprisingly, 
cyclin Dl expression was upregulated after induction by TPA, TNFalpha with 
IFNgamma or BA. Cyclin D2 was upregulated only after induction by B A. The results of 
the expression of the tested genes obtained by comparative RT-PCR were confirmed by 
quantitative real-time (RQ) RT-PCR and Western blotting. Quantitative RT-PCR showed 
as much as a 288-fold increase of cyclin Dl specific mRNA after a 24h induction by * 
TPA. The upregulation of cyclin Dl in differentiating cells seems to be compensated by 
the upregulation of p2l(Wafl/Cipl)These results, besides others, point to a strong 
correlation between the expression of cyclin Dl and p21(Wafl/Cipl) on the one hand 
and differentiation on the other hand in human myeloid leukemic cells and reflect a rather 
complicated network regulating proliferation and differentiation of leukemic cells. 
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Intestinal carbamoyl phosphate synthase I in human and rat. Expression 
during development shows species differences and mosaic expression in 
duodenum of both species. 

Van Beers EH Rings EH. Posthuma G . Dingemanse MA, Taminiau JA , Hermans 
HS, Eincrhand AW . Puller HA . Dckker J . 

Pediatric Gastroenterology and Nutrition, Department Pediatrics, Emma Children's 
Hospital, Academic Medical Center, Amsterdam, The Netherlands. 

The clinical importance of carbamoyl phosphate synthase I (CPS!) relates to its capacity 
to metabolize ammonia, because CPSI deficiencies cause lethal serum ammonia levels: 
Although some. metabolic parameters concerning liver and intestinal CPSI have been 
reported, the extent to which enterocytes contribute to ammonia conversion remains 
unclear without a detailed description of its developmental and spatial expression 
patterns. Therefore, we determined the patterns of enterocytic CPS! mRNA and protein 
expression in human and rat intestine during embryonic and postnatal development, using 
in situ hybridization and inumunohistochemtstry. CPSI protein appeared during human 
embryogenesis in liver at 31-35 e. d. (embryonic days) before intestine (59 e,d,), whereas 
in rat CPSI detection in intestine (at 16 e.d.) preceded liver (20 e.d.). During all stages of 
development there was a good correlation between the expression of CPSI protein and 
mRNA in the intestinal epithelium. Strikingly, duodenal enterocytes in both species 
exhibited mosaic CPSI protein expression despite uniform CPSI mRNA expression in the 
epithelium and the presence of functional mitochondria in all epithelial cells. Unlike rat, 
CPSI in human embryos was expressed in liver before intestine. Although CPSI was - 
primarily regulated at the transcriptional level; CPSI protein appeared mosaic in the 
duodenum of both species, possibly due to post-transcriptional regulation. 
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Expression of deoxycytidine kinase in leukaemic cells compared with solid 
tumour celllines, liver metastases and normal liver. 

van tier Wilt CL, Kroep JR, Loves WJ . RotsMG, Van Groenineen CJ. Kaspers GJ 
Peters GJ . 

Department of Medical Oncology, VU University Medical Center, Amsterdam, The 
Netherlands. 

Deoxycytidine kinase (dCK) is required for the phosphorylation of several 
deoxyribonucleoside analogues that are widely employed as chemotherapeutic agents. 
Examples include cytosine arabinoside (Ara-C) arid 2-chIorodeoxyadenosine (CdA) in 
me treatment of acute myeloid leukaemia (AML) and gemcitabine to treat solid tumours. 
In this study, expression of dCK mRNA was measured by a competitive template reverse 
transcriptase polymerase chain reaction (CT RT-PCR) in seven cell lines of different 
histological origin, 16 childhood and adult AML samples, 10 human liver samples and 1 1 
human liver metastases of colorectal cancer origin. The enzyme activity and protein 
expression levels of dCK in the cell lines were closely related to the mRNA expression 
. levels (i=0.75, P-0.026 and r=0.86, P=6.007). In AML samples, dCK mRNA expression 
ranged from 1.16 to 35.25 (x-1.0(-3)xdCK/beta-actin). In the cell line panel, the range was 
2.97-56.9 (x I G(-3)xdCK/beta-actin) of dCK mRNA expression. The enzyme activity in 
liver metastases was correlated to dCK mRNA expression (r=0.497, P=0.05). In the liver 
samples, these were not correlated. dCK mRNA expression showed only a 36-fold range 
in liver while a 150-fold range was observed in the liver metastases. In addition, dCK 
activity and mean .mRNA levels were 2.5-fold. higher in the metastases than in the liver 
samples. Since dCK is associated with the sensitivity to deoxynucleoside analogues and 
because of the good correlation between the different dCK measurements in malignant 
cells and tumours, the CT-RT PCR assay will be useful in the selection of patients that 
can be treated with deoxycytidine analogues. 
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Expression of cytokines and growth factors in human 
glomerulonephritides. 

Waldhcrr R , Noronha IL , NiemirZ, Kru&erC, SjcinJH, SturmnG. 
Department of Pathology, University of Heidelberg, Germany. 

Numerous experimental studies point to the potential role of cytokines and growth factors 
in the pathogenesis of renal disease. However, from the various autocrine and paracrine 
mediators identified invitro and in animal models, so far only a few have been 
demonstrated in selected human glomerulopathies. We examined two types of 
glomerulonephritis (GN): extracapillary GN with anri-neutrophil cytoplasmic 
autoantibodies (ANCA), an example of an acute form of GN, and mesangial IgA GN 
usually a chronic form of GN, with immunocytocliemistry, in situ hybridization and the 
polymerase chain reaction. Normal renal tissue from tumour nephrectomies served as a 
control. In ANCA-positive GN with active renal lesions (crescents, glomerular and 
vascular necrosis), infiltrating mononuclear cells in glomeruli and in the interstitium 
expressed interleukin (IL)-1 beta, tumour necrosis factor (TNF)-alpha, IL-2, interferon 
(IFN)-gamma, platelet-derived growth factor (PDGF) and transforming growth factor 
(TGF)-beta. Cytokine expression was also observed in activated resident cells, including 
endothelial cells, capsular epithelial cells, smooth muscle cells of vessel walls, fibroblasts 
and some tubular epithelial cells. In addition, we noted an increase in the cytokine and 
growth factor receptors TNF-R, IL-IR type II, IL-2R, IFN-gamrria R and PDGF beta-R . 
In contrast, in mesangial IgA-GN, IL-l beta, TNF-alpha, IFN-gamma and IL-2 were 
usually absent in glomeruli. Mesangial expansion in this disorder was accompanied by an 
increased expression of PDGF, PDGF beta-R, TGF-beta and IL-6 in mesangial areas. In 
both conditions a good correlation was observed between cytokine expression at the 
mRNA (in situ hybridization) and protein level (immunocytochemistrv) ( ABSTRACT 
TRUNCATED AT 250 WORDS) . ' 
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Malignant trausformatiou of the human endometrium is associated with 
overexpression of lactoferrin messenger RNA and protein. 

Waimcr DK, Padin CJ , Wrona MA , Healy BE , Bentley RC Tsao lVf$ , KohlerMF . 
McLachlan JA . Gray KD 

Department of Obstetrics and Gynecology, Duke University Medical Center, Durham 
North Carolina 27710. 

In the mouse uterus, lactoferrin is a major estrogen-inducible uterine secretory protein, 
and its expression correlates directly with the period of peak epithelial cell proliferation. 
In this study, we examine the expression of lactoferrin mRNA and protein in human 
endometrium, endometrial hyperplasias, and adenocarcinomas using 
itnmunohistochemistry, Western inimunoblotting, and Northern and in situ RNA 
hybridization techniques. Our results reveal that lactoferrin is expressed in normal 
cycling endometrium by a restricted number of glandular epithelial cells located deep in 
the zona basalis. Two thirds (8 of 12) of the endometrial adenocarcinomas examined 
overexpress lactoferrin. This tumor-associated increase in lactoferrin expression includes 
an elevation in the mRNA and protein of individual cells and an increase in the number 
of cells expressing the protein. In comparison, only. 1 of the 10 endometrial hyperplasia 
specimens examined demonstrates an increase in lactoferrin. We also observe distinct 
cytoplasmic and nuclear immunostaining patterns under different fixation conditions in . 
both normal and malignant epithelial cells, similar to those previously reported, in the 
mouse reproductive tract: Serial sections of malignant specimens show a good correlation 
between the localization of lactoferrin mRNA- and protein in individual epithelial cells by 
in situ RNA hybridization and immunohistochemistry. Although the degree of lactoferrin 
expression in the adenocarcinomas did not correlate with the tumor stage, grade, or depth 
of invasion in these 12 patients, there was a striking inverse correlation between the 
presence of progesterone receptors and lactoferrin in all 8 lactoferrin-positive 
adenocarcinomas. In summary, lactoferrin is expressed in a region of normal 
endometrium known as the zona basalis which is not shed with menstruation and is 
frequently overexpressed by progesterone receptor-negative cells in endometrial 
adenocarcinomas. 
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ABSTRACT 

In the moose uterus, lacloferrin is a major eslrogen-inducible uterine 
secrelory protein, and its expression correlates directly v/itb ibe period ot 
peak epithelial cell proliferation. In Ibis study, we examine the expression 
" of lacloferrin mRNX and protein J» human endoroenniro, endometrial 
hyperplasias, and adenocarcinomas using jmmnnohfe I chemistry, West- 
em immunoblottinj, and Norfbem and in tint RKA hybridisation Hech- 
niqnes. Our results reveal ihai Uctoftrrin is expressed in norma] cycling 
endometrium by > restricted number of glandular epithelial cells located 
deep in the jod» basfibs. Two thirds (8 of J 2) of the endometrial adeno- 
carcinomas crammed over express lactoferrin. This tumor- associa ted in- 
crease in lactoferrin expression includes an elevation In the mRNA and 
.pTotein of individual cells and an increase In the number of ctlls express- 
ing the protein. In comparison, only 1 of the 10 endometrial hyperplasia 
specimens examined demonstrates an increase in betoferrin- We also 
observe distinct cytoplasmic and nudtnr immunosulning patterns under 
different fixation conditions in both normal and malignant epithelial cells, 
similar to those previously reported in the mouse reproductive tr3cL 
Serial sections of malignant specimens show a good correlation betwrca 
the localization ©I lacloferrin mRNA and proiein in individual epithelial 
ctlls by in situ RNA hrbriduauon and iromunohisrocnemisirr. Although . 
the degree of loclolerrin expression in the adenocarcinomas did not 
correlate with ibe tumor stage, grade, or depth or invasion in these 12 
patients, there was a striking inverse correlation belween the presence of 
progesterone receptors and lactolerrin in all 8 lac lolerrin- positive adeno- 
carcinomas. In summary, lactoferrin is expressed in a region of normal 
endometrium kno*vp as the zona basalis "which Is not shed with menstru- 
ation and is frequently o?er expressed by progesterone receptor-negative 
cells 5n endometrial adenocarcinomas. 



INTRODUCTION 

The uterus is a sex steroid- responsive organ (ha I plays a major role 
in women's health. Hysitjeclonries were Ibe most frequently per- 
fojincd rrjajoi surgical procedures id a 20-year study interval 
0965-1984; Ref. 1). Fifty-eight to 80% of ibesc 123 million 
procedures wtre performed foi estrogen-i elated disorders of prolifei- 
ati on. Chronic unopposed estrogen exposure, roosi commonly asso- 
ciated with type II ovulatory disordejs, eventually leads to the devel- 
opment of complex endometrial hyperplasia and adenocarcinoma. 
Since the sex steroids, estrogen and progesteione, act on iheii target 
tissues by regulating the expression of a wide variety of signaling 
molecules, identifying these regulatory factors will provide critical 
information towards under standing normal icpj eduction and jepjo- 
duciivc tract pathology. Our emient knowledge of estrogen and pro- 
gesterone acrion on the reproductive tract is based 10 a gjeai extent on 
information collected from lodents (2). Although differences exist 
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betwetD the reptodnciive physiology oX rodents and barnar^ ^ 
xno^se bas been a useful model for studying steroid hormorrc actioD b 
Iht human female repiodudivt tract (3, A), One potential icgulaiorj 
molecule shown to be itgulated by csuogen in the mouse jepToduciive 
Had is laclo/erriD. Lsdoferrin is a basic gjycopjotein with an extrao;* 
dmariJy high affinity fbi iron that was originally discovered in milk. 
This protein is expressed in a wide variety of tissues, most notably in 
polymorphonuclear leukocytes and. roost mammalian exocriDC glan- 
dular secretions. Id the mammary gland (5) and ibe female reproduc- 
tive tract of the moose (6- 8), ladofenin is regulated by endocrine 
hormones. Prolactin stimulates lacloferrin synthesis in the breast; 
whereas in the uterus and vagina, the ovarian sex steroid, 170- 
estradiol, is the inducer. (6, 7, 9). To date, lactoferrin is one of the few 
genes that have been identified in the rodent that are directly regulated 
by estradiol. The lactoferrin gene contains an ERE 3 that is important 
for. regulating its expression in vivo xo the mouse reproductive trad 
Being linked to estradiol, the expression of lactoferrin by the uterine 
epithelium parallels the onset of DNA synthesis. Although sequencing 
information suggests that the human lactoferrin gene also contains a 
functional imperfect ERE in the 5 ^flarJcmg promoter region (10, 11), 
there is very Jittle data regarding lacloferrin expression in the human 
female reproductive tract; 

The purpose of our study was to examine the expression of lacto- 
ferrin in the human endometrium under normal and pathological 
conditions by irnmunohistocbemistry, imcnunoblotling, and Northern 
and in si/w RNA hybridization techniques. In addition, wc looked for 
con clarions between lacloferrin expression and several parameters, 
such as the stage of the. menstrual cycle, the distribution of estrogeD 
and progesterone receptors, rJQER-2/new expression, majkers of cell 
proliferation, and the hislopatbological grade and extent of myomc- 
irial invasion in the adenocarcinomas. Our data demonstrates thai 
lactoferrin is expressed in a very restricted number of glands in the 
basal region of normal human endometrium and is markedly over ex- 
pressed in a significant number of the uterine adrntx^rdrronias by 
PR-negative cells. 



MATERIALS AKD METHODS 

Tissue Preparation Dnd Hisiotojjial Equation. Surgical paibology 
specimens weie obtained bom Duke University Medical CcDier (Durham, 
and tbc Dcpajlmtnt of Pathology si Modum! GeDeial Hospital (Ouebtc, 
Mosnc-sl. Ow-.sJs) Cycling eodcrDcUium was obtained Iron) 22 women t'ajt? 
3)-49). and auopbit eudoroe»iuro was obiaincd bom 7 postmenopausal 
women (agts 64-77). Hystcitciom>e5 fjoro cycling ^omeD wcic performed 
for sobitjosal leiomyomas (n - 6). pelvic, relaxation (n = 8), pelvic pun 
(n = 4)„ peiitontal tndorne uiosis (n = 2), apd cancer ot eidtcr the exoccrvw 
(n = ]) OJ Lbr ovary (n ~ J). In ithhtion, 1? adenorarcinoroas, 3 i typical 
complex hyperplasias, 3 complex liypcrplasias wjtboui atypia, and 4 simple 
hyperplasia were analyzed. Each human incrus was bivilved ibo/lly 111(1 
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bysl erect omy, and endometrium was removed from ibe fandal region, A full 
tbic)a>css biopsy was placed into either 10% "neutral-buffered formalin or 
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Promega Riboprobe iJl (Promeea, Madison, Wi> washed twice in J X SSC for - 
10 mm ai room temperature, digested with RNase [Z8>g/m| RNase-A, 03 



^ .... ^ , x sSC twice for 20 mm each tune at 5(rC, twKC for 20 a> 



embedding, and sectioning at 4 ujij on silarii2ed slides. Histological cvalua- 
rjons of hematoxylin and cosnvstaincd slides were performed blindly by one 
board-certified pathologist Normal endometrial samples were dated by the 
criteria of Noyes t\ ol (12). Endometrial hyperplasias and carcinomas were 
classified according to the current recommendations, of the International 
Society of Gynecological Pathologists under the auspices of WHO (13). 
Histological grading of tumors was performed according to Federation Inter- 
jrttfonaJe des Gynccologistes ct ObstetristeS criteria (14). Each specimen was 
read a minimum of three times, and only specimens that were read consistently 
the same way were included in tbc study. Unstained sections of die same 
issues weie used for the cytochemical analysis of pjotan and mRNA expres- 
sion using specific reagents. A few endometrial samples were frozen for 
subsequent protein and RNA extraction, which were evaluated by Western and 
jsJojlbcrn blotting, lespectively. All horoan lissues were handled with the 
precautions and the. guidelines required by DuJce University and National 
Institute of Environmental Health Sciences- 

ImmunolocnKiDtion. Slides chosen for study were deparaffinized and 
rinsed in 20% glacial acetic acid at 4*C for 15 s to inhibit endogenous alkaline 
phosphatase. AM subsequent incubations arid washes were at room tempera- 
ture. Sections were next equilibrated in PES for 20 min and incubated for 20 
roio with J 5% normal goat serum diluted in PBS to bloc* nonspecific binding. 
Defection of lacioferrin was performed primarily with a rabbit anti-human 
lactoferrin polyclonal antisera gencjated in our laboratory and affinity purified. 
Similai results were also seen wiib a nonaf finny-purified commeicial unrrsera 
(Bio^toex, San Ramon, Ca) Following incubation at room lempcratuje wiib 
primary antisera for 60 min, the sections were washed in PBS twice for 30 min 
each, and lactoferrin was Iocalhed using an alkaline phosphalssc-biotm- 
sueptavjdin detection system (Vcciastain ABOAP kit: Vector Laboratories, 
BurlSngame, CA; di the Super Sensitive Detection System: Biogenex, San 
Ramon, CA> To identijy nonspecific staining, pie immune rabbit serum was 
used in pjacc of ibe primary antibody. The immun or taction was quant itsted by 
determining tbe percentage of glands and the percentage of cells naming for 
lsctojeniD in the 2ons basalis and iht zona funciiona)is, with a minimum of 
300 ccJts counted in each region. PR antibody was ptovided by Geoffrey 
Greene (KD66), and- a commercial source was also used (Biogenex, San 
Ramon, CA). Identical staining patterns were confirmed with both prepara- 
tions. Other commeicial (y obtained amis era include PCNA {Biogenex, San 
Ramon, CA), ER (ER1D5; AMAC, Wcstbiook, Ma), M1B-1 (AMAQ, and 
HER-Z'ne« (Biogenex, San Ramon, CA). The primary antisera incubations 
were 2 b foi tbc PR* J b.for PCNA : M1B-1. and ER, and 30 min for 
HER-2/ncu. Antiscia dilutions weie 1:j00Joi M1B-5 and 1:20 loi HER-2Aw:w. 
Antigen leuieval (Biogenex, San Ramon, CA) was performed befoic adding 
the progesterone primary amisera. 

Western Blot Analysis. Pioteins were extracted bom tndoroetiial biopsies 
by bomogenuation on ice in J% Tiitbn-X and 20 mM Tris-HO (pH 7.4) with 
protease inhibit ois (10 >igftnl kupeptin, 200 KUM>1 apiotinin, and 20 jAg/ml 
phenylmeibyhulionyl fluoride) and clarified by ccnrri/ugaiion at 45,000 rpm 
for 30 min in a Beckman 70.1 Ti lotor: then the supernatant was analysed lot 
piotein concent i at ion by tbe BCA pjottin assay (Pit ice, Rotkfoid, IL). Ali- 
quois of 200 ^ig wtit separated by electrophoresis on an S_3% SDS polyac- 
ijlamide gel, blotted onto niiroceDulose membranes, incubated with poly- 
clonal rabbit antihumw) Hcio/eirin anosaa, and localized with an l7> Mabc)cd 
rfonkej antioebbii immuno olobulin, B5 described previously (7). 

/« Sinj Ttybridiution. All slides ^eit pre Heated with 02 N HO lot 50 
min at room leropeiarwe, digested with l ^c/ml pioteinase-K (Sigma Chtm- 
icai Co.. St. U>uis. MG> in 0.1 m Tris-HCJ fpH 7.4>G.05 u EDTA ioi 35 min . 
at 37"Q and then treated with O.J M iiietbanolaDUue-0_25% acetic inhydiirk 
/or 5 min at. room icmpcratmc and 0.) m Trii- glycine (pH 7.4) jor 30 min at 
mono leniperature. Tht scciions were iubscquendy dchvdiaied with g;aded 
nh?nol, air dried, and prtbvbiid)2cd ai 30*C for ) b in 2 > SSC )0 inM DTT, 
- >■ Denbajdl** .solution, )00 >jr/ml ol boib salmon :pcm> DNA and veast 
tRNA, and 50% formimide (15). The ibdej w ere then hvbtioized overnight at 
50*C in the same medium wiito ]0% dciiian suUsit and 2^30* cpm<}x} of tbc 
ipeci/ic RKA proU. Tl)f laciofciria oligoniiclcoiidc p;obc waJ aniplifwd by 
^CR using piiineis thai jpinn;d nutlrot'idcs 718-3 654 (10) and cloned inio 
"S-labt ltd sense and antisenst RNA prober >*eie insde wjifc (be 



0J X SSC at 55'C, and once foi 20 min at 60* C Tbe sections were then 
debydiated and dipped- in Kodak 3Utmadic)grapbjc emulsion (NTB-2) for 
dejection of specific mRNA expression. The slides were allowed to develop for 
2 weeks. After this period, the slides were developed using Kodak PJ9 
developer and Kodak Rapid Fixer. 

Northern RNA Analysis. Total cellular RNA was purified £rom froieo 
tissue by the guanidme isomrocyanate-eesium chloride method, and poly(A*> 
RNA was isolated by oligo{dT>ccllulo5e chromatography using xnemods 
described previously (15> For Northern blot analysis, po!y(A^RNA'was 
resolved by electrophoresis on 1-5% formaldehyde agarose gels, stained with 
etbidium bionnde, and bansfened to a nylon membrane. Tbe membrane was 
probed with a ^P-labeled bclofcrrin cDNA derived from human uterus (nu- 
cleotides 718-1654; accession no. S52659) using PCR techniques, followed 
bv cloning into pGEM-42 (Promega, Madfeon, WT). In order to insure thai the. 
quality and quantity of RNA analyzed by Nortbem blotting was equivalent 
berween control and treated groups, the Wot was probed simultaneously for 
. glyceialdehydc-3-phosph«te debydiogcnase. 
. Statistical Analysis. Values are presented as means ± SD. Piffercnccs 
between the zona basalb and fundionalis were tested by the two-taDed 
Student's r tcsL • 

RESULTS 

jmrnunobisiochemical Analysis of LsctoferriD Protein 

Norma) Cycling Endometrium. IntmuDOhislochtiDical studies of 
norma) cycling human endometrium localize lactoferrin protein pre- 
dominantly to the glandular epithelium deep in the zona basaKs and 
not to the functionalis (Fig. 1A> ^association of lactoferrin protein 
expression with the zona basalis is statistically significant, (P < 0-001; 
Tig. 2). Two to 56% of the glands express lactofen in at any given time 
during the menstrual cycle. Within positive glands, lactoferrin protein 
immonolocaJization is heterogeneous in that positively staining epi- 
thelial cells ate interspersed with cells negative for lactoferrin expres- 
sion. No apparent differences in morphology, PCNA, EH, or PR 
expression are seen 10 account for the heterogeneous pattern of inua- 
and inteigland lactoferrin expression in normal endometrium- Similar 
to our previous findings in mouse uterine epithelial cells, the positive- 
staining glandular cells of the human endometrium demonstrate two 
distinct irornunostaining patterns foi lactoferrin, cytopJasmic and nu- 
clear (Fig. IB% seen with boLb formalin and BomVs fixation- )n 
evaluating the lempoia! expression of bctofenin, there is a trend 
towards more glands expressing lactoferrin during the secretory 
phase. Because oi the large variance, tbe trend is not statistically 
significant. As expected, the polymorphonuclear leucocytes in the 
endometrium also stain intensely for lactoferrin piotein, which is 
stoied in ibcii secondary grannies (Fig. 3). These results dernonsuate, 
foi tbc fiist time, thai lactoferrin protein is expressed in tbe human 
endomtuiuro predominately by polymorphonuclear leukocytes and 
epithelial cells of glands located deep in tbe 20na basaJis. 

Proliferalivr Endomf trial Disorders: Hyperplasias and Adeno- 
carcinomas. lrnn?unohisiochemisiry reveals that die expression of 
JafjruVj-is psr4r3six inured in 66. 6% (8 nf 32) nf iht. endrunelriaJ 
adenocarcinomas examined- In one-half of these cases (4 ol 12), 
lactolenio is intensely expressed by malignanl epithelial cells 
throughout the entire Tumors (Fig- 4). The oihct four adenocarcinomas 
demonstrate increased staining lor lactcfetrin in concentrated tegtons 
of ihc rurnois. Jn all. tight cases where lactoJenin expression's 
elevated, the alls expressinc lactoferrin have one similarity vvitb 
normal positive gjands in that they demonstrate he* logeneous stain- 
ing of inters frfrsed positive and negiiive cells. Howcvci, the expres- 
sion ol tactolerrin by the malignant cells cl:a:ly difirrs /torn normal 
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positive glands in that the lactoferrin is r>6t Jixniied lo the basal legions 
of the tumors, many more ccBs arc positive, and tbe relative intensity 
of the staining ovci individuals cells is increased. Although increased 
lactoferrin expression is associated with malignant transformation, we 
do not find a correlation between Jactoferrin protein piesence and ibe 
stage, nuclear, giade, Federation Internationale des GynecoJogistes et 
Obstctristes grade, or tbe depth of my ome trial invasion in the 12 
minors studied (Table 1). In sharp contrast to the common dys regu- 
lation of Jactoftirin expression found iD the malignant endometrium, 
only J of 10 endometrial hyperplasia specimens evaluated coniained 
an increased number of cells staining for Jactoferrin. The hyperplastic 
specimen over expressing Jactoferrin was read as complex without 
atypia. 

In Sim and Northern Analyses of Lactoferrin mRNA 
Expression in Normal and Malignant Endometrium 

To funher our urideis:and;ng of !he location cf Iacioferrin protein 
synthesis in the human endometrium, we examined Iacioferrin mRNA 
expression by in siiu and Northern hybridization using specific 33 S- 
labeled probes for human Jactoferrin. No detectable RNA hybridiza- 
tion is observed in 'he normal endometrium by in siw hybridization, 
even in the pjoence of iromunodereciabJe pjotein, usins equivalent 
hybridt2rftion conditions and development limes ai used for tbe ade- 
nocarcinomas. Consistent with the jesohs obtained by in jt'rn hybrid- 
ization, long exposure time^ wcte.requiied to demonstrate lactoferrin 
mRNA in normal endometrium by Northern biot uM»g poJy(A"*)- 
mRNA. This indicates thai "Iacioferrin mRNA is piesrm in- normal i 
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tACTOFOWW DYSRECULATHJH IS ENDOMErWAL CftKCER . - 

tissue but in very low levels, consistent with ibe very limited partem 
of piotein expression in normal endometrium. Equivalent RNA load- 
ing and quality for each specimen was demonstrated by etbidium 
" *" bromide staining of the RNA gels (data not shown) and by probing for 
tbe housekeeping gene glyceraldebyde-3-phosphalc dehydrogenase 
tbat does not fluctuate significantly with tbe metabolic state of the 
tissue- A representative Northern blot is shown in fig. 5. Jn situ 
hybridization witb several aotne^cinomas reveals tbat there is a 
direct correlation between the localization of lactoferrin mRNA and 
tbe immunostaining of expressed Jactoferrin protein (Kg. 4> Lactof- 
errin mRNA is not associated with polymorphonuclear leukoses by 
' in sirv hybridization in either normal or malignant tissue.' - 

Western Biol Analysis 

To confirm the specificity of tbe antisera tbat we- used for unmo- 
nomsiocftemistry, we performed* Western blot ana/ysrs on proteins 
extracted from both normal and malignant endometrium which were 
separated by "8.5% SDS-polyacrylamide gel electrophoresis (Fig, 6), 
Immunoblotting identified a single broad protein band with a molec- 
ular weight between 70,000- 80,000 in both norxoaJ and neoplastic 
endometrial tissue bomogenates, consistent with tbe reported molec- 
ular weight of human lactofenin. Supporting the HjnrmnocytocrxiTrj- 
cal analysis, a representative inununoblot clearly demonstrates that 
tbe proportion of protein thai is Jactoferrin is roarledly increased in 
tbe adenocarcinomas' in comparison to the normal endometrium. Tbe 
molecular weight of Iacioferrin in the adenocarcinomas appears to 
have a slightly higher molecular weight that the predominant form in 
normal tissue. 
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Cor relation of Lactoferrin Expression with the Expression of 
PCNA, Ki 67, Jfl£R-2Areu, ER, and PR 

In an at tempi ro characterize ihc phenotype of endometrial cells 
which express lactoferrin, we pei formed imrounobistocbemistry on 
serial sea ions for the Ki-67 antigen, PCNA, HER-2/nru, laclofenin, 
ER, and PR. in normal tissue, Ki-67 and PCNA expression are cell 
cycle-specifrc markers of cell proliferation (16, 37). Upon analysis of 
normal cycling endometrium, no relationship between lactofeirin pro- 
tein expiession and ER, PR, oi Ki-67 expression was observed. 
Similarly, in nrosl of the adenocarcinomas evaluated, no relationship 
w as noied between lactoferrin and PCNA protein expression. How- 
ler, in one adenocarcinoma (Fig. 7), there was a clear inverse 
lelaiiowbip.secn berwecn Jactofenin and PCNA legalization, which 
was present throughout the entire tumor. Most dramatic, however, was 
a smiting inverse con elation seen between hctoferrin and PR expres- 
sion in 8 of 8 PR-positjvt uterine, adenocarcinomas (rig. 7). Two 
tumors negative for PR ^so did not express lactoferrin. Although an 
inverse correlation was also suggested wirh HER-ZWu and PR, (he 
iDveise correlation was moic precise with Jacio/crrin in these tumors. 

DISCUSSION 

In the mouse uterus, lanoferrin is an esnopt-n- induced meiine 
secretory protein thai is present dtr oughom the epithelium (7). and it 



is expressed concomitantly with epithelial cell proliferation. In con- 
trast to lactoferrin *s "ubiquitous expression in the cstrogeniied mouse 
uterine epimeb'um, Jactofenin protein is limited to glandular epithelial 
cells in the basa) reports of norma) bum an endometrium and usually 
to glands that were directly adjacent to the rroyomctrium {ic, the 
deepest glands of the 2ona basalis). This regional localization of 
lactoferrin . expression is not surprising in that other biocutroical 
parameters have been reported to show site specifipry in prirnate 
endometrium. These parameters include the proliferative Index and 
the expression of the secretory component of IgA (18)l Similar 10. our 
observations in the endometrium, lactoferrin is also expressed region- 
ally in the mammary gland. Id bovine bjeast tissue, lactoferrin is 
localized primarily to the basal alveolar cells (39); whereas in human 
breast tissue, the ductal epithelium appears to be the primary source of 
. secreted laclofenin during Jactation (20). 

Examining the endoroelrium on different days of the menstrual 
cycle demonstrates a trend towards increased lactoferrin expression 
during the luteal phase. Although this data is not statistically signif- 
icant the cyclic variation may be biologically relevant Kim el oL (21) 
recently reported that the basal endometrial epithelial cells are unique 
because they proliferate during the postovulatorj luteal phase. Inter- 
estingly, in the mouse uterus, tbeie is a direct correlation between 
laclofenin expression and epithelial cell proliferation (6). Therefore, 
lactoferrin may have a similar role in the human and mouse endome- 
trium. If lactoferrin expression iscycljc, the ERE in the S'-flanking 
promoter region of the human lactoferrin gene may be activated 
during the luieal phase (30, 31)* 

Another similarity between human and mouse uterine lactofemn 
expression is the observation of two inuDunobistochernicaJ staining 
patterns. In pDe pattern, the aniisera binds primarily over the cyto- 
plasm, and in ibe other, the nucleus is the primary site of localization. 
Although this could represent a fixation artifact, we bave now ob- 
served this pattern in two species and under different fixation condi- 
tions- It has been demonstrated tbal signaling peptides, i.e.. platelet- 
derived growth factor (22), lot- 2 (23), and probasin (24), can be 
selectively directed into the nucleus, cytoplasm, or secreted. It is 
believed that this differential processing may allow proteins ro have 
in I r serine, autocrine, and paracrine roles, depending on the physio- 
logical stale of the cell. The two localization patterns observed suggest 
th3l lactoferrin might have signaling sequences that duecl the final 
destination of the mature peptide. 

In normal endometrium, lactoferrin roRNA is present but in very 
Jow levels. Prolonged exposure times are needed to visualize ibe 
mRNA band with Northern analysis. Although with in situ hybrid-, 
ization the lactoferrin mRNA signal is easily seen in adenocarcino- 
mas, we failed to localize the lactoferrin roRNA in normal endome- 
trium using an equivalent exposure time. These low levels of mRNA 
in normal endometrium suggest that the synthesis and degradation of 
lacrofenin mRNA is more tightly regulated in normal tissue than in 
tire adenocarcinomas. Because of the low levels of message in normal 
tissue, we are unable at this time to definitively conclude that laclo- 
fenin roRNA is synthesized by Ure same epithelial cells which express 
the protein. 

In endometrial adenocarcinomas, malignant transformation of the 
endometrium is associated with the up-regulation of laciolenin 
mRNA and protein biosynthesis. The up- regulation at the RNA level 
is demonstrated by zn increase in steady-state RNA levels using both 
in ^irv hybridization and Northern analysis techniques. In these can- 
cers, we ilso observe an increase in the number of lactofei rin-positrve 
cells, which express both the protein and mRNA. In this smdy, 8 of 
ihr 12 adenocarcinomas evaluated overexpress lacioierrin, compared 
with only 3 of 10 hyperplastic specimens. The form of Jactofenin 
protein txtiicied from endometrial adenocarcinomas appears to have 
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Kg. 4. CDlocaliuixm of Jactofemii pnaein {Up pctxh) u*A 
mRNA (right poach) in a normal poliferaftve tD<ho&ctm>in iwf 
endometrial »dtnoojoi>oroi5 by performing iiorouDohisrocbt/Qb- 
try and in jjto RNA hybridization on serial motors. Dual anatyns 
of protein and mRNA expression itvtab thai {fends in nonakl 
endometrium do not K»vt dcttcUbk mRKA, as mtJS»ftd by mit» 
hybjjdfe>bon (rep pantlr. X 40X.tv>.ctc« analysis of the adeno- 
carcinomas cJeady derooisiraies a direct coneUUon between pro- 
tein bnd RMA expression Jot lactoftrrin (midrib and fcw;w»/»w&; 
X 10). Note ibrt baoforin piotcin and mRNA is distributed, n • 
hetejojcDtous palters in ibt epithelial celts of tbc adenocarcinomas. 
As is shows in Bg. 1, » heterogeneous staining pattern for boot- 
crrin protein is iho seen unjucDtly to normal endoxDttnam. 





a slightly higher moJecuJai weigh! than the protein present in normal 
tissue. This could be due lo alleralions in ibe processing of ihe 
lacioferrin mRNA, protein, or glycosyJation by. the malignant cejls. 
Alternatively, tbeje cou)d be minor differences between lactoferrin 
protein which is present in neutrophils and tbc form synthesized by 
uterine epithehal cells. "We suggest two hypotheses to explain Jactof- 
crrin over expression in endometrial adenocarcinomas. lii the first 
hypothesis,, lacioferrin biosynlbesis is deregulated by the same pro- 
cesses thai lead to the malignant transformation of endometrium. Jf 
this hypothesis, is Hue then iaciofeirin may be a useful marker for 
endometrial adenocarcinoma investigation, and further research is 
needed to determine whethei Jactoferjin plays a contributing tole in 
ibe malignant transformation. A second hypothesis is that lanoferrin- 
positive buroan endometrial adenocarcinomas evolve from the clonal 
expansion of cells >esiding in the regener alive zone (zona basal is) of 
normal endometrium. It is interesting to speculate that lacioferrin 
expression in endometrial cancer may be Jinked to estrogen action in 



some way, since proliferative disorders of human endometrium art 
Jinked to chronic estrogen exposure over several years and sequencing 
data suggests ihaf the promoter for the human lacioferrin gene does 
contain an ERE. 

AJ though the function of lactoferrin is unknown, a variety of 
biological roles have been proposed for lacioferrin which could link 
this protein to a role in cancer, including the regulation of DNA 
synthesis (25-29), modulation of the immune response (25, 30, 3J)> 
and iron tjansport (32). Some forms of lactoferrin are jeported lo have 
FtNase activity (33, 34), Secreted RNases are involved in develop- 
ment, reproductive function, neoplasia, angiogCDesis, and immune 
suppression. (35, 36) If angiogenesis and immunosuppression are 
components of Jactofenin RNase activity, these properties could pro- 
mote tumor growth. 

In the endometrial adenocarcinomas, we observed a heterogeneous 
expression pa Mem for Jactofenin, PCHA, Her-2//iev, ER, and PR. 
With regard to prognosis, patient survival is reportedly worse when 



T»bU J Correlation of lonoferrin and PR typtnsion in hutnrut cnJonyrtrioi nticnc*-OTti>wmo3 
An in>tijf condition indica»es uv>l bcioJtmn and pjorcstcjout wejt noi tj.pi eiscd >d Ok same ;rc>ons of the (uj»o> by iTTimUDohisioc hernial analysis, PF »'«s deiecJcd *ritn 'he 
anuscr? KD65. >nd (Miofcnrm »*« dntcred by a specific polycJon»l amuera. 
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T\9,5. Norton) analysis ot )«:tofoiin mRNA Mission con&ms thai endometrial 
^tnociciDOmM (Lones 3 and 0 ii^iScinrry crv«expi«v the 2 <-Moba« ti^nscDpt^ 
humaj, jaciofc™ i» comparison io norroJ cndorotinwD 7 and;?). Tbc Nontor> 

hvbj»dil»6on data snppwis 0>t in sin> RNA results and confirm* ihsl Jnciofcmn RNA 
tiprcssiori » djsit^btrd in uurine adenocarcinoma Noma! uitrinc tiuves app"» 
comai Jo* steady-suit JtNA IrvcJs oj fecroiLrrin, reOecrins a controlled partem of 
protein expression. Equivalent RNA loading and QMliry i™ «ch spcome* ws drooD- 
slraitd by elbidiom bromide stwniof o/ the RNA gels <da» not shown) »nd by pTobin X for 
■ housekeeping £ene i^JyceraldchydtO-pbDSpbaie cehycrogenast) ibsl does noi fluctuate 
SigDifionlly wiih ibe metabolic siate of ibc lissue. 



LaCTOFEWUH DfVSREGULADON )H ENDOMETRIAL CANCER J 

endometrial adcDOCaidoomas Jose sw steroid receptors (37, 38), have 
a higher proliferative index (39), and demonstrate DNA aneuploidy. 
During the tumor progression of endometrial adtDOcaidnomas, rt 
appears that the loss of steroid hormone receptors occurs earlier than 
cither the increase in proliferation rate or tbe development of DNA 
aneuploidy (40> Id our study, we note a sinking inverse correlation 
between tbe wpressioB of lactoferrm and PR in the endometrial 
adenocarcinomas. An inverse relationship also has been described for 
HER-2//KU and PR in endometrial adenocarcinomas that correlates 
with patient prognosis. Furthermore, in cancers of the human endo- 
metrium, ovary and breast Ber-2//*» expression bas been associated 
with advanced disease and poor survival (41-43). }lER-2/ne» is an 
oncogene thai shares sequence homology with tbe epidermal growth 
factor receptor and is speculated to contribute. to aberrant growth. Of 
note is tbat lactoferrm biosynthesis in the mouse uterus is associated 
with the expression of the epidermal growth factor, like HER-2/new, 
-the epidermal growth factor .receptor is also frequently overexpressed 
in PR negative cells of endometrial adenocarcinomas (44). The am- 
plication of growth factor receptor expression in PR-negative endo- 
metrial adenocarcinomas may be associated with the acquisition of 
giowth autonomy and hormone independence, which may contribute- 
to the poorer prognosis of PR-negative endometrial carcinomas (45). 
Some endometrial adenocarcinomas, including recurrent rumors, can 
be treated successfully with progestine therapy (46, 47). Although 
the significance of the inverse relationship between lactoferrin and PR 
expression is not known, wc speculate that the PR-negative cells do 
not undergo the normal growth inhibition and secretory differ en baDon 
normally associated with progesterone action. 

A survey of human tissues reveals ihat lactoferrin is expressed by 
most normal mammalian exocrine glands and may be a prognostic 
marker in rumors (20). Lactoferrin is found in normal ductule breast 
epithelium and in primary breast carcinomas- In breast tumors, there 
is an inverse correlation between lactoferrin and ERs (20). Notably, 
lactoferrin expression in breast cancer may fall into the same category 
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Pig. 6. Western blot analysis using an antiserum 
sptciBc tot human lacioferrin "« performed on 
prole ini canst 'to from normal and miUgniai cb- 
doir.ctnuin .and sepajaitd by SDS-polyacrylamide 
fct electrophoresis. A broad protein band wtb a 
molecular **cifhi bcr*-ccn 70,000 and 80.000 is 
& iccted in boib normal and neoplastic tr»domuri»l 
■ t»3Ut homoferiaies,. com i; ten l viirt the rcpontd 
mcrfrculai wojhl ol bwroan lacioltrrin (A). Tbe 
moil significant observation fiom iht IminunobJot- 
tirif studies h thai laciclerriD piotcbv is markedly 
ckvaittf i n ibc acenocuunornas, in comparison io 
normal rndon>ctn»m. vbicb jvppons ibc unrnvno- 
ryrochtciici) analysij thsl drrnonsirairj t jrraier 
Dumbo ol ctjls posiiN* lot laciolemn ptoirin in 
tht mt rinc tumors. B. rcUtivt iinount ol pioiein 
loaded in csr h laj»t b) Coornasiit blue Mainin* 
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lACTOFEBSlK BYSJUECULATTOr* & ENDOMETRIAL CA>JC** 



LTF 



PCNA 



Fig. 7. Ctsntliu'tm of lacioforin protein ciprts- 
siou with the PCNA, a marker of proliferation, awl 
PR »s uitastiittf by imrounohisiochrniistry per- 
formed on serial seede**; X 20. in most c*a> i» 
Klaiionsbip between berofemj) and PCKA e*pre> 
sion Is found in either normal tfl roalipta* endo- 
intlmna (drta nol shown). Howcvej, in I of 12 
xfcjiocaiciccanas, m inverse correlation b seta 
berweert Uaofemn (A) and POM (B) Joo 1 *"" 
bon. This striking J»«en> consistm through- 
out the entire tumor. su^gtsun* dV rxwsibiliiy of 
crU cyck regulation of J*aofej7in expression *» 
this atlenewcinoni*. The bottom ptmris cahibil « 
inverse coneHboD between bctofcron (C) and PR 
atpicsska (O), wbitb was seta in ail eight endo- 
metrial adenocarcinomas which e^prfssed PR. 
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PR 



as ibe otbti makers for ER-rjcgative tumOTS, such as amplification of 
EGF receptor. HER-2/neii, and transforming growth factor a expres- 
sion, which are associated with poor prognosis. Id gastiic carcinomas, 
factofenin expression is associated with transformation of specific 
cell types roclnding mtestroal-iype carcinomas, adenomas, and incom- 
plete inie-siinaJ metaplasias (28). Although this is complete specula- 
tion at ibis time, perhaps laciofenin overexpiession Id the various 
malign antics may complement ihc actions of the growth factor path- 
way molecules and contribute to the autonomous giowih of these 
mmois. 

In conclusion, oui studies icvea) that lactofenin is. associated 
with a unique population oi epithelial cells in the zona basalts and 
thai lactolertiD ovcreapressioD may be associated with malignant 
tians/oiroaiion oi ihc human endometrium. Further studies are 
Deeded to elucidate the roJe of Jactoferrin >d normal and patholog- 
ical endometrial physiology- 
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Cell proliferation id human soft tissue tumors correlates with platelet- 
derived growth factor B chain expression: an immunohistochemical and in 
situ hybridization study. 

Wang J. Coltrera MP . Gown AM . 

Department of Pathology, University of Washington, Seattle 98195. 

•The authors tested the hypothesis that the B chain of the platelet-derived growth factor 
(PDGF), a known connective tissue mitogen and growth factor, could be expressed by 
human soft tissue tumors/and that its expression could play a role in the control of cell 
prohferahon in these tumors, Using a set of 56 soft tissue tumors, including benign 
tumors and all three grades of sarcomas, PDGF-B chain protein was localized using 
mununohistochemistry and PDGF-B mRNA was localized using in situ hybridization 
The hypothesis that PDGF-B expression was related to cell proliferation was tested by 
simultaneously demonstrating the expression of the proliferating cell nuclear antigen in 
SSw^ t,SSUe sections of the same tumors. Sixty and 82% of tumors had demonstrable 
PDGF-B mRN A and protein, respectively, with a strong correlation between their 
degrees of expression (P = 0.0001). Among the sarcomas, a strong correlation between 
PDGF-B expression and increasing malignant tumor grade (P = 0.006), and between 
PDGF-B expression and increasing proliferating cell nuclear antigen index (P = 0.01) 
was found. All rumors were also demonstrated to express the beta receptor of PDGF via 
unmunohistochemistry. These studies suggest that PDGF-B expression may be an 
important mediator of cell proliferation control, via an autocrine mechanism, in human 
soft tissue tumors and may correlate with clinical outcome in the sarcomas. 
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Expression of cadherins and catenins in paired tumor and non neoplastic 
primary prostate cultures and corresponding prostatectomy specimens. 

jffaagJ, KrjjlD, Torbenson M, Wang^ BiscegUaM, Stonerj, Thomas A 
DeFlavia P. Dhir R. Becich MJ . : — "' 

Department of Pathology, University of Pittsburgh Medical Center, PA, USA. 

Cadherins are a family of transmembrane proteins that play a crucial role in cell : 
differentiation, cell migration, and intercellular adhesion. Cadherins are associated with 
catenins through their highly conserved cytoplasmic domain. Down-regulation of E- 
cadhenn protein has been shown in various human cancers. This study examined the 
expression of cadherins and associated catenins at the mRNA level. Paired tumor and 
nonneoplastic primary prostate cultures were obtained from surgical specimens 

rnC , ?™f ,P l, eX flU0FesCence reverse transcriptase-polymerase I chain reaction 
l^Mf KI-PCR) and quantitative analysis were performed and correlated with 
immunosta.n results. Six of seven cases of neoplastic cultures showed moderately-to- 
markedly decreased levels of E-cadherin and P-cadherin mRNA. Similar losses of alpha'- 
catenm and beta-catenin mRNA were also observed. The results of QMF RT PCR ' 
showed good correlation with the results of immunohistochemical studies based on 
corresponding formalin-fixed sections. In conclusion, this paper presents a coordinated 
down-regulation in the expression of E-cadherin and associated catenins at the mRNA 
and protein level in most of the cases studied. This down-regulation may play an 
important role in the pathogenesis of prostate cancer. 
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Abstract Cadherins arc a family of transmembrane 
proteins that play a crucial role in cell differentia lion 
cell migration, and intercellular adhesion. Cadherins are* 
associated with catenins through their highly conserved 
cytoplasmic domain. Down- regulation of E-cadberin 
protein has been shown in various human cancers This 
study examined the expression of cadherins and associ- 
ated catenins at the mRNA level. Paired tumor and non- 
neoplastic pnmary prostate cultures were obtained from 
surgical specimens. Quantitative multiplex fluorescence 
reverse Iranscriptase-polyroerase chain reaction (QMF 
RT-PCR) and quantitative analysis were performed and 
correlated with immunostain results. Six of seven cases 
of neoplastic cultures showed moderalely-to-markediy 
decreased levels of E-cadherin and P-cadherin mRNA 
Similar losses of a-catenin and ^-catenin mRNA were 
also observed. The results of QMF RT-PCR showed 
good correlation with the results of immunohistochem- 
ical studies based on corresponding formalin- fixed sec- 
lions. In conclusion, this paper presents a coordinated 
down-regulation in the expression of E-cadherin and 
associated catenins at the mRNA and protein level in 
most of the cases studied. This down-regulation may 
play an important role in the pathogenesis of prostate 
cancer. 
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Prostate cancer is the most common malignant tumor 
and the second leading, cause of cancer death in men 
The clinical course of patients with prostate cancer 
varies widely, and different factors contribute to this 
marked clinical variability, including genetic back- 
ground, hormonal environment, and the invasive 
potential of the tumor. Invasion and metastasis arc the 
hallmarks of malignancy and have been closely linked to 
alterations in cell-to-cell adhesion, cell migration, and 
interactions with extracellular matrix components I22L * 
. Cadherins are a family of transmembrane glycopro- 
teins responsible for maintaining the integrity of tissue 
and are involved in cell differentiation, cell migration 
and intercellular adhesion through a calcium- dependent 
mechanism characterized by homotypic adhesion [35- 
37). Their highly conserved cytoplasmic domains asso- 
ciate with catenins, a group of intracellular proteins thai 
mediate contact between Ihc cadherins and the micro- 
filaments or the cytoskelelon. Each cadherin subclass 
shows a unique tissue distribution: E-cadherin is pre- 
dominantly expressed in epithelial cells and P-cadherin is 
restricted to decidua tissue and the basal or lower layers 
of stratified epithelium [30]. 

The accumulating evidence suggests a decrease or loss 
of function in E-cadherin and P-cadherin in several 
human carcinomas [3, 9, 23, 32}. Loss of heterozygosity 
(LOH) at chromosome 16 in the location of the E-cad- 
herin gene is present in a high percentage of prostate 
cancers |6, 21,26, 28). Decreased expression of E-cadberin 
is seen in various human malignant tumor cell lines, and 
the level of decrease correlates with the invasive potential 
of the tumor cell lines (1, 9, 12, 24, 40, 42). In addition, 
many, but not all, immunohistochemical studies using 
formalin-fixed, paraffin-embedded tissue have shown that 
the E-cadherin protein is decreased in prostate cancer and 
ihe decrease is correlated with tumor grade (4, 8, 13. 27, 
38). Other studies have shown decreased- to-abseni' 
P-cadherin levels, but variable E-cadberin levels J33) 
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Fig- 1 A Ccncscan image of 
QMF RT-PCR of five paired 
non-ncoplaslic and neoplastic 
primary" prostate cultures. 
cDNA corresponding to 50 ng 
of total RNA was subjected to 
21 cycles of PCR, and all 
reactions were done in tripli- 
cate. RNA of both E-cadherin 
and P-cadherin showed moder- 
ate- io- ma iked reduction in pri- 
mary neoplaslic cultures in four 
. of five cases (Cases I, 2, 4, and 
5). B Representative automated 
sequencer traces (ABJ 373A) 
from 21 cycles of QMF 
RT-PCR using cDNA from a b 
paired no n- neoplastic (top 

panel) and neoplaslic {bottom _ ■ ' 

pone!) primary prostate cultures 

(Case 2X and copy number of DCwecpfasoc 

PCR products (a-actin, 

E-cadherin, and P-cadherin) 
. was calculated using peak areas. 

Neoplastic culture showed 

markedly decreased expression 

of both E-cadherin and 

P-cadherin 
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bcta-Acrin 
(170bp) 



E-odhcrin 
( I90bp) 



P-cadherin 
into?) 



neoplastic 



o.rate buffer ( P H 6.0) for antigen retrieval. The avidin-biotin 
compk, method for immunohislochemistry was u.i ize^ from 
Vector Laboratory (Burlingame. Calif.. USA). The E^ dbShT 

Evaluation of immunostaining 

The percentage of positive cells, intensity of the staining a „J cel- 
lular local.zai.on of the staining were examined by two i„de«„d7», 
observers us.ng normal prostate epithelium from the sa« sped 
men as an mternal control. The intensity of the signal waTg^ed 
as strong modcra ^ wcak and ^ Thc 5 J nin ! Wa a S r f; r a n dt j 

^!Tpe:,I a ns COmp3fcd wi,h ,ha ' of norm3 ' n*«iTr m ,h°e f 



Resurts 

Expression of E-cadhcrin and P-cadhcrin 

Since many prostate cancers are known to be his.olog- 
.caly heterogeneous, adjacent H&E-siaincd sections 
of the tissue fragments sent to culture were reviewed 



lo^ssess tissue homdgeneity and lo rule out the presence 
of other diseases. The paired primary cultures for this 
smdy were selected based on the following histologic 
features: (I) non-neoplastic tissue showed no cancerous 
nmJ?* h,Bh ' grade Prostatic intraepithelial neoplasia 
(PJN); (2) neoplastic sections contained less than 5% 
non-cancerous epithelium. Seven out of 38 pairs of 
cultures met the selection criteria and were included in 
this study. Histologically, all seven cases were moder- 
ately difTerenlrated adenocarcinomas, with Gleason 
scores ranging from 5 lo 7 (median = 6). )n six of these 
cases, adjacent sections of tumor contained no benign 
prostate epithelium, and in one case (specimen 4), there 
was less than 5% non-cancerous epithelium. Prostate 
stromal cells express HGF, whereas the epithelial cells 
express c-met, thc receptor for HGF (17, 20). Thc cases 
included, in this study showed no deleclable HGF 
expression after 21 cycles of QMF RT-PCR (data not 
shown).. Thts indicates there was no significant stroma) 
cell contamination in the current epithelial cultures. 

QMF-PCR is an accurate method of measuring the 
relative levels ofmRNA.in small tissue samples |4I]. Jn 
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a- and 0-ealenins. Moderate inlerspecimcn variation was 
observed in the baseline expression of a-catenin and 
/f-calemn mRNA levels in the non-neoplastic cultures. 

Immunohisioehemical studies of cadherins and catenins 

In benign prostate tissue, E-cadherin was, in all cases, 
uniformly localized to the membranes ofluminaJ glan- 
dular epithelial cells, predominantly at cell-cell junctions 
(Fig. 3A). One case of prostate cancer showed complete 
negative staining for E-cadhenn (Fig. 3B), and the re- 
maining six cases demonstrated reduced immunoslaining 
for E-cadherin, with 2S-75% of cancer cells positive 
(Fig. 3Q. The cancerous glands generally showed re- 
duced signal intensity and an altered heterogeneous 
staining pattern, which included focal cytoplasmic stain- 
ing and reduced membranous staining (Fig. 3C, Table 4). 

Benign prostate tissue showed uniform basal cell 
immunoreactivity for P-cadherin, with principally cyto- 
plasmic and focal membranous pattern staining The 



benign, glandular non-basal epithelial cells and stromal 
cells were negative for P-cadherin (Fig. 3D); P-cadherin 
immunoreactivity was completely absent in Iwo cases of 
prostate cancer (Fig. 3E), and the remaining rive cases 
showed variable focal positivity, which was predomi- 
nantly cytoplasmic (Fig. 3F). This focal P-cadherin 
immunostaimng positivity was confirmed by staining 
multiple sections and by using different monoclonal 
antibodies: In some cases, serial sections also appeared 
to show immunoslaining for both E-and P-cadherins 
with the same neoplastic cells. 

In all cases of benign prostate tissue, a- and /f-catenin 
protein expression showed strong homogeneous staining 
of the luminal glandular epithelium and the basal cells 
In a pattern similar to that of normal E-cadherin, the cr- 
and ^catenins were localized predominately at luminal 
epithelial celt-cell, borders (Fig. 2A, C). f„ a |i casc$ of 
prostate cancer, there was a mild- to-moderate reduction 
in staining for both catenins with 50-73% of cells 
positive, and the staining tended lo be heterogeneous 
(Fig. 2B, D; Table 4). 



Table 4 Immunohislochcroical expression of cadherins and catc- ■ 

mw .» pwatectom, specimens corresponding lo primary prostata Dfoctt»k»> 



Case E-cadherin P-cadherin 



a-catenin /f-calenin 



I 

2 
3 
4 
5 
6 
7 



+ + 
+ + + 

f + + 
+ + 
+ + 



+ + 
+ + 
+ +• + 
+ + 
+ -1 + 
■f + + 
+ + 



+ + 
t + 
+ + + 
+ + + 
+ + + 
+ + + 
+ + + 



- Negaiivc; +( <25V. positive; + + , 25-30% posilive; + + + 
50-75/. posmvc; + + + + , >?5% posilive 



In this study, we observed a coordinated down-regula- 
tion of the expression in the genes involved in the 
cadhenn and catkin mediated cell-cell pathway at the 
mRNA level. The protein levels, as demonstrated by 
the immunohistochemical studies on the corresponding 
tissue sections, were generally consistent with the 
mRNA data as well as with that reported in the litera- 
ture fig, 33, 34, 38, 39). 

E-cadherin showed the most consistent loss of ex- 
pression at both the mRNA level and the protein level. 



Fig. iA-D Immunohisioehemi- 
cal staining of a- and l-calenin 
in nonneoplastic prostate and 
in prostate adenocarcinoma. 
Original magnification xl 15. 
A, C Normal membranous 
expression of a-caleoin ami 
/7-caicmn in non-neoplastic 
pros talk: epithelium (Case 1). 
B Decreased immunostaimng 
for tr-catenin in prostate . 
adenocarcinoma (Case I). 
D Decreased and be (erogenous 
immunoslaining for 0-catemn 
in prostate adenocarcinoma 
(Case I) 
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appears to be derived by different combinations of a 
relatively few factors common to many types of tissues, 
and does not appear to be derived from transcription 
factors specific for each type of tissue. In our study, the 
. . ° f cx P ression «>f «- and ^catenins at the 
r v J?* WaS COrrc,atcd ^*t»> reductions in the levels 
of E-cadhexin expression. Although the exact mecha- 
nism of th,s coordinated down-regulation is not known/ 
he coordmated pattern supports ibe hypothesis that 
loss or alteration of some regulatory factors occurs 
during prostate tumor progression. Possible mechanisms 
include transcriptional factor alterations or hypcr- 
metbylation of the promoter region [14, 43]. Limited 
information is available on the transcriptional regula- 
tion of the catenins; but it is possible that similar 
mechanisms may play a role. 

Jn summary, this is the first comparative study of the 
expression of the genes, involved in the cadherin-medi- 
ated cell-cell adhesion pathway at the mRNA level mine 
paired neoplastic and nonneoplastic primary cultures 
denved from, prostatectomy specimens. Our results 
indicate that (1) there is a marked palienl-to-paiient 
van aI>on in the^ normal levels of the cadherins and 
ca enins; (2) mRNA levels of E-cadherin as well as 
catemns are stgnificantly reduced in some prostate can- 
cer primary cultures, and the reduction tends to be to the 
same degree in each tumor, suggesting a defect in a 
regulating mechanism common to all of these genes; (3) 
P-cadhenn appears to be present at both the mRNA 
level and the protein level in some prostate cancers. This 
coordmated down-regulation of E-cadherin and calenin- 
mediated adhesion pathways may play a crucial role in 
tumor pathogenesis and metastasis. 
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Down-regulation of prostate-specific antigen expression by finasteride 
through inhibition of complex formation between androgen receptor and 
steroid receptor-binding consensus in tfie promoter of the PSA gene in 
LNCaP cells. 

Wang-LG, Liu XM Kreis W, Budman DR . 

Department of Medicine, New York University, Manhasset I I 030, USA. 

As a specific competitive inhibitor of Salpha-reductase, an intracellular enzyme that 
converts testosterone to dihydrotestosterone, finasteride is being extensively used for the 
treatment of benign prostatic hyperplasia and in experimental settings for prostate cancer. 
In this study, we showed that finasteride markedly inhibited prostate-specific antigen 
(PSA) secretion and expression. The promoter of the PSA gene contains several well- 
known cis-regulatory elements. Among them, steroid receptor-binding consensus (SRBC) 
has been identified as a functional androgen-responsive element Our previous study 
showed that PSA was not only present in conditioned medium of the PSA-positive 
LNCaP cells but was also detectable in small amounts in PSA-negative cell lines, PC-3 
and DU-145 (L. G. Wang et aL, Oncol. Rep., 3: 911-917, 1996). A strong correlation 
between binding of nuclear factors to SRBC and the level of PSA present in the 
conditioned medium and cell extracts was found in these three cell lines, whereas no such 
correlation with binding was obtained using Spl oligonucleotide as a probe. Binding of 
LNCaP cell nuclear proteins to SRBC was diminished when the cells were exposed to 25 
microM finasteride, at which concentration 50% of both PSA mRNA and protein were 
inhibited. As a major component of DNA-protein complexes, the level of androgen 
receptor was dramatically decreased in the cells treated with finasteride. Our data indicate 
that inhibition of complex formation between SRBC and nuclear proteins due to the 
remarkable decrease in the level of androgen receptor plays a key role in the down- 
regulation of PSA gene expression by finasteride in LNCaP cells. 



PMID; 9044850 [PubMcd - indexed for MEDLINE] 
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Expression of calcyclin in human melanocytic lesions. 

Weterman MA, van Muijen GN , Blocmers HP, Ruitcr DJ , 

Department of Biochemistry, University of Nijmegen, The Netherlands. 

When comparing two subsequent stages of melanocytic tumor progression we identified 
calcyclin as a new potential progression marker, the expression of which was correlated 
with metastatic behavior of various human melanoma ceil lines in nude mice. In this 
study, we describe a good correlation between RNA and protein levels in the xenografts 
of these cell lines and extended these experiments to a panel of 120 routinely processed 
human melanocytic cutaneous lesions. Northern blot analysis demonstrated that calcyclin 
RNA expression was elevated in melanoma metastases as compared to several types of 
neyocellular nevi. Calcyclin staining using a specific polyclonal antiserum showed a 
more complex pattern. A stronger staining in a higher percentage of positive cells was 
observed in thick primary melanoma (> or = 1.5 mm) as compared to thin primary 
melanoma (< 1.5 mm). Calcyclin expression was also present in a higher percentage of 
cells showing a stronger staining in melanomas with higher Clark levels (> II) 
corresponding to the vertical growth phase of primary melanomas. Protein expression in 
nevocellular nevi was confined to the dermal part and was highest in the lower parts of 
the dermis. Remarkably, dysplastic nevi (atypical moles), potential precursors of 
melanoma, did not show any expression at all, either in junctional or dermal parts. 
Confinement of the expression to the dermal part of nondysplastic nevi and primary 
melanomas may reflect interactions with the microenvironment of the reticular dermis 
that occurs with vertical growth! 

PMID: 8261423 [PubMed - indexed for MEDLINE] . 
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Estrogen regulation of the cytochrome P450 3A subfamily in humans. 

Williams ET, Leyk M, Wrighton SA, Davics PJ , Loose PS, Shipley GL, Strobel HW . 

Department of Biochemistry, Medical School, University of Texas Health Science Center 
atHouston, 6431 Fannin, MSB 6.200, Houston, TX 77030, USA. 

This study examines the possible role of estrogen in regulating the expression of the 
human CYP3A subfamily: CYP3A4, CYP3A5, CYP3A7, and CYP3A43. To accomplish 
this goal; raRNA was quantified from human livers and endometrial samples, and total 
CYP3A protein levels were evaluated by Western imrhunoblot analysis of the liver 
samples. The human endometrial samples were from premenopausal and postmenopausal 
women. The premenopausal endometrium was either in the proliferative or secretory 
: phase, whereas for the postmenopausal endometrium samples, the women had been 
treated with either a placebo or estropipate, an estrogen substitute. After analyses, 
CYP3A4 mRNA was shown to have lower hepatic expression in females thain in males. 
In the endometrium, CYP3 A4 and CYP3 A43 are down-regulated by estrogen, whereas 
CYP3 A5 is expressed at higher levels during the secretory phase. CYP3A7 was not 
detected in the endometrium. In addition, the CYP3 A subfamily showed increased 
mRNA expression in the liver as age increased. The expression levels of total CYP3A 
protein and total CYP3A mRNA showed good correlation. Despite apparent regulation of 
CYP3 A4 mRNA expression by estrogen, the effects of estrogen may be overshadowed 
by additional regulators of gene expression. 
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Severely decreased MARCKS expression correlates with ras reversion but 
not with mitogenic responsiveness. 

VVojtaszek PA, Scumpo DJ , Blackshear PJ , Macara fC . 

Department of Pathology, University of Vermont College of Medicine, Burlington 



Phorbol ester-inducible phosphorylation of MARCKS, the '80-kDa< substrate of protein 
kinase C, was undetectable in several phenotypically dominant, non-transformed 
revertants independently derived from the ras-transformed cell line NIH3T3 DT-ras. 
Extremely low expression of MARCKS . protein accounted for this apparent lack of 
phosphorylation. MARCKS -eacodmg mKN A. levels were correspondingly decreased 
relative to normal and ras-transformed cells in all four ras revertant cell lines studied: C- 
1 1 and F-2, derived by 5-azacytidine treatment and selection with ouabain; CHP 9CJ, 
derived by ediylmethahe sulfonate mutagenesis and selection with cis-hydroxy-I^proline; 
and 12- V3, derived by transfection with the human Krev-1 gene. However, re-expression 
of MARCKS after transfection of a cloned MARCKS cDNA into the C-l I ras revertant . 
cells was not sufficient to induce rerransforrnation. In fact, no significant difference in 
sensitivity to mitogenic stimulation by phorbol esters was observed among several cell ■ 
lines expressing widely varying levels of MARCKS. This evidence argues against a 
direct role for MARCKS in mitogenic signaling. However, the strong correlation between 
attenuation of MARCtS expression and phenotypically dominant ras reversion suggests 
that a common negative regulatory mechanism might be responsible for both effects, 
presenting a potentially useful strategy for identifying factors involved in transducing the 
ras signal 
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Outline Abstract 



i Background 

,, Prostate stem cell antigen (PSCA) is a recently defined homologue of the 
, . Tby-l/Ly-6 family of glycosylphosphatidylinositol (GPl)-anchored cell 
surface antigens. The purpose of the present study was to examine the 
expression status of PSCA protein and mRNA in clinical specimens of human 
prostate cancer (Pea) and lo validate it as a potential molecular target for ' 
diagnosis and treatment of Pea. 



Materials and Methods 



Jmmtmohislochemical (IHC) and in situ, hybridization (JSH) analyses of PSCA 
expression were simultaneously performed on paraffin- embedded sections 
from 20 benign prostatic hyperplasia (BPH), 20 prostatic intraepithelial 
neoplasm (PJN) and 48 prostate cancer (Pea) tissues, including 9 androgen- 
independent prostate cancers. The level of PSCA expression was 
semiquantitative^ scored by assessing both the percentage and intensity ol 
PSCA- positive staining cells in the specimens. Then compared PSCA 
expression between BPH, PIN and Pea tissues and analysed the correlations 
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of PSCA expression level with pathological grade, clinical stage and 
progression to androgen-independence in Pea. 

Results 

• In BPH and low grade PIN, PSCA protein and mRNA staining were weak or 
negative and less intense and uniform than that seen in HGP1N and Pea 
There were moderate to strong PSCA protein and mRNA expression in 8 of 
11 (72.7%) HGPIN and in 40 of 48 (83.4%) Pea specimens examined by 
IHC and ISH analyses, with statistical significance compared with BPH 
(20%) and low grade PIN (22.2%) samples (p < 0.05, respectively) The 
expression level of PSCA increased with high Gleason grade, advanced stage 
and progression to androgen- independence (p < 0\05, respectively). In 
add.tion, IHC and JSH staining showed a high degree of correlation between 
PSCA protein and mRNA overexpression. 

Conclusions 

Our data demonstrate that PSCA as a new cell surface marker is 
overexpressed by a majority of human Pea. PSCA expression correlates 
positively with adverse tumor characteristics, such as increasing 
pathological grade (poor cell differentiation), worsening clinical stage and 
androgen-independence, and speculatively with prostate carcinogenesis 
PSCA protein overexpression results from upregulated transcription of PSCA 
mRNA PSCA may have prognostic utility and may be a promising molecular 
target for diagnosis and treatment of Pea. 



Outline Introduction 

Abstract Prostate cancer (Pea) is the second leading cause of cancer-related death in 
intrwi-iction . Am encan men and is becoming a common cancer increasing in China 
ieteri,-.is iv.n- De sp*e recently great progress in the diagnosis and management of 

localized disease, there continues to be a need for new diagnostic markers • 
that can accurately discriminate between indolent and aggressive variants of 
Pea. There also continues to be a need for the identification and 
characterization of potential new therapeutic targets on Pea cells. Current 
diagnostic and therapeutic modalities for recurrent and metastatic Pea have 
been limited by a lack of specific target antigens of Pea. 

Although a number of prostate-specific genes have been identified (i e 
prostate specific antigen, prostatic acid phosphatase, glandular kallikrein 2) 
the majority of these are secreted proteins not ideally suited for many 
immunological strategies. So, the identification of new celf surface antigens 
« critical to the development of new diagnostic and therapeutic approaches 
to the management of Pea. 

mcr^ RE ^ 31 m ' epo,ted ,he identification of prostate stem cell antigen 
dc/~ ° SU '' 3ce an,i 9 en lhat * Predominantly prostate specific The 
PSCA yene encodes a J 23 amino acid glycoprotein, with 30% homology to 
stem cell antigen 2 (Sea 2). Like Sca-2, PSCA also belongs to a member of 

rm. . V "' ,/Ly * 6 famr ' Y a ° d iS anchored b V 0 yfycosylphosphalidy (inositol 
(GPJ) l.nkage. mRNA in situ hybridization (ISH) localised PSCA expression in 
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normal prostate to the basal cell epithelium, the putative stem cell 
compartment of prostatic epithelium, suggesting that PSCA may be a 
marker of prostate stem/progenitor cells. 

In order to examine the status of PSCA protein and mRNA expression in 
human Pea and validate it as a potential diagnostic and therapeutic target. • 
for Pea, we used immunohistochemistry (1HC) and in situ hybridization 
(ISH) simultaneously; and conducted PSCA protein and mRNA expression 
analyses in paraffin- embedded tissue specimens of benign prostatic 
hyperplasia (BPH, n = 20), prostate intraepithelial neoplasm (PIN, n = 20) 
and prostate cancer (Pea, n =' 48), Furthermore, we evaluated the possible 
correlation of PSCA expression level with Pea turnorigenesis, grade, stage 
and progression to androgen-independence. 



Outline Materials and methods 
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Tables 

I - J . J :C_.*L.~"!-"[ 
Table i 

Correlation of PSCA 
expression with Gleason 
score 



-Tshte 2 

Correlation of PSCA 
expression with clinical 
stage 



Tissue samples 

All of the clinical tissue specimens studied herein were obtained from 80 
patients of 57-84 years old by prostatectomy, transurethral resection of 
prostate (TURP) or biopsies. The patients were classified as 20 cases of BPH, 
20 cases of P]N, 40 cases of primary Pea, including 9 patients with recurrent 
Pea and a history of androgen ablation therapy (orchiectomy and/or 
hormonal therapy), who were referred to as androgen-independent prostate 
cancers. Eight specimens were harvested from these androgen-independent 
Pea patients prior to androgen ablation treatment. Each tissue sample was 
cut into two parts, one was fixed in 10% formalin for JHC and the other 
treated with 4% pafaforrnaldehyde/0.1 M PBS PH 7.4 in 0.3% DEPC for 3 h 
for JSH analysis, and then embedded in paraffin. All paraffin blocks 
examined were then cut into 5 jim sections and mounted on the glass slides 
specific for 1HC and ISH respectively in the usual fashion. H&E-stained 
section of each Pea was evaluated and assigned a Gleason score by the 
experienced urological pathologist at our institution based on the criteria of 
Gleason score [2]. The Gleason sums are summari2ed in Table ;i . Clinical 
staging was performed according to Jewett-whitmore-prout staging system, 
as shown in Table 2. In the category' of PIN, we graded the specimens into 
two groups, i.e. low grade PIN (grade j - U) and high grade PIN (HGPJN, 
grade III) on the basis of literatures (3,4). 

Immunohistochermcal (IHC) analysis 

Briefly, tissue sections were deparaffinized, dehydrated, and subjected to 
microwaving in 3 0 mmot/L citrate buffer, PH 6.0. (Boshide, Wuhan, China) in 
a 900 W oven for S min to induce epitope retrieval. Slides were allowed to 
cool at room temperature /or 30 min. A primary mouse antibody specific to 
human PSCA (Boshide, Wuhan,. China) with a J : 3 00 dilution was applied to 
incubate with the slides at room temperature for 2 h. labeling was detected 
by sequentially adding biotinylated secondary antibodies and strepavidin- 
peroxidase, and localized using 3,3'-diaminobenzidine reaction. Sections 
were then counterstained with hematoxylin. Substitution of the primary 
antibody with phosphate- buffered- saline (PBS) served as a negative- 
staining control. 
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mRNA in situ hybridization (ISH) 

Five-pm-thick tissue sections were deparaffinized and dehydrated, then 
digested in pepsin solution (4 mg/ml in 3% citric acid) for 20 min at 37.5°C, 
and further processed for ISH. Digoxigenin-labeled sense and antisense 
human P5CA RNA probes (obtained from Boshide, Wuhan, China) were 
* hybridized to the sections at 48°C overnight- The posthybridization wash 
with a high stringency was performed sequentially at 37°C in 2 x standard 
saline citrate (SSC). for 10 min, in 0.5 x SSC for 15 min and in 0.2 x SSC 
for 30 min. The slides were then incubated to biotinylated mouse anti- 
digoxigenin antibody at 37.5°C for 1 h followed by washing in 1 x PBS for 
20 min at room temperature, and then to strepavidin-peroxidase at 37.5°C 
for 20 min followed by washing in 1 x PBS for 15 min at room temperature. 
Subsequently, the slides were developed with diaminobenzidine and then 
counterstained with hematoxylin to localize the hybridization signals. 
Sections hybridized with the sense. control probes routinely did not show any 
specific hybridization signal above background. All slides were hybridized 
With PBS to substitute for the probes as a negative control. 

Scoring methods 

To determine the correlation between the results of PSCA immunostaining 
and mRNA in situ hybridization, the same scoring manners are taken in the 
present study for PSCA protein staining by IHC and PSCA mRNA staining by 
ISH. Each slide was read and scored by two independently experienced 
urological pathologists using Olympus BX-41 light microscopes, the 
evaluation was done in a blinded fashion. For each section, five areas of 
similar grade were analyzed semiquantitative^ for the fraction of cells 
staining. Fifty percent of specimens were randomly chosen and rescored to 
determine the degree of interobserver and intraobserver concordance. There 
• was greater than 95% intra- and interobserver agreement. 

The intensity of PSCA expression evaluated microscopically was graded on a 
scale of 0 to 3+ with 3 being the highest expression observed (0, no 
= staining; l + , mildly intense; 2-v, moderately intense; 3-f-, severely intense). 
The staining density was quantified as the percentage of cells staining 
positive for PSCA with the primary antibody or hybridization probe, as 
follows: 0 = no staining; 1 = positive staining in <25% of the sample; 2 ~ 
positive staining in 25%-50% of the sample; 3 = positive staining in >50% 
of the sample, intensity score {O.to 3+) was multiplied by the density score 
(0-3) to give an overall score of 0-9 |.!,5J. In this way, we were able to 
differentiate specimens that may have had focal areas of increased staining 
from those that had diffuse areas of increased staining The overall score 
for each specimen was then categorically assigned to one of the following 
groups: 0 score, negative expression; J -2 scores, weak expression; 3-6 
scores, moderate expression; 9 score, strong expression. 

Statistical analysis 

Intensity and density of PSCA protein and mRNA expression in BPH, PJN and 
Pea tissues were compared using the Chi-square and Student's f-test. 
Univariate associations between PSCA expression and Gleason score, clinical 
stage and progression to androgen- independence were calculated using . 
Fisher's Exact lest. For all analyses, p < 0.05 was considered statistically 
significant. . . 



hi .tp://\vww. wjso.com/cont cnt/2/ 1/13 



5/25''200(> 



■World Journal of Surgical One 



y | Full text | Prostate stem cell antigen (PS 



expression in ... Page 5 of 1 2 



Outline Results 



. Abstract 
Introduction 
Materials arid methods 
Results 
Discussion 
Competing inierests 
References 

Figures 



Representatives of PSCA 
JHC and ISH staining in Pea 
(A. IHC staining, B. ISH 
staining, x200 
magnification) 



PSCA expression in BPH 

In general, PSCA protein and mRNA were expressed weakly in individual 
samples of BPH. Some areas of prostate expressed weak levels (composite 
score 1-2), whereas other areas were completely negative (composite score 
0). Four cases (20%) of BPH had moderate expression of PSCA protein and 
mRNA (composite score 4-6) by 1HC and ISH. In 2/20 (10%) BPH 
specimens, PSCA mRNA expression was moderate (composite score 3-6), 
but PSCA protein expression was weak (composite score 2) in one and 
negative (composite score 0) in the other. PSCA expression was localized to 
the basal and secretory epithelial cells, and prostatic stroma was almost 
negative staining for PSCA protein and mRNA in all cases examined. 

PSCA expression in PIN 

In this study, we detected weak or negative expression of PSCA protein and 
mRNA (=S2 scores) in 7 of 9 (77.8%) low grade PIN and in 2 of 11 (18.2%) 
HGPIN, and moderate expression (3-6 scores) in the rest 2 tow grade PIN 
and 5 of 1 1 (45.5%) HGPIN. One HGPIN with moderate PSCA mRNA 
. expression (6 score) was found weak staining for PSCA protein (2 score) by 
1HC. Strong PSCA protein and mRNA expression (9 score) were detected in 
the remaining 3 of 11 (27.3%) HGPIN. There was a statistically significant 
difference of PSCA protein and mRNA expression levels observed between 
HGPIN and BPH (p < 0.05), but no statistical difference reached between 
low grade PlN and BPH (p > 0.05). 

PSCA expression in Pea 

In order to determine if PSCA protein and mRNA can be detected in prostate 
cancers and if PSCA expression levels are increased in malignant compared 
with benign glands. Forty-eight paraffin-embedded Pea specimens were 
analysed by IHC and ISH. Jt was shown that 19 of 48 (39.6%) Pea samples 
stained very strongly for PSCA protein and mRNA with a score of 9 and 
another 21 (43.8%) specimens displayed moderate staining with scores of 
4-6 (Figure 1). Jn addition, 4 specimens with moderate to strong PSCA 
mRNA expression {scores of 4-9) had weak protein staining (a score of 2) 
by IHC analyses. Overall, Pea expressed a significantly higher level of PSCA 
protein and mRNA than any other specimen category in this study (p < 
0.05, compared with BPH and PJN respectively). The result demonstrates 
that PSCA protein and mRNA are overexpressed by a majority of human 
Pea. 



Correlation of PSCA expression with Gleason score in Pea 

Using the semi-quantitative scoring method as described in Materials and 
Methods, we compared the expression level of PSCA protein and mRNA with 
Gleason grade of Pea, as shown in Table .1 . Prostate adenocarcinomas were 
graded by Gleason score as 2-4 scores = well-diflerentiation, 5-7 scores = 
moderate-differentiation and 8-3 0 scores = poor- differentiation |>'j. 
Seventy- two percent of Gleason scores 8-10 prostate cancers had very 
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strong staining of PSCA compared to 21% with Gleason scores 5-7 and 17% 
with 2-4 respectively, demonstrating that poorly differentiated Pea had 
significantly stronger expression of PSCA protein and mRNA than 
moderately and well differentiated tumors (p < 0.05). As depicted in Figure 
1, IHC and ISH analyses showed that PSCA protein and mRNA expression in 
several cases of poorly differentiated Pea were particularly prominent, with 
more intense and uniform staining. The results indicate that PSCA 
expression increases significantly with higher tumor grade in human Pea. 

Correlation of PSCA expression with clinical stage in Pea 

With regards to PSCA expression in every stage of Pea, we showed the 
results in Table 2. Seventy- five percent of locally advanced and node 
positive cancers (i.e. C-D stages) expressed statistically high levels of PSCA 
versus 32.5% that were organ confined (i.e. A-B stages) (p < 0.05). The 
data demonstrate that PSCA expression increases significantly with . 
advanced tumor stage in human Pea. 

Correlation of PSCA expression with androgen-, 
independent progression of Pea 

All 9 specimens of androgen-independent prostate cancers stained positive 
for PSCA protein and mRNA. Eight specimens were obtained from patients 
managed prior to androgen ablation therapy. Seven of eight (87.5%) of 
these androgen-independent prostate cancers were in the strongest staining 
category (score - 9), compared with three out of eight (37.5%) of patients 
with androgen- dependent cancers (p < 0.05). The results demonstrate that 
PSCA expression increases significantly with progression to androgen- 
independence of human Pea. 

It is evident from the results above that within a majority of human prostate 
cancers the level of PSCA protein and mRNA expression correlates 
significantly with increasing grade, worsening stage and progression to 
androgen-independence. 

Correlation of PSCA immunostaining and mRNA in situ 
hybridization 

Jn all 88 specimens surveyed herein, we compared the results of PSCA IHC 
staining with mRNA ISH analysis. Positive staining areas and its intensity 
and density scores evaluated by IHC were identical to those seen by JSH in 
79 of 88 (39.8%) specimens (18/20 BPH, 3 9/20 PJN and 42/48 Pea 
respectively). Importantly, 27/27 samples with PSCA mRNA composite 
scores of 0-2, 32/36 samples with scores of 3-6 and 22/24 samples with a 
score of 9 also had PSCA protein expression scores of 0-2, 3-6 and 9 
respectively. However, in S samples with PSCA mRNA overall scores of 3-6 
and in 2 with scores of 9 there were less or negative PSCA protein 
expression (i.e. scores of 0-4), suggesting that this may reflect 
posltranscriptional modification of PSCA or that the epitopes recognized by 
PSCA mAb may be obscured in some cancers. The data demonstrate that 
the results of PSCA immunostaining were consistent with those of mRNA 
ISH analysis, showing a high degree of correlation between PSCA protein 
and mRNA expression. 
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PSCA is homologous to a group of cell surface proteins that mark the 
earliest phase of hematopoietic development. PSCA mRNA expression is 
prostate-specific in normal male tissues and is highly up-regulated in both 
androgen- dependent and- independent Pea xenografts (LAPC-4 tumors). We 
hypothesize that PSCA may play a role in Pea tumorigenesis and 
progression, and may serve as a target for Pea diagnosis and treatment. In 
this study, 1HC and ISH showed that in general there were weak or absent 
PSCA protein and mRNA expression in BPH and low grade PIN tissues. 
However, PSCA protein and mRNA are widely expressed in HGP1N, the 
putative precursor of invasive Pea, suggesting that up-regulation of PSCA is 
an early event in prostate carcinogenesis. Recently, Reiter RE et al [1], 
using ISH analysis, reported that 97 of 118 (82%) HGP1N specimens stained 
strongly positive for PSCA mRNA. A very similar finding was seen on mouse 
PSCA (mPSCA) expression in mouse HGPJN tissues by Tran C. P et al [8]. 
These data suggest that PSCA may be a new marker associated with 
transformation of prostate cells and tumorigenesis. 



. We observed that PSCA protein and mRNA are highly expressed in a large 
percentage of human prostate cancers, including advanced, poorly 
differentiated, androgen-independent and metastatic cases. Fluorescence- ., 
activated cell sorting and cdnfocal/ immunofluorescent studies 
demonstrated cell surface expression of PSCA protein in Pea cells [9]. Our 
JHC expression analysis of PSCA shows not only cell surface but also 
apparent cytoplasmic staining of PSCA protein in Pea specimens (Figure 1). 
One possible explanation for this is that anti-PSCA antibody can recognize 
PSCA peptide precursors that reside in the cytoplasm. Also, it is possible 
that the positive staining that appears in the cytoplasm is actually from the 
overlying cell membrane |5). These data seem to indicate that PSCA is a 
novel cell surface marker for human Pea. 



Our results show that elevated level of PSCA expression correlates with high 
grade {i.e. poor differentiation), increased tumor stage and progression to 
androgen-independence of Pea. These findings support the original JHC 
analyses by Gu 2 et al [9], who reported that PSCA protein expressed in 
94% of primary Pea and the intensity of PSCA protein expression increased 
with tumor grade, stage and progression to androgen-independence. Our 
results also collaborate the recent work of Han KR et al [10], in which Ihe 
significant association between high PSCA expression and adverse 
prognostic features such as high Gleason score, seminal vesicle invasion and 
capsular involvement in Pea was found. It is suggested that PSCA 
overexpression may be an adverse predictor for recurrence, clinical 
progression or survival of Pea. Horo H et al 1 3 7 J used RT-PCR detection of 
PSA, PSMA and PSCA in 1 ml of peripheral blood to evaluate Pea patients 
with poor prognosis. The results showed that among 58 PCa patients, each 
PCR indicated the prognostic value in the hierarchy of PSCA>PSA>PSMA RT- 
PCR, and extraprostatic cases with positive PSCA PCR indicated lower 
disease-progression-free survival than those with negative PSCA PCR, 
demonstrating that PSCA can be used as a prognostic factor. Dubey P et al 
|].?] reported that elevated numbers of PSCA + cells correlate positively 
with the onset and development of prostate carcinoma over a long time 
span in the prostates of the TRAMP and PTE N \f- models compared with its 
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normal prostates. Taken together with our present findings, in which PSCA 
is overexpressed from HGPIN to almost frank carcinoma, it is reasonable . 
and possible to use increased PSCA expression level or increased numbers 
of PSCA- positive cells in the prostate samples as a prognostic marker to 
predict the potential onset of this cancer. These data raise the possibility 
that PSCA may have diagnostic utility or clinical prognostic value in human 
Pea. 

The cause of PSCA overexpression in Pea is not known. One possible 
mechanism is that it may result from PSCA gene amplification. In humans, 
PSCA is located on chromosome 8q24.2 [1], which is often amplified in 
metastatic and recurrent Pea and considered to indicate a poor prognosis 
[13-15]. Interestingly, PSCA is in close proximity to the c-myc oncogene, 
which is amplified in >20% of recurrent and metastatic prostate cancers 
[16,17]. Reiter RE et al [18] reported that PSCA and MYC gene copy 
numbers were co-amplified in 25% of tumors (five out of twenty), 
demonstrating that PSCA overexpression is associated with PSCA and MYC 
coamplification in Pea. Gu Z et al [9] recently reporteted that in 102 
specimens available to compare the results of PSCA immunostaining with 
their previous mRNA ISH analysis, 92 (90.2%) had identically positive areas 
of PSCA protein and mRNA expression. Taken together with our findings, in 
which we detected moderate to strong expression of PSCA protein and 
mRNA in 34 of 40 (85%) Pea specimens examined simultaneously by IHC 
and ISH analyses, it is demonstrated that PSCA protein and mRNA 
overexpressed in human Pea, and that the increased protein level of PSCA 
was resulted from the.upregulated transcription of its mRNA. 

At present, the regulation mechanisms of human PSCA expression and its 
biological function are yet to be elucidated. PSCA expression may be 
regulated by multiple factors [13]. Watabe T et al [19] reported that 
transcriptional control is a major component regulating PSCA expression 
levels. In addition, induction of PSCA expression may be regulated or 
mediated through cell-cell contact and protein kinase C (PKC) [20). 
Homologues of PSCA have diverse activities, and have themselves been 
involved in carcinogenesis. Signalling through SCA-2 has been demonstrated 
to prevent apoptosis in immature thymocytes [2]). Thy- 3 is involved in T 
cell activation and transducts signals through sre -like tyrosine kinases [22). 
Ly-6 genes have been implicated both in tumorigenesis and in cell-cell 
adhesion [T.i-'~2b]. Cell-cell or cell-matrix interaction is critical for local tumor 
growth and spread to distal sites. From its reslricted expression in basal 
cells of normal prostate and its homology to SCA-2, PSCA may play a role in 
stem/progenitor cell function, such as self- renewal (i.e. anti-apoptosis) 
and/or proliferation [ J J. Taken together with the results in the present 
study, we speculate that PSCA may play a role in tumorigenesis and clinical 
progression of Pea through affecting cell transformation and proliferation. 
From our results, it is also suggested that PSCA as a new cell surface 
antigen may have a number of potential uses in the diagnosis, therapy and 
clinicaL prognosis of human Pea. PSCA overexpression in prostate biopsies 
could be used to identify patients at high risk to develop recurrent or 
metastatic disease, and to discriminate cancers from normal glands in 
prostatectomy samples. Similarly, the detection of PSCA-overexpressing 
cells in bone marrow or peripheral blood may identify and predict metastatic 
progression better than current assays, which identify only PSA-positive or 
PSMA- positive prostate cells. 
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In summary, we have shown in this study that PSCA protein and mRNA are 
maintained in expression from HGP1N through all stages of Pea in a majority 
of cases, which may be associated with prostate carcinogenesis and 
correlate positively with high tumor grade (poor cell differentiation), 
advanced stage and androgen- independent progression. PSCA protein 
overexpression is due to the upregulation of its mRNA transcription. The 
results suggest that PSCA may be a promising molecular marker for the 
clinical prognosis of human Pea and a valuable target for diagnosis and 
therapy of this tumor. 
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Expression of superoxide dismutases, catalasc, and glutathione peroxidase 
in glioma cells, 

Zhong W , YaaT, LunJR, Qberley LW . 

cS1 2 242 R U?A Ch Lab0rat °^' ^Partment of "Radiology, The University of howa, Iowa 

Four primary antioxidant enzymes were measured in both human and rat glioma cells 
Both manganese-containing superoxide dismutase (MnSOD) and copper-zinc-containing 
superox,de d.smutase (CuZnSOD) activities varied greatly among the different glioma 
cell Imes. MnSOD was generally higher in human glioma cells than in rat glioma cells 
and relatively higher than in other tumor types. High levels of MnSOD in human glioma 
cells were due to the high levels of expression of MnSOD mRNA and protein 
Heterogeneous expression of MnSOD was present in individual glioma cell lines and 
™* b * 1™ l ° sub P°P ul at'ons or cells at different differentiation stages. Less difference 
in CuZnSOD, catalase, or glutathione peroxide was found between human and rat glioma 
cells, The human ghoma cell lines showed large differences in sensitivity to the 
glutathione modulating drugs 1 ,3-bis (2-chloroethyl)-l -nitrosourea (BCNU) and 
buth.on.ne sulfoximine (BSO). A good correlation was found between sensitivity to 

?™ ? aCt ! VltleS 0f 0313,336 in tl,ese cel ' ,ines - ° n 'y cell line Was sensitive to 
BSO and tins line had low CuZnSOD activity. 
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[Expression of human telomerase reverse transcriptase in cervix cancer 
and its significance! 

[Article in Chinese] 

XLL ZOilT, WuP, Xu&, Huaa^, UJCZ^ Li^O!, M 

Department of Obstetrics and Gynecology, Tongji Hospital, Tongji Medical College 
Huazhong University of Science and Technology, Wuhan 430030, China. 

^SS^o x,? inveSti S ale the expression of human telomerase reverse transcriptase 
(nibKl) mRNA and protein in cerv.x cancer, cervical intraepithelial neoplasia fClN) . 
and normal cervix. METHODS: Expression of hTERT mRNA and the other two subunits 
of telomerase human telomerase RNA component (hTR), human telomerase-associated 
protein (hTP I) was determined by RT-PCR in 3 cervix cancer cell lines, 2 diploid cell 
. - lines, 38 cases of cerv!x cancer, 16 cases of CIN and 20 cases of normal cervix 
Telomerase activity was also examined by telomeric repeat amplification protocol 
enzyme-luTked immunosorbent assay (TRAP-ELISA). Expression of hTERT protein was 
J e ' e ^ CCH liDeS 311(1 101 ^ of paraffinized cervix tissue sections, 
r c 7?*, c RT mRNA ex P res ? iotl wa « detected in all of the three cervix cancer cell 
lines, 8L6/0 of cerv.x cancer, 37.5% of ON, 5.0% of .normal cervix, while in neither of 
the two diploid cell hues. The other two subunits of telomerase were prevalently 
expressed in all of the cell lines and most cervix tissues. There was a strong correlation 
between hTERT mRNA expression and telomerase activity. Immunostaining also 
revealed that hTERT protein was expressed in all three cervix cancer cell lines 65 5% of 

°r™T er ' 28 ° % ° f CIN md 4 8% 0f noiTOal cen,ix - CONCLUSION: Up-regulation 
™ T Y , P y a " im P° rtant ro,e in *e development of CIN and cervix cancer 
could be used a s an early diagnostic biomarker for cervix cancer. 

PMBD: 16008894 [PubMeil - in process] 
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both general and selected populations. We will have 
to take the next steps with not only advanced 
computational tools in hand, but also with a measured 



and useful eye towards the clinical context, one that 
guides us towards the introduction and utilization of 
genetic information to improve health worldwide. 
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The value of 
microarray techniques 
for quantitative gene 
profiling in molecular 
diagnostics 

Stephen A. Bustin and Sina Dorudi 



There has been an explosion of interest in microarray technologies that allow the 
quantification of whole-genome RN A expression data. The apparent correlation of 
expression profiles with clinically relevant parameters such as disease outcome has 
raised expectations with respect to the clinical usefulness of the data generated. 
*Yet the accuracy and biological relevance of these data remain contentious, even 
in basic research applications. Therefore, numerous issues related to format, 
quality, validation and interpretation remain to be resolved before microarray 
profiling can become a diagnostic tool of clinical relevance for routine work. 

Published online: 8 May 2002 

The current fashion for functional genomics has put 
the spotlight on microarray technologies that are 
capable of comprehensive, quantitative RNA 



expression analysis, with their promise of a global 
approach to the quantification of gene expression [1 ] . 
However, there are two conflicting opinions on the 
value of micro arrays: they are either promoted as an 
exploratory-driven replacement for hypothesis-driven 
biology |21, or they are criticized as expensive fads [3] 
that cannot substitute for the old-fashioned, 
low-throughput approach to experimental biology |4]. 

Variability 

Unquestionably, microarrays can reveal associations 
between gene-expression signatures and the biology 
and outcome of disease, for example by identifying 
clinically significant subtypes of cancers [5,6] . This 
has raised expectations that the expression profile of a 
cell can be used as a diagnostic and/or prognostic aid in 
cancer management. However, although microarray 
experiments generate long lists of genes with altered 
expression, the interpretation of these data depends 
on thejudgement of the investigator performing the 
experiment |7]. Furthermore, a comparison of the same 
microarray experiment performed a few weeks apart 
can demonstrate a considerable lack of reproducibility 
[8]. This is exacerbated by an apparent lack of 
robustness of the microarray data, as demonstrated 
by the variability in the results obtained in different 
laboratories. Indeed, a comparison of data obtained in 
independent studies performed with different 
microarray platforms, for example in lung [9, 1 0| or 
colorectal [11-13] cancers, shows both similarities * 
and significant differences. Moreover, a comparison of 
47 and 98 genes identified from independent studies to 
be associated with metastasis 1 1 4 J 5] does riot reveal a 
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single common gene. Different methods of data 
analysis are one cause for the variation in raicroarray 
results in the membership of genes on one list but not 
on another from a similar experiment. For example, 
although patients with diffuse large B-cell lymphoma 
could be categorized into groups with differences in 
overall survival using unsupervised clustering 
analysis of expression profiles [5] , the prognostic power 
of these gene signatures was not confirmed in a more 
recent study that used a supervised clustering method 
of analysis [16]. However, 13 key genes defined in the 
second study were able to predict disease outcome in 
both patient series. Good experimental design is 
crucial, and misleading conclusions can appear in 
even the most rigorously refereed publications [17]. 

Tissue selection 

Normal biopsies contain a range of different cell types, a 
problem exacerbated in heterogeneous tumour samples 
that also include normal and inflammatory cells as well 
as diversely evolved cell populations. Furthermore, 
normal cells adjacent to a tumour could be phenotypically 
normal but genotypically abnormal, or could exhibit 
altered gene expression profiles owing to their 
proximity to the tumour [18]. This does not present a 
problem for conventional histopathology but expression 
profiling of such biopsies provides a composite of the 
whole population, and cannot identify expression 
limited to subpopulations or sections of the biopsy. 

The accuracy of quantitative gene profiling is 
critically dependent on microdissection, especially on 
the laser capture microdissect ion (LCM) of individual 
cells or cell populations [ 1 9]. However, because the 
amount of mRNA becomes a limiting factor, any 
amplification procedure has the potential to distort 
the original mRNA distribution pattern [20)/This is 
important since quantitative expression profiling 
relies on the preservation of mRNA species 
distribution in the tissue of interest. New procedures 
for target amplification are being developed that 
appear to maintain comparable expression profiles 
between amplified and non-amplified target [2 1 ,22), 
but results obtained using the different protocols 
have not been tested for comparability. In addition, is 
inclusion of labour-intensive and costly steps such as 
LCM a realistic proposition for routine clinical 
diagnostics? Even if it is, expression-profile analysis 
might only refine, rather than revolutionize, current 
histopathological staging information [23]. 

Compatibility 

Fundamental issues of protocol and platform 
comparability remain to be addressed. There is no 
standardization of protocols or reagents for sample 
acquisition, or RN A or cDN A target preparation. 
This, together with the lack of a universally accepted 
measure of quality assessment, virtually ensures 
significant variability with respect to target 
preparation. The microarray measurement process 
itself relies on consecutive steps that influence data 



reproducibility and comparability: array construction, 
target labelling, probe-target hybridization, signal 
detection, normalization and interpretation [24]. The 
array user generally has no control over quality-control 
issues affecting fabrication of the DN A microarray 
platform; however, commercial cDNA microarrays and 
oligonudeotide chip-based systems have had significant 
errors in sequence identity [2 5,26) . The user does have 
control over the implementation of target labelling, 
hybridization and detection protocols, but this is likely 
to result in additional sources of variability; as there 
are uncertainties associated with different measuring 
instruments and procedures [27]. Unfortunately, there 
is no consistency or standardization with respect to 
any single component to aid comparative analysis. 

Analysis 

The meaningful interpretation of results depends on the 
appropriateness of the techniques employed for data 
analysis [28], and the accuracy of microarray profiles 
requires correct study design, well-characterized 
statistical goals and careful attention to old-fashioned* 
and proven statistical principles [29]. Since array data 
typically consist of parallel expression measures for 
thousands of genes, a combination of analysis of 
variance (ANOVA) analysis and bootstrap re-sampling 
of the residuals has been suggested as the most 
appropriate way of obtaining confidence intervals of the 
differences in expression levels for each gene [30]. On the 
other hand, lack of power is an important limitation, as 
experimental replication at the level of single genes is 
low and current statistical methods used in microarray 
analysis have been severely criticized [31]. One can 
apply stringent criteria to avoid false-positive results, 
but this is usually at the expense of increasing the 
false-negative rate [32]. Sophisticated software tools 
that identify common patterns of expression have been 
developed and most data are analyzed using clustering 
techniques. These identify distinct expression patterns 
and assume functional relationships by grouping genes 
with similar expression patterns ('guilt by association*), 
but do not address the relevance of any interrelationship 
of their expression levels [33]. Analyses also need to take 
into account the significant specific biases that exist 
among probes used to interrogate the same gene |34]. 

Normalization 

All procedural errors are compounded by continuing 
uncertainties in the normalization step. Signal 
intensity is the most critical parameter that influences 
the reproducibility, specificity and sensitivity of 
microarray experiments [35], but the low dynamic 
range of microarrays [36] can bias expression profiles 
towards low-abundant targets [37]. Furthermore, the 
expression of housekeeping genes, which are 
frequently used as internal standards, is variable [38] . 
Therefore, validation of expression trends in an 
unrelated, secondary assay remains a critical 
requirement, and is best accomplished using the 
real-time reverse transcriptase polymerase chain 
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Table 1. Microarray databases and analysis tools 



Function 



Organization 



URL 



MIAME 

Data exchange protocols: MAGE-ML 
Array-Express database, submission and 

analysis tools 
GEO database 
ArrayDB database 
GeneX database and tools 
Protocols and tools 
Software and tools 
Software and tools 
Reagents 



MGED Group 
MGED Group 
EMBL-EBI 

NCBI 

NHGRI 

NCGR 

DeRisi lab, University of California (San Francisco) 
Biodiscovery 

University of California (Berkeley) 
Stratagene 



www.rnged.orgAA/orkgroups/lvllAME/miame.htrnl 

wvvw.mged.orgAA/orkgroups/MAGE/rnage.htrnl 

www.ebi.ac.uk/microarray 

www.ncbi.nlm.nih.gov/geo 

http://genome.nhgri.nih.gov/arraydb 

www.ncgr.org/genex 

www.microarrays.org 

www.biodiscovery.com 

http://rana.lbl.gov 

www.stratagene.com 



Abbreviations: GEO, gene expression omnibus; EMBL-EBI, European Molecular Blotogy Laboratory-European BioinformaUcs Institute; MAGE-ML, microarray and gene- 
expression markup language; MGED, microarray gene expression data; MIAME, minimum information about a microarray experiment; NCBI, National Center for 
Biotechnology Information; NCGR, National Center for Genome Resources; NHGRI, National Human Genome Research institute. 



reaction (RT-PCR) with its high dynamic range, 
bearing in mind the limitations of the technique [39]. 

Relevance 

How relevant are the data being obtained? DN A 
array work is predicated on the assumption that 
quantification of cellular levels of mRNA transcripts can 
function as a surrogate for measuring the expression 
or activity of the protein, which is the final functional 
product of gene expression. However, the correlation 
between the number of mRNA and protein molecules is 
generally not strong enough to predict one value from 
the measurement of the other [40,4 1 ] , as there is at least 
a ten-fold range of protein abundance for mRN As of a 
given abundance [42]. Certainly, gene expression levels 
perse tell us very little about the biological function of 
the gene, its potential clinical impact or its suitability as 
a drug target. A complete picture can only be obtained 
by assessing multiple biochemical parameters, as was 
shown recently in a report that combined microarrays 
with quantitative proteomics and databases of known 
physical interactions to build a biochemical pathway, 
test the hypothesis and refine the model generated [43]. 
Furthermore, most microarrays are not exhaustive 
and report on selected subsets of genes. Therefore, it is 
possible that relevant information will be missed and 
that, at this early stage, more-exhaustive analyses still 
need to be carried out using arrays containing as many 
genes as possible. Third, most arrays are no't designed to 
discriminate between alternatively spliced transcripts 
of the same gene, or between closely related family 
members [44]. On the other hand, as several 
microarray experiments have generated clinically 
relevant quantitative gene profiles, this question might 
not be quite as relevant for molecular diagnostics as it 
is for mechanistic research applications. 

Minimum information about a microarray experiment 

Easy access to DNA sequence databases and the ability 
to mine sequences gathered from many disparate 
sources has been a key contributor to the success of 
the Human Genome Project. mRNA profiling data 
^typically consist of many thousands of measurements 
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for each array and, unlike DNA sequence data, depend 
on the context of the individual biological sample and 
the experimental conditions under which the sample 
was taken. Hence, access to raw data and the 
experimental conditions resulting in these data are 
prerequisites to enable comparative analyses as well 
as automated data mining [45]. This requires the 
adoption of standards, although it is worth pointing out 
that too-early standardization might establish a 
standard that restricts rather than enables [ 1 7) . 
One proposed solution is adherence to a Minimum 
Information About a Microarray Experiment (MI AME) 
standard for array experiment annotation and data 
representation [46], to provide the minimum 
information required for unambiguous interpretation 
and independent verification of microarray data. 

Repositories 

Although it is obvious that the annotated microarray 

data will have to be stored in accessible locations, 
« 

it is not surprising in the light of the many unresolved 
items associated with these data that the usefulness 
of such repositories has been questioned [4 7] . Some 
repositories state that they are Ml AME compliant 
(e.g. Array-Express), whereas others support the 
spirit of MIAME but are still considering the meaning 
of MIAME compliance (e.g. GEO). However, anyone 
used to mining public DNA sequence databases is 
aware of the numerous inconsistencies and errors 
associated with the data deposited therein. Hence, it 
will be crucial to have a readily appbcable and 
transparent means of validating data quality [48]. 

Ontologies and toolsets 

Data interpretation represents an enormous challenge 
for bio informatics and, to date, there is no consensus on 
how to merge data from different array types, let alone 
how to include data from alternative gene expression 
profiling approaches, such as differential display and 
serial analysis of gene expression. Hence, there is an 
urgent need to apply agreed ontologies (i.e. sets of 
well-defined terms with well-defined relationships) to 
the description of experimental conditions, samples 
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and their treatment sources [49] . Also required are 
integrated toolsets that permit users to download 
their data into the database, and to use common 
data-exchange protocols such as Microarray Annotation 
Markup Language (MAML) to communicate with 
other databases in order to refine their own array 
data or for more-general autom ated data mining. 
The National Centre for Genome Resources' GeneX™ 
project aims to create an internet-available relational 
database of public data derived from multiple 
technologies, and standardization programmes have 
been undertaken under the umbrella of the Microarray 
Gene Expression Data (MGED) group. Table 1 
summarizes the main public-repositories analysis 
tools and a source of reagents for microarray analyses. 



Conclusion 

Microarrays are already having a major impact on 
cancer biology, pharmacology and drug development 
and, with their rapid development, new 
technologies, methods of analysis and applications 
are emerging continuously However, the major 
limiting factor in their further application is the 
current lack of data comparability, which is 
essential for appropriate comparisons between 
different array data. Therefore, standards for 
rigorous data annotation, agreed ontologies and 
data deposition in public repositories must be 
agreed if this technology is to be of practical use and 
become a routine tool for diagnostic and 
therapeutic applications. 
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17.1 Introduction 

There are now numerous organisms whose genome sequences are known, for 
example Arabidopsis (Arabidopsis Genome Initiative. 2000), Drosophila (Adams 
etai 9 2000), human (International Human Genome Sequencing Consortium, 
2001) and rice (Yu et a/., 2002). Hie prediction of open-reading frames from 
these genomic sequences has enabled the comprehensive identification of many 
putative protein sequences. These proteins can be arranged into three categories, 
namely those of known function, those with recognisable motifs and hence a 
vague idea of function, and those with no sequence similarities to any pro- 
tein (Gabor Miklbs and Malenszka, 2001). Many proteins reside in this latter 
'functional vacuum' which could represent as much as 30% of the predicted 
proteins. Determining protein function is key to understanding cellular mecha- 
nism. Studying how protein expression is modulated in response to a given set 
of circumstances, such as infection, disease, developmental stage, senescence or 
response to drugs, will facilitate the elucidation of disease pathways and thus 
provide a mechanism for diagnosis and therapy. 

DNA chips (mRNA profiling studies) can contribute to the study of gene 
expression in response to a particular biological perturbation. However, the 
extrapolation that changes in transcript level will also result in corresponding 
changes in protein amount or activity cannot always be made, lb understand 
fully, we need integrated data sets from a variety of protein expression studies, 
providing information on relative abundances, sub-cellular locations, protein 
complex formation and the profiling of isoforms generated by either alternate 
mRNA splicing or post-translationai modifications. Proteomics is the word now 
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commonly used to describe the discipline associated with the acquisition of 
these data sets. 

17*2 Definitions and applications 

Before embarking on an overview of the techniques used in the field of pro- 
teomics, it is necessary to define a few terminologies; proteomics, functional 
genomics, structural genomics and post-genomics are terms that have crept into 
the scientific vocabulary with alarming stealth and are used freely, and in some 
cases interchangeably, especially in cases where the use of such 'buzz words* 
is likely to increase the attendance at departmental seminars. 

• Genome: A genome represents the entire DNA content in a particular cell, 
whether or not it is coding, non-coding, or is located either chromosomally 
or extra-chromosomally. 

• Genomics: Genomics is the study of the genome, interrogating the complete 
genome sequence using both DNA and RNA methodologies. 

• Proteome: The proteome represents the complete set of proteins encoded by 
the genome. 

• Proteomics: Proteomics is the study of the proteome and investigates the cel- 
lular levels of all the isofoims and post-translational modifications of proteins 
that are encoded by the genome of that cell under a given set of circumstances. 

• Functional genomics: This is the study of the functions of genes and their 
inter-relationships. 

Whilst a genome is more or less static, the protein levels in a particular cell can 
change dramatically as genes get turned on and off during the cell's response to 
its environment The proteome originally was defined over seven years ago as 
'all the proteins coded by the genome of an organism* (Wasinger et aL f 1995). 
Nowadays the term 'proteomics* is used more widely and implies an effort to 
link structure to function by whatever means are appropriate. We can expect the 
definition to change again with time as the field and the investigator's view of 
it evolves. 

17*3 Stages in proteome analysis 

Proteomic analysis can ascertain function either by looking for changes in the 
expression of either all or a subset of proteins, or by identifying binding partners 
for particular proteins and seeing how their interaction is affected by biological 
perturbation. Whatever the rationale of the investigation, or the number of pro- 
teins involved, the study of the proteome can be broken down into the following 
stages of analysis. 
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Table 2. Posterior probability of expression In sample tissue 

Re P |icate 1 Replicate 2 
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smaller It is unclear why replicate 2 is so different from the 
others, but it serves to remind us that replication does not ensure 
duplication of results, a fact that cannot be quantified when 
rephcationis not used. We also note in Table 1 that the estimate 
of p varies greatly from one replicate to another. Recall in our 
controlled eyeriment that only 32 of the 288 genes (fraction 
U.lll) should be classified as expressed Thus, the estimates of 
p. provided by replicates 1 and 3 are much too large. 

->\ t i?*? neXt t0 estimates of the posterior probabilities (Eq. 
2£ Table 2 summarizes a representative fragment of the results 
We see generally that the posterior probability clearly indicates 
whether a gene is expressed in the sample tissue and that the 
results are quite uniform across the three replications. There are 
occasions, however, as illustrated by the results for gene no. 17 
where the three replications do not give uniform results. Rep- 

n^nnnw° r i hlS g6ne ? iveS 3 ve ry large posterior probability 
(1.0000) to the expression event %Jr# = y, whereas the other 
two replicates give smaller probabilities (0.1404 and 0.0000). 

2,4. Checking the Consistency of Results from the Three Replicates. 
We next study the extent to which the three replications 
analyzed separately, provide consistent classification with re- 
S P^ t . t0 P ne expression. Using the posterior expression prob- 
abilities (such as those in Table 2) for each replicate /, we will 
classify a gene* as being expressed if Pr^Y^ = y ) i s larger than 
U.i and as not being expressed otherwise. This classification is 
done independently for each replicate. 

^ r T ^ bl t 3 contains the results of ^is classification process. Table 
. Le P shcrws a three-way crossclassification, whereas, for ease of 
interpretation, Table 3 Right shows the three two-way crossclas- 
sitications corresponding to the three pairs of replicates If the 
replicates were perfectly consistent, only two cells of Table 3 Left 
would have counts, namely, the cell counting unexpressed genes 
in all three replicates and the cell counting expressed genes in all 
three replicates. In fact, however, all of the cells in the table have 
counts, and four of these are sizeable. This is evidence that the 
replicates are not perfectly consistent. As one illustration of 
inconsistency, we note in Table 3 Left that 23 genes classified as 

Table 3. Comparing results of a separate analysis for each replicate 
Three-way crossclassification 



genes g, for replicates / «= 1, 2, 3. 

expressed in replication 3 are classified as unexpressed in 
fcLT,h ™ d ? T ther mustrat ion, we note in Table 3 
S£ r r f nmS ^^ t genes dMsWed as expressed in the 
three replicates are 55, 36, and 58, respectively. M 32 of the 288 
genes should be classified as expressed, we are again remkded 
by these results that replicates 1 and 3 are providing a large 
number of false positives. B 

T i? £?-l el ^i 001 "" data Table 3 , we again postulate a prior 
probabihty p that any given gene is expressed in the sample 
tesue. As discussed earlier, mRNA in the tissue must clear t£o 
further hurdles to appear "expressed" on the microarray slide. It 
must be converted to probe and hybridized to the cDNAs that 
are deposited on the slide. The stochastic behavior of this 
mechanical process can cause replications to differ. We let r, 
denote the tondmonal probability that a gene will be classified 
as expressed" in replicate/, where/ = 1, * . , j and assume that 
the corresponding conditional events are independent from one 
replicate to another. We also postulate that, by chance, a gene 

Z ™' ex P reK?ed 'n the sample tissue may appear exposed 
Z!tt< ru becaU *? ° f background noise or otheV experimental 
artifacts. The conditional probability of such a spurious indica- 
tion will be denoted by sj for the/th replicate,/ = 1 / Aeain 

we assume that these conditional events are independent aSong* 
the replicates. We can now estimate these probabilities from the 

IT A T a 7 >. le 3 - the method of ™^nium likelihood 
applied to the following likelihood function 



where 



Replicate 3 



1 1 Kwgi, • - . , hv) = n ipp s + (i - p)e„), 



p i= 11 [rp'(l -/-,)' --«'], 
i" 1 

J 

j « i 



Three two-way crossclassifications 



[3] 



Replicate 1 



U 
E 

Total 



Replicate 2 



Replicate 2 Replicate 2 Replicate 3 Replicate 3 

U E Total Replicate 1 U E Total Replicate 1 U E Total Replicate 2 U E Total 



207 2 23 1 233 U 230 3 233 U 

'H 2 3 31 55 E 22 33 55 E 

226 4 26 3 2 288 Total 252 36 288 Total 



209 24 233 U 
21 34 55 E 
230 58 288 Total 



226 26 252 
4 32 36 
230 58 288 



r . ... — „ iju _>o /loo 

o.^::^z^i^° ,or three '^^^ 

U, unexpressed; E, expressed. 
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and y denotes the indicator variable for whether eene p » 
dasafied as expressed in replicate / = 1 

SS3Z Tt* oj 1 ^** L' - 3! and the nSZ 
r^f^n ^ |lma n t ^ ba8ed 011 in Table 3 are j^Hn 

0.103. We note that r% vis n-.n-ericn!:- so dr iC - to r e v<- . 

u w,iS sct t0 1 die computation. " *"* " 

The probability estimates reveal several points of interest for 
SKdXSSt ^ th l experiment dign 'pur^efy 



Table 4. Analysis of the combined data f rom all three replicates 

Parameter Estimate 

: Estimate Est.Std.Err. 



P 

f': 7 

oi 



0.118 

-0.204 
'(.524 
0.044 
0.126 



0.013 

0.CO9 
0.058 
0.0O3 
0.036 



fraction t - »TTl VU "," "R™™. w hirh is tho oxoct 
£^T P ,T.!..' , H ? nce ' th? ***«kal analysis has rehabrv 
reproduced this fraction in ttar admsie of 4 Second "'me 

fenT^ ^ fry f h °^ that W * 18 not a that anSSLtd 

pXb£tv nf H 25 -' 5611 aS on the slide, aT^e 

probability of doing so can vary from one experimental execu 

nidications of genes (i.e., false positives) can occur £ a 
frequency as large as 10% in a single experiment 

now seeK to describe the microarray data from the three 

nSlm^el £^£2? ,mear ^ " 3 1 Statis - 

Y *J + ft +V gl + Bei , for^-l,...,c,y-l /. 

W) 

S E(yK,) T^+^ + ft+'lto" the mean log-ratio for eene 
gunder experimental conmaon;^^ 

KSS* or 311 8 e "« and experimental conditions 

conEn/ ?„H 1 gCne - ft ft k ^ main Cffect for «P™ta! 
condition;, and y^is aninteraction term that reflects differential 

fhTnS? ,0n f ° r under experimental conZon ; In 

replicate;. The term Bg} is a random error which, by definition 
has a mean of zero. We assume that the error terms are 
independent but we have no need in this study to make Z 
assumption about their distributional form Y 
define rT ng ^ 2*™ assum P ci °" for individual replicates we 

whethTr a f tW ° dlStU,ct distr i«>utions according to 

whether gene g is expressed in the sample tissue ("event f \ Th. 
d.str.bunon of a, therefore foUows a L5S£dSr2i 

M-pfeW +(1 -pl/Wa) r S , 

rnmodel" 1°°^ ^"V" ° fy for the variable Nation 

Sen el VT^ 3 ' * arC inde P e »dent effects for 
mean ^ ° Ur 5tudy b * "bating the overall 

mean, the mam effects for genes, and the main effects for 
exper.mer.tal replicates in model 4, as follows 



JEES^ZtSZ™ derived .romthe 



A = Y~ 



Y.. 



[6a] 
[6b] 
[6c] 



The f ? effCCtS in m0del 4 are -"dom. 

ine estimates in Eq. 6, however, are standard fixed-efff-n 
es imates. We choose these estimators because they are in [er 
ently free of any distributional assumption. In'paSa the" 

wl now 1 thl aSS t Umpt !° n of a , norma ' -""ture distribution. . 
distXtio . in P„ ^ h I'" 12 ' 6 the P aram «ers of the mixture 
oistnbutioi, ,n Eq. 5 and subsequently use them again to examine 



2.6. Analysis Results for the Combined Data. As the main effects for 
genes are now estimated from three replicationsXrSSe 
more sharply delineated than they are in Table 1 where Z 
parameter estimates are calculated'separately for eachTep Lt 

mean parameters ^ and are well separate7¥n1ra Se 
Z££ anT teS ^ "f ™ 8maller tha n those cSid* 
^ 3 !. ? ^ SCS aS lKted in Tab,e J - I" f a «, they would be 
expected to be smaller by a factor of about 3 FoS the 
estimate of variance parameter smaUer than LVo?o|tSs 
difference is expected by the fact that, in the event of no 
expres on (..e event %), variance parameter oi VSEc? the 
In^ttSnlf 1 : n g - rati ° of ba ^°-dnoise aSSSJl 
Ser ref L/tf^T ex P ress,on < i e - ^ent^), variance param- 
eter reflects two sources of variability: (/) the loe-ratio of 

va J he P° steri °r Probability that gene g is expressed, given the 

y repl L^d bv" a ^ ^ **» b ' ^ 4^JS 

Hn T fn T ^ « heSC P° sterior Probabilities are M either 
dose to l or close to zero. In fact, classifying the genes accordJne 

2ofthe288gene S are incorrectly classified as to whether thev are 
expressed. Hence, basedon the combined data, the SificTtion 

g ;«°nt T5 o an1 s i«r tiv r and no faise S 

genes nos 75 and 185 are classified as expressed when thevwere 

trom Table 3 that the individual replicates were far from perfect 
in their ability to classify genes. periect 
Fig. l a and b show normal probability plots of the & for the 
genes classified as expressed^ and unexpressed respectivelv 
AccorCng t0 th ixed nomal mod ^ • ^ fhould 
both be normal ,f the classification were perfect. The evidence 
seems quite supportive of the normality «sunuJ?„ Z boS 
plots. For the genes classified as expressed, there is some 
ev.dence of values being clustered. For those ckssified as 
unexpressed, there may be a little contamination of the norma 
distribution, leading to a few outlying observations relative to a 
pure normal distribution. Fig. 2 shows an over! y of at og am 

SlS^fa d I^ s ^* , i i no ™K■ , probability den ^ 

aesenbed n Eq. 5, based on the parameter estimates in Table 4 
A co mpanso „ of the histogram and the density function shows 
hat the mixed normal model is quite reasonable Both the 
histogram and density function show that the expressed and 
Nore P hTw e e d v 8 er n rhe°r Py ^ns SSScS 

^n^ittiar d ' fferenCe in Variability ° f the 
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- inconsistent expression ^^^^T^.? 1 
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2.7. Rsjiabiljtif as a Function of the Number of Replicates How does 
^ f e «P«nment, a partial answer fa orovided hv 



fciv«*e Normal 
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.452312 -f 
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kwerse Normal 



J51465 



acsrjssr Assr co " di,ion! " d - ■» 
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Fig. 2. Overlay of a histogram and mixed 
main effect. 
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normal p.d.f. for gene expression 



IS^d Si ^ e r heatCS 1,1 «!**■«■ Thefalse- 

KSSSi ^^ an ^ ,tud,,SL Second ' 11,6 Aow. 
now classification precision varies with the number of reolicates 

I 3 ^ A a,M ¥ 1 *™pli»«» 3 alone misclassified 90%. of 
eplSt« We Df m ^ rehab ! , 'ty ^crease with the number of 

with increasing replication unless all sources of exntrilm^ 

sruay will depend on many factors, including the type of arrav 
eqiupment, laboratory technique, and the conditionand prep- 
aration of samp es. If experimental resources anj tim St 
we see potential benefit from using a minimum of Sree S 
cates because three or more classification outcome! offer Ae 
Posabihty of tnangulation of results. A comparison of cM 
cation outcomes for all possible combinations^ replicate * Tis 

stow 5sr of replicates in Tab,e 3 for «S3TJS 

ShfTt, k °"? ° r m0re re P^ates are rogues. A judmen 

Kepl,cates might also be used with a majority votine rule to 
decide whether a gene is expressed. Such a rui Tis no beneficial 
m th,s expenment but might be useful in some a PP Tic at L„ s 
Concluding Discussion 

location of spots on the slide. Variability fr™ li ' 
such as multiple preparations of X ^J^TSS? 
or arrays generated at different times, has not been adm S' 

£r Six's™ - ™- - 
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Table 5. Misclasslflcatlon percentages for different combinations of replicates 

Classification Combination of Replicates 

Outcome 

False positive, % 

F« negative, V* 
ftfisclassified, % 



(1) 


(2) 


(3) 


(1,2) 


(1,3) 


(2,3) 


(1. 2, 3) 


8.3 


1.4 


9.0 


1.0 


2.1 


07 


0.7 
0.0 




G.O 


0 0 


0.3 


C.3 


G.O 


8.7 


1-4 


9.0 


1.4 


2.4 


0.7 


0.7 



— ■ Sftr . nnri, in modeling tho rand o m vdiidliuii in gene expres- 
sion, we have found in any-single replieate the probabiiity may 
be as large as 5% that mRNA in the sample tissue either fails 
to be represented as probe or, if it is represented as probe, fails 
to be hybridized to the cDNAs that are deposited on the slide 
(false negatives). Also, the probability may be as large as 10% 
that ghost genes are expressed (false positives). When mi- 
croarray data from several replications are combined, we have 
shown that, quite reasonably, a more accurate genetic picture 
is produced with a reduction of false positives and false 
negatives. Third, in the process of analyzing these experimen- 
tal data, we introduced statistical methodology for microarray 
data. We have modeled gene expression measurements by 
using a mixture of normal distributions. From this mixture 
distribution, a posterior probability is calculated from the 
microarray reading that quantifies the likelihood that the gene 
is truly expressed in the tissue. This probabiiity can be used to 
classify whether a gene transcript is present. A two-way linear 
statistical model is proposed for microarray data that can span 
a range of experimental conditions. 

Although our results depend on specific instruments and 
techniques (e.g., RNA extraction method, probe synthesis and 
labeling, hybridization, array construction, use of glass slide as 
solid support, and use of only one channel Q3), the statistical 
methods we propose can be extended to accommodate more 
general settings; For example, the methods can be used for 

1. Eisen, M. (1999) SCANALYZE User Manual (Stanford Univ., Stanford CA.) 
Ver. 2,32. " 



experiments that use both channels Cy3 and Cy5. If the two- 
channel system is used in the stahdard way with mRNA ffom a 
test sample and a reference sample, differential gene expression 
becomes relevant (Le., the interaction term of the two-way linear 
model). As there are then three states of expression (unex- 
pressed, differentially expressed in favor of the test sample, and 
differentially expressed in favor of the reference sample), a 
three-component mixture model applies. The statistical methods 
ahio extend to data sets from experimental designs that involve 
additional sources of variability, such as variability introduced by 
multiple preparations of probes. 

The main lesson to be learned from the study is that replication 
in microarray studies is not equivalent to duplication and hence 
is not a waste of scientific resources. Experimental replication is 
essential to reliable scientific discovery in genetic research 
Understanding the sources of noise in the process, controlling it 
and, if possuble,,eliminating it, are essential to drawing reliable 
inferences. By pooling data from replicates, we can provide a 
more reliable analysis of gene expression data. 
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Increasing numders of human Dis- 
eases, both acquired and genetic, are 
being considered to be based at least 
in part on alterations in DNA se- 
quence. For most diseases, inheritance 
aTtd acquisiticm are likely to be com- 
plex and polygenic. The efforts of the 
Human Genome Project to elucidate the 
structural generic background by iden- 
tifying the chromosomal positions and 
genomic organization of between ap- 
proximately 30000 and 35000 human 
genes are nearly complete. 1 Based on this 
structural knowledge, a byproduct 
should be a better "scaffolding" to help 
link specific genes to susceptibility to 
various human diseases. However, to un- 
derstand how the products of these ge- 
netic linkages work together to orches- 
trate the initiation and progression of 
particular complex diseases, there will 
be a need to apply a functional genetic 
rather than a structural genetic ap- 
proach. 2 - 3 

Until recently, functional genetic 
studies have generally been of limited 
scope, only able to elucidate the role of 
1 or a few genes at a time in 1 system. 
Information on the specificity and rela- 
tive abundance of expression prod- 
ucts has traditionally been obtained by 
techniques such as RNA Northern blot 
hybridization and ribonuclease protec- 
tion assays. Somewhat more sophisti- 
cated methods, such as differential dis- 
play* and Serial Analysis of Gene 

2280 JAMA, November H, 2001— Vol 286. No. 19 (Reprinted) 



DNA microarrays represent a technological Intersection between biology and 
computers that enables gene expression analysis In human tissues on a ge- 
nome-wide scale. This application can be expected to prove extremely valu- 
able for the study of the genetic basis of complex diseases. Despite the enor- 
mous promise of this revolutionary technology, there are several Issues and 
possible pitfalls that may undermine the authority of the mlcroarray plat- 
form, We discuss some of the conceptual, practical, statistical, and logisti- 
cal Issues surrounding the use of microarrays for gene expression profiling. 
These Issues Include the Imprecise definition of normal In expression com- 
parisons; the cellular and subcellular heterogeneity of the tissues being stud- 
led; the difficulty In establishing the statistically valid comparability of ar- 
rays; the logistical logjam In analysis, presentation, and archiving of the vast 
quantities of data generated; and the need for conflrmatlonal studies that 
address the functional relevance of findings. Although several complicated 
Issues must be resolved, the potential payoff remains large. 
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Expression, 1 have been used to screen 
larger numbers of complementary DNA 
(cDNA) clones. However, technical 
limitations render these techniques 
nonconducive to large-scale genetic 
survey. 

To this end, a powerful new tech- 
nology is emerging, using hybridiza- 
tion to nucleotide arrays, the so-called 
gene chips. 6,7 This technological inter- 
section of biology and computers en- 
ables the reliable screening of a vast 
number of genes simultaneously and is 
amenable to automation. On a nylon 
membrane or glass surface, gene- 
specific cDNAs can be spotted, or oli- 
gonucleotides can be synthesized in situ 
by a combination of photolithography 
and oligonucleotide chemistry. This 



permits simultaneous monitoring of the 
expression of thousands of genes in a 
single step. Individual chips can be cus- 
tomized to include any chosen set of 
fully or partially characterized ge- 
nomic or expressed sequences. Chips 
can monitor over 50000 unique se- 
quences. The power of these chips lies 
in the potential for comparative expres- 

Author Affilattons: Department of Dermatology, Weill 
Medical College of Cornell University, New York, NY. 
Financial Disclosures; Dr SInha received reimburse- 
ment (or material on hair loss prepared for a Heal- 
thotogy Web cast and received funding from Merck 
for research on microarrays involving alopeda. 
Corresponding Author and Reprints: Anlmesh A. 
SInha, MD, PhD, Department of Dermatology, Weill 
Medical College of Cornell University, 510 E 70th St 
IO702, New York, NY 10021 (e-mail: ans20O3©med 
.comell.edu). 

Toward Optimal Laboratory Use Section Editor: 

Oavid H. Mark. MD, MPH, Contributing Editor. 



©2001 American Medical Association. AH rights reserved. 



Downloaded from www.jania.cora at U.S. Patent and Trademark Office, on June 6, 2006 



DNAM1CR0ARRAYS 



Definition 
of Normal 



vh:'..' ™; V.-:". 



sion studies in diseased vs normal 
samples, and in documenting changes 
at different stages during the natural 
course of the disease or in response to 
treatment. It provides the researcher 
with a new arsenal to analyze under- 
Lying pathomechanisms on a grand scale 
and also to review the rationale of thera- 
peutic concepts. 

However, despite the enormous po- 
tential of this revolutionary technol- 
ogy, there are several issues and pos- 
sible pitfalls that attenuate the power 
of microarrays. First, the definition of 
normal in expression comparisons is 
neither precise nor unambiguous. Sec- 
ond, the heterogeneity of the tissues 
being studied complicates the mean- 
ing of the expression profiles. Third, the 
statistically valid comparability of ar- 
rays is an unresolved problem. Fourth, 
the vast quantities of data create a lo- 
gistical logjam for analysis, presenta- 
tion, and archiving. Finally, confirma- 
tional studies are needed to corroborate 
the biological significance of microar- 
ray data (FIGURE 1). 

TROUBLE WITH NORMAL 

The standard normal vs diseased tis- 
sue type of comparison, which is the ba- 
sic design foundation of profiling stud- 
ies, may be more quicksand than 
bedrock. Normal is not so easy to de- 
fine — neither is diseased. Gene expres- 
sion in normal tissue is likely to be de- 
pendent on several factors involving 
patient and sample variation. These fac- 
tors will also have an impact on expres- 
sion profiles of diseased tissue. 

Patient Variation: Ethnicity, 
Sox, Age, Genetic Background, 
Disease States 

The ethnicity, sex, age, and genetic back- 
ground of a patient are likely to affect 
the gene expression profiles of many tis- 
sues to varying extents. 8,9 A simple ex- 
ample is provided by the expression pro- 
files of genes involved in scalp and body 
hair follicle activity, which can be ex- 
pected to vary over a normal range un- 
der the influence of all of these sources 
of patient variation. The effects of these 
parameters on gene expression are likely 

©2001 American Medical Association. All rights reserved. 



Figure 1. Possible Pitfalls in Microarray Gene Expression Profiting Analysis 
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There are ksuet that complicate each step of microarray gene expression analysis. 



to be subtle bu t pervasive, not fully un- 
derstood at this time, and quite prob- 
lematic for defining normal. 

The presence of disease in a subject 
who is the source of tissue for control 
purposes, presents further potential 
variabilities. For example, there maybe 
a significant difference in the conclu- 
sions reached by 2 similar microarray 
expression profiling studies. One may 
compare genes expressed in a pa- 
tient's diseased lung tissue with those 
expressed in normal, nondiseased lung 
tissue from the same patient, and an- 
other may compare genes expressed in 
the same patients diseased lung tissue 
with those expressed in normal, non- 
diseased lung tissue from a healthy con- 
trol or normal individual. Moreover, it 
is also possible that seemingly unre- 
lated disease states may influence gene 
expression at distant sites. For in- 
stance, the presence of diabetes in 1 of 
2 renal cancer patients may compli- 
cate the direct comparison of renal 
tissues. 

Sample Variation: Proximity 
to Disease, Anatomic Location, 
and Developmental Range 

Yet another complication derives from 
the proximity of the normal tissue used 
as a control for the diseased tissue. Tis- 
sue adjacent to an area of disease may 
not be normal despite absence of evi- 
dence of disease clinically or under the 
light microscope. Normal-appearing tis- 
sue near a tumor could, for example, 
be genotypically altered or exhibit an 
altered gene expression profile. 10 * 13 



Moreover, factors such as the degree of 
disease-associated inflammation may 
have a significant impact on gene ex- 
pression profiles. Other bystander ef- 
fects, epiphenomena, or secondary dis- 
ease processes could all play important 
roles in determining expression pro- 
files within these adjacent, so-called 
normal tissues. These factors must be 
considered in the choice of normal. 

The precise location within a par- 
ticular organ may be another impor- 
tant factor that affects gene expres- 
sion. 9 For example, just as location 
relative to the urethra may influence ex- 
pression profiles in the prostate, 14 skin 
from the nose, back, and palm are cer- 
tain to have different expression pro- 
files as well, despite all being from the 
same organ. Thus, site and specific ana- 
tomic location must also be taken into 
account in a description of normal. 

It must also be kept in mind that the 
definition of normal actually repre- 
sents a dynamic state. 14 All tissues, 
which are composed of early and late- 
stage cells, have a normal developmen- 
tal range. For example, normal epithe- 
lium in prostatic ducts ranges from 
atrophic to resting to hyperplastic, and 
each has a unique pattern of gene ex- 
pression. 14 . 

A 3-dirnensional analytic approach is 
a strategy that has been used to address 
some of these concerns about defining 
normal. Cole et al 14 used a 3-dimen- 
sional model to characterize the entire 
prostate gland in their study of gene ex- 
pression profiles in prostate cancer. In 
this study, whole-mount prostactec- 
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tomy specimens were divided Into trans- 
verse cross sections such that the entire 
prostate gland, including the complete 
spectrum of normal epithelium and tu- 
mor progression, was available for view- 
ing, microdissection, and microarray 
analysis. This method was used to de- 
termine the exact physical relationship 
of the normal ducts, premalignant le- 
sions, and tumors — thus obtaining an 
anatomic framework on which to over- 
lay gene expression data. This tech- 
nique offers several advantages over the 
normal vs tumor comparison. Previous 
studies had used normal epithelium in 
prostatic ducts as a baseline control 
against which to compare and contrast 
tumor gene expression profiles. 15,16 How- 
ever, the expression profile of this nor- 
mal epithelium is affected by proximity 
to tumor, location within the gland, and 
developmental state. 14 These factors can 
be better appreciated using a 3-dimen- 
sional approach. 

DIsoase-Roifflted Variation 
Of course, many of the parameters that 
affect normal expression profiles (pa- 
tient ethnicity, age, sex, and genetic 
background, location within an or- 
gan, and developmental stage) will also 
affect disease expression profiles. 17 ' 19 
Disease heterogeneity, including sub- 
type, activity, severity, stage, and pre- 
vious as well as current treatments, also 
may have a significant impact on gene 
expression. 10 * 1 * 

Categorizing and subgrouping pa- 
tients on entry into a study may be use- 
ful to control for as many of these fac- 
tors as possible. However, there may be 
problems surrounding attempts to de- 
fine microarray-based categorization on 
the basis of another imperfect categori- 
zation system, such as histology, as these 
groups are sometimes arbitrary or in- 
consistently designated. Nevertheless, de- 
termining whether gene expression pro- 
files correlate with existing clinical or 
histological categories can provide new 
insights into the meaning of these cat- 
egories as can new methods of classify- 
ing cancers or other diseases into spe- 
cific diagnostic categories based on their 
gene expression signatures. Several stud- 
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ies have been able to establish expression- 
based criteria (class predictors) for pre- 
existing categories and then use these 
new criteria to categorize new cases (class 
prediction). 1 " 8 Global profiling may also 
allow the development of new classifi- 
cation systems based on gene expres- 
sion alone (class discovery). 1 *-* 0 

Thus, when possible, it will be of 
value to profile a range of normal and 
diseased cell populations from a num- 
ber of patients to distinguish between 
differences in expression that are rel- 
evant to the disease process and those 
reflecting the biological spectrum of the 
normal tissue or that have occurred for 
reasons unrelated to the disease. The 
significance of this disttncttapi is fur- 
ther appreciated when taking into ac- 
count the vast quantity of data gener- 
ated from microarrays and the potential 
for confounding interpretation from the 
inclusion of differential expression un- 
related to disease processes. 

It is worth.noting, however, that the 
issues of patient and sample variability 
are not unique to microarray experi- 
ments. In fact, microarray experiments, 
in contrast to classic single-gene experi- 
ments, may actually provide the tools for 
identifying this heterogeneity. For ex- 
ample, DNA microarrays have been used 
to explore physiological variation in gene 
expression on a genomic scale in 60 cell 
lines derived from diverse tumor tis- 
sues. 31 Cluster analysis allows the iden- 
tification of prominent features in gene 
expression patterns that appear to re- 
flect molecular signatures of the tissue 
from which the cells originated. 31 

HETEROGENEOUS CELL 
POPULATES 

A further complication encountered with 
expression profiles is that any given tis- 
sue is composed of several cell types, 
members of which are likely to be within 
a spectrum of dynamic functional states. 
For example, a simple punch biopsy of 
the skin may contain keratinocytes, me- 
lanocytes, Langerhans cells, Merkel cells, 
adipocytes, smooth muscle cells of ar- 
rector pili, striated muscle cells of the 
panniculus camosus, blood cells includ- 
ing immune system cells, and cellular el- 
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ements of blood vessels, nerves, hair fol- 
licles, sebaceous glands, and sweat 
glands. Moreover, cells from each of these 
populations will be at various stages of 
development and levels of activation, per- 
forming different functions and respond- 
ing to disease processes or treatments in 
different ways and to varying extents. The 
result is a highly heterogeneous sam- 
pling of cells, each expressing a unique 
set of genes. An expression profile gen- 
erated from a microarray study of the 
RNA in such a sample will thus repre- 
sent merely a snapshot of the genes ex- 
pressed by a plethora of cells at a mo- 
ment in time. Such extensive cellular 
heterogeneity complicates the ability to 
draw conclusions about specific pro- 
cesses occurring within a tissue speci- 
men. An illustrative example is pro- 
vided by Stanton et al," who used 
microarrays to identify genes differen- 
tially expressed during myocardial in- 
farction. The expression profiles they 
studied represented transcripts from cell 
populations as diverse as immune sys- 
tem cells, which migrated to the infarct 
region and are responsible for the in- 
flammatory response, cardiac myocytes 
within the ischemic area undergoing 
apoptosis and necrosis, fibroblasts un- 
dergoing proliferation and participat- 
ing in the formation of scar tissue to re- 
place the infarct, and cardiac myocytes 
undergoing hypertrophy to compen- 
sate for the loss of cells in the infarct 
area. 11 The issue of such cellular hetero- 
geneity was avoided by categorizing the 
differentially expressed genes into func- 
tional categories to look for patterns in- 
dicative of cardiac remodeling without 
attempting to attribute specific tran- 
scripts to specific cell types. For gene ex- 
pression studies involving samples with 
mixed cellular populations, further in- 
vestigation, such as with in situ messen- 
ger RNA (mRNA) hybridization, maybe 
necessary to localize the transcripts be- 
fore conclusions can be drawn about the 
roles of specific genes in specific cell types 
during the disease process. 

Laser Capture Microdissection 

An ingenious but technically delicate 
approach to the study of complex bio- 
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logical samples has become possible 
with the development of laser capture 
microdissection (FIGURE 2) * This tech- 
nique allows for the rapid and accu- 
rate procurement of cells from spe- 
cific areas of tissue under direct 
microscopic visualization, and thus 
makes the molecular genetic analysis 
of defined populations in their native 
tissue environment possible. 35 Sgroi et 
al* demonstrated the feasibility of com- 
bining laser capture microdissection 
with high-throughput cDNTA arrays. 
They showed that in vivo subpopula- 
tions of malignant cells from multiple 
stages of breast cancer progression 
could be separated from nonmalig- 
nant populations, and their expres- 
sion profiles could subsequently be ana- 
lyzed using microarrays. 

The po tential is powerful. Specimens 
could beseparated into tissue layers; for 
example, separating a skin biopsy into 
epidermis, dermis, and hypoderrnis. Tis- 
sues could be further differentiated into 
specific structural components, such as 
dermis into blood vessels, adipose, ar- 
rector pilt, and sebaceous glands. Struc- 
tures could be separated into defined cell 
types, such as blood vessels into endo- 
thelial cells, erythrocytes, and lympho- 
cytes. Cell types could even be sepa- 
rated into marker-defined subtypes, such 
as lymphocytes into CD4 and CD8 cells. 
Expression profiles from refined and de- 
fined structures and cell types likely 
would be extremely valuable in the study 
of disease. 

Potential aside, there are significant 
limitations to this technology at the pres- 
ent time. The standard protocols for fix- 
ing and embedding tissue samples from 
surgical resections were not designed 
to be compatible with microarray ex- 
periments, with or without laser cap- 
ture microdissection. Typically, tissue 
suspected of being important for diag- 
nosis and staging is processed through 
aldehyde-based fixatives, such as for- 
malin, which damage mRNA integ- 
rity." If frozen tissue is available, mRNA 
can be recovered and studied from dis- 
sected cell populations. However, fro- 
zen tissue sections are technically dif- 
ficult to prepare, the histology is often 



Figure 2. Laser Capture Microdissection 
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A, Tissue sections are processed and placed on a microscope slide under a thin, transparent thermoplastic film, 
which Is attached to a movable cap. Visualizing the tissue microscopically, a short-duration, focused pulse from 
an Infrared carbon-dioxide laser is used to activate and melt the film to selectively adhere cells within targeted 
areas of Interest B, When the cap Is lifted, the film, with selected cells still bound, Is removed from me tissue 
section for further processing iu retrieve cellular materials (eg, DNA, RNA, proteins). 



severely compromised » and the tissue 
available may contain only a limited 
portion of the lesion." 

Moreover, the sample amounts gen- 
erated from laser capture microdissec- 
tion can be small, even as miniscule as 
a single celL M Consequently, the yields 
of RNA are low. Arrays have a thresh- 
old for the quantity of molecular start- 
ing material: at least 5 to 15 ug for oli- 
gonucleotide arrays and between 2 and 
100 pg for cDNA arrays, depending on 
the manufacturer, the source of the 
RNA, and the use of signal amplifica- 
tion. 19 - 40 Studies that have successfully 
integrated laser capture microdissec- 
tion with microarray technology have 
used samples of approximately 1 X 10* 
to 1 X 10 5 cells with 95% to 98% ho- 
mogeneity as determined by micro- 
scopic visualization. 16 * 41 If needed, am- 
plification techniques may be used to 
generate sufficient genetic material for 
microarray hybridizations. 11 Laser cap- 
ture microdissection is an intriguing 
technology, but time will tell whether 
its potential is realized. 

Although some biological issues re- 
lated to gene expression may be com- 
plicated by the presence of heterog- 
eneous cell populations in studied 
samples, it is also true that some bio- 
logical conditions can be understood 
only in the context of these heterog- 
eneous cell populations. The nature of 
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global gene expression experiments is 
to uncover differences between 2 bio- 
logical samples, including those differ- 
ences based on diverse cell popula- 
tions. For example, to appreciate a 
disease that is characterized by an in- 
flammatory infiltrate, it must be un- 
derstood that the inflammatory infil- 
trate is part of the disease and is part 
of the difference between diseased and 
nondiseased tissue. Thus, the isola- 
tion of specific cell populations for 
study is not necessarily required or even 
desirable in all instances. 

MAKING MICROARRAYS 
COMPARABLE 

Ideally, microarray experirnen ts should 
be comparable both within and be- 
tween laboratory or manufacturing sys- 
tems, but obtaining consistent and com- 
parable data is a critical challenge for 
microarray-based expression analysis. 
Major sources for the observed vari- 
ability of microarray data include the 
normal physiological gene expression 
variations in different samples and the 
noise introduced in the microarray as- 
say process. 41 

Physiological Cone 
Expression Variation 

Inextricably linked to the issues of pa- 
tient and sample variability and tissue 
heterogeneity discussed above, is the 
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problem of normal gene expression 
variations and how to distinguish these 
variations from significant disease- 
associated changes. Few studies have 
systematically investigated physiologi- 
cal expression changes, but data from in 
situ hybridizations suggest that normal 
variance for many tightly regulated tis- 
sue-spedfic genes can be within 20% to 
30%/ 2 However, there can be as much 
as 2- to 4-fold random fluctuations for 
many gents in yeast. 43 Affymetrix 
(Santa Clara, Calif) guidelines have sug- 
gested that for most of the "housekeep- 
ing" genes in human tissues, which are 
likely to be less tightly regulated, differ-: 
ences of less than 4-fold are probably not 
biologically significant. 43 Conse- 
quently, a significant portion of micro- 
array data variability for high- or me- 
dium-abundance mRNAs may simply be 
due to normal expression variations. Sev- 
eral previous studies have used arbi- 
trary 2-fold change criteria to define sig- 
nificant expression change. 16 However, 
the 2-fold threshold has been shown to 
be statistically invalid even for dupli- 
cate experiments. 46 In a recent study that 
used cDNA microarrays to profile gene 
expression in samples of normal skin 
from breast-reduction surgeries, 71 of 
4400 genes were found to demonstrate 
variability in expression greater than 2 
SDs from the mean of each gene. 17 These 
included genes coding for transport pro- 
teins, gene transcription, cell-signaling 
proteins, and cell-surface proteins. Thus, 
physiological variation should be con- 
sidered in the analysis and interpreta- 
tion of microarray data. More stringent 
criteria for defining significant expres- 
sion change may be useful. 

Noise In the Microarray 
Assay Process 

For the tightly regulated (mostly low 
abundance) mRNA species, inconsis- 
tencies introduced at any stage of the 
microarray-based assay process may 
play a major role in data variability. 12 
Due to the miniaturization and the large 
number of genes involved, it is diffi- 
cult to maintain consistent processing 
conditions for each sequence across 
multiple assays, and obtaining accu- 
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rate absolute signals is unlikely. 42 Noise 
may be introduced by slide heteroge- 
neities, printing irregularities (eg, 
pin-to-pin variations), and spotting 
volume fluctuations. 46 Some of the sys- 
tematic variations may be reduced by 
the inclusion of controls, but random 
fluctuation at various manufacturing 
stages cannot be completely con- 
trolled and can accumulate quickly in 
a complicated assay. 48 

In certain microarray systems, 2 
samples are competitively hybridized to 
1 array using different fluors for label- 
ing. In other systems, there is only single- 
sample hybridization. A 2-color system 
might be expected to be more reliable 
since variations in. spot size or amount 
of cDNA probe on the chip should not 
affect the signal ratio (both signals are de- 
rived from the same spot). However, this 
only holds true if signals are well above 
the background in both detection chan- 
nels. 42 In fact, the signal level for most 
of the tightiy regulated genes will likely 
be close to the background level. 42 In ad- 
dition, background level on a slide can 
also vary signilicandy from spot to spot 
due to factors such as unevenness in slide 
surface properties, dust contamination, 
and incomplete washing, leading to high 
levels of signal variability for low- 
abundance mRNA species even in 
2-color systems. 42 

The high levels of variability of mi- 
croarray data also mean that subtle 
changes in experimental conditions 
may significandy alter the results, mak- 
ing it difficult for separate laborato- 
ries to compare experimental data. In 
addition, the lack of standard con- 
trols, the predominant use of relative 
signals (ratios), and the adoption of in- 
compatible data formats contribute to 
poor comparability between studies 41 

Despite the hard-wired variability in- 
troduced by chip manufacturing con- 
ditions, most of the published studies 
to date using microarray-based expres- 
sion analysis include only limited num- 
bers of replicates. 4 * In fact, many stud- 
ies conduct the experiment only once. 
Considering the potential sources of as- 
say variation, the need for sufficiently 
replicated studies is underscored. 49 
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Microarray Data Normalisation 

Because of variability of microarray data 
for single sample arrays and for fur- 
ther analysis of 2-color system arrays, 
each must be brought into the same 
scale to compare 2 or more arrays. How 
to perform this normalization of gene 
expression levels across multiple ar- 
rays, thus removing systematic varia- 
tion between the arrays and rendering 
different experiments comparable, re- 
mains an issue that is not yet fully re- 
solved. 30 Many of the early microarray 
studies in the literature simply ig- 
nored this issue, A more statistically rig- 
orous approach is needed. 

One difficulty has been that leading 
microarray manufacturers have not 
published statistical error models for 
their products. Thus, users are un- 
clear how much to adjust data for varia- 
tions in spot intensity, hybridization 
patterns, and intensity measurement 
sensitivity. Software does exist to al- 
low for array-to -array comparisons by 
using a scaling factor to normalize gene 
expression patterns across arrays. How- 
ever, in general, these algorithms as- 
sume that intensity differences be- 
tween arrays are linearly related with 
a zero y-intercept. 31 This assumption al- 
lows software to trim the tails off dis- 
tributions of expression from differ- 
ent arrays at statistical cutoffs and then 
simply move the distributions along an 
axis to a common level to provide com- 
parisons. However, this linear relation- 
ship often does not hold true. 31 When 
the average expression level of 1 array 
is higher than that of a second array, a 
longer tail will be trimmed from the sec- 
ond array. Thus, a greater number of 
genes from the first array will be 
counted as being expressed because 
their expression level is above the sta- 
tistical cutoff point. In this case, the 2 
arrays cannot be considered compa- 
rable. 

Although bioinformatic software has 
recently been developed that offers 
more statistically robust normaliza- 
tion, the cost of these commercially 
available programs (combined with the 
already expensive microarrays) has 
been prohibitive for many research- 
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ers. 50 Standardization of these pro- 
cesses awaits the development of im- 
proved methods of normalization 
leading to valid statistical models widely 
available to all researchers. 

To this end, Schadt et al 31 have de- 
veloped a standard nonlinear curve 
technique for normalizing the data in 
arrays that do not demonstrate a lin- 
ear relationship between data sets. This 
model performs well when the 2 
samples being compared demonstrate 
a low number of differentially ex- 
pressed genes. However, when expres- 
sion profiles of 2 samples vary to a 
greater extent, Schadt et al 51 recom- 
mend a rank-selection method. Using 
this method, genes expressed on an ar- 
ray are ranked from highest to lowest 
level of expression. Then, for the ar- 
ray expressing a greater number of 
genes, the genes with the lowest ex- 
pression levels are removed from the list 
until the 2 arrays Hst a comparable num- 
ber of expressed genes! "This type of 
rank-selection method has gained sup- 
port from other groups, but it too has 
limitations.' 0 Removing low- expres- 
sion level data points restricts the study 
to the more extreme and easily de- 
tected entities, a technique that blunts 
the genomic-scale potential of micro- 
array technology. 

Efforts continue to improve compa- 
rability between arrays. Jones 30 re- 
cendy applied a statistical model to nor- 
malize spotted cDNA array data that 
takes into account not only the differ- 
ences in numbers of genes expressed be- 
tween arrays, but also the interarray 
variations in fluorescent dye intensity 
and mechanical error occurring in the 
printing process. Nevertheless, the is- 
sue of how to properly normalize ar- 
ray data has not been settled. Research- 
ers must continue to demand statistical 
rigor in their comparisons before they 
can believe the mathematical results of 
their data. 

LOGISTICAL LOGJAM 

Microarrays deliver massive amounts of 
data on lens of thousands of genes. The 
result is an immense quantity of bio- 
logical information that must be ana- 
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lyzed, presented, and archived in a 
meaningful way. 

Data Analysis 

In human studies, the number of hy- 
bridizations that can be performed for 
any set of experimental conditions is of- 
ten restricted by the limited number of 
obtainable tissue samples and by the ex- 
pense of arrays. Restricted numbers of 
hybridizations for each experiment 
hamper the ability to assess the bio- 
logical significance of variation within 
or between given sets of conditions. 
Thus, for the assessment of thousands 
of genes in a setting of limited hybrid- 
izations, the importance of reliable and 
sophisticated algorithms for data analy- 
sis becomes amplified. 51 

A logical beginning is to examine the 
extremes, that is, genes with signifi- 
cant differential expression in indi- 
vidual samples. For example, a com- 
parison of 2 samples can be visualized 
in the form of a simple blvariate scat- 
terplot in which the expression profile 
of 1 sample (x-axis) is plotted against 
that of the second sample (y-axis). The 
distribution pattern generally demon- 
strates that the expression ratios clus- 
ter around the line in which x is equal 
to y (indicating comparable levels), with 
individual genes falling varying dis- 
tances from this line. Additional lines can 
be placed on the scatterplot to repre- 
sent various fold changes of expres- 
sion. Data points that fall above or 
below these lines represent genes ex- 
hibiting expression ratios greater or less 
than the specified fold change. Thus, one 
can begin by examining those genes that 
demonstrate a 10-fold or greater change 
in expression level. To expand the num- 
ber of genes under investigation, one can 
examine genes that demonstrate a 5-fold 
or greater change, or a 3- fold or greater 
change, and so forth. Many studies de- 
fine a 2-fold or greater change in ex- 
pression level to represent significant dif- 
ferential expression. The 2-fold 
threshold, however, as noted above, has 
been shown to be statistically invalid. 46 
Although this simple technique can be 
efficient and effective for focusing on ex- 
panding sets of differentially expressed 
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sequences, again, such an analysis does 
not take advantage of the full potential 
of genome-scale experiments to en- 
hance our comprehension of cellular bi- 
ology that would be provided by an in- 
clusive analysis of the entire repertoire 
of transcripts in a cell as it goes through 
a biological process. 52 A more holistic ap- 
proach, which allows the deciphering of 
patterns from the entire data set, is 
needed. 

Data Organization 
and Presentation 

Statistical algorithms can be applied to 
detect and extract patterns within pro- 
filing data. It is a basic assumption of 
many gene expression studies that 
knowledge of where and when a gene 
is expressed provides information about 
the function of the gene. Therefore, an 
important beginning is to organize 
genes on the basis of similarities in their 
expression profiles." However, even 
this basic tenet deserves critical con- 
sideration. Similarity of gene expres- 
sion profile does not mandate similar- 
ity of function or mechanistic pathway, 
and it maybe purely coincidental. Nev- 
ertheless, the idea of clustering genes 
on the basis of their expression pat- 
terns is well established and cluster 
analysis has become the most widely 
used statistical technique applied to 
large-scale gene expression data." 

Although various cluster methods can 
usefully organize tables of gene expres- 
sion measurements, the resulting or- 
dered but still massive collection of num- 
bers remains difficult to assimilate. Thus, 
another important component of ge- 
nome-wide expression data explora- 
tion is the development of powerful data 
visualization methods and tools. Ap- 
proaches have been developed that pre- 
sent clustering results in simple graphi- 
cal displays such as dendrograms, which 
represent relationships among genes by 
a tree whose branch lengths reflect the 
degree of similarity in expression be- 
tween the genes. Similarity is mathemati- 
cally defined. 34 The computed trees can 
be used to order genes in the original data 
table such that genes or groups of genes 
with similar expression level patterns are 
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Figure 3. Real-Time Polymerase Chain 
Reaction (PCR) 
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A, Complementary DNA synthesis from source mes- 
senger or Iota! RNA proceeds as with traditional re- 
verse transcriptase PCR. B, A dual-labeled fluoro- 
genlc probe (with a higher melting point than the PCR 
primers used for extension) Is annealed to the target 
sequence between the forward and reverse PCR prim- 
ers. A reporter (R) ftuorochrome (usually 6-carboxy- 
fluorosceln) is attached to the 5' end of the probe and 
a quencher (Q) molecule (usually €-carboxy- 
tetramethyl-rhodamine) is attached to the 3' end. 
C, As Taq polymerase extends In the 5' to 3' direc- 
tion, the dual-labeled probe begins to be displaced from 
the target sequence. D, As the Taq polymerase con- 
tinues to extend, the 5' to 3' endonudease activity 
deaves the reporter (R) molecule from the probe se- 
quence such that its emission spectra (51 8 nm) are no 
longer quenched by the second (Q) fluorescent dye. 
fluorescence Is measured continuously throughout the 
PCR amplification In real time and is proportional to 
the amount of PCR product generated in each cyde. 

placed adjacently. Clustering methods 
can also be combined with representa- 
tion of each data point with a color that 
quantitatively and qualitatively reflects 

2386 JAMA, November M, 200 J— Vol 286, No. 18 (Reprinted) 



the original experimental observa- 
tions. 31 Visual assimilation is then more 
intuitive. 

Data Archiving and Mining 

Ultimately, successful interpretation of 
gene profiling studies is likely to be de- 
pendent on the integration of experi- 
mental data with external information 
resources. As multiple experiments in- 
volving multiple cell types and tissues 
from multiple laboratory groups accu- 
mulate, data archiving may well be* 
come the watershed issue. Ideally, all 
data, in a suitably standardized form, 
would be freely accessible in the pub- 
lic domain. Even assuming a willing- 
ness, to share the data, such Utopian 
goals would require a user-friendly and 
powerful database system and stan- 
dardization of correction and normal- 
ization procedures such that data points 
from various projects become compa- 
* rable. 55 The National Center for Bio- 
technology Information Entrez sys- 
tem (http://www.ncbi.nlm.nih.gov 
/Entrez/) does provide uselul data in mis 
regard, but current databases may be 
limited in scope or compiitability. 53 A 
major focus of infrastructure develop- 
ment to support genomic-scale expres- 
sion studies will need to be in the area 
of electronic biological pathway data- 
bases and resources. 

CONFIRMATIONAL STUDIES 

The development of more sophisti- 
cated analytical algorithms and data- 
bases will help lend credence to the bio- 
logical significance of differential gene 
expression determined by microarray 
analysis. In the meantime, several stud- 
ies have begun to examine the sensi- 
tivity and specificity of microarray- 
based experiments. Sensitivity, defined 
as the minimum reproducible signal de- 
tected by a given array scanning sys- 
tem, has been reported for microar- 
rays to be approximately 10 mRNA 
copies per cell, which is slightly infe- 
rior to the sensitivity of Northern blot 
anaIyses. w ' ,7 Specificity studies showed 
that for a given probe any nontarget 
transcripts with more than 75% se- 
quence similarity may show cross- 



hybridization. 56 The problem of clone 
misidentiflcarion and the need for clone 
confirmation have also been ad- 
dressed. 18 One study found that of 1 189 
bacterial stock cultures, only 62.2% 
were uncontaminaled and contained 
cDNA inserts that had significant se- 
quence identity with published data for 
the ordered clones." Thus, the use of 
sequence-verified clones forcDNA mi- 
croarray construction is warranted. 

Additionally, potential gene candi- 
dates can be assessed for relevance to 
disease using parallel technologies. Sev- 
eral such alternative platforms have 
been used to bolster the importance of 
specific sequences first suggested in 
gene chip comparisons including (1) 
methods at the RNA level, (2) meth- 
ods at the protein level, and (3) func- 
tional studies. 

Methods at the RNA Level 

Reverse transcriptase polymerase chain 
reaction (RT-PCR) is a method often 
used to verify microarray data. Al- 
though RT-PCR is not well suited to 
quantitation, the relative technologi- 
cal ease of this assay and the ability to 
rapidly monitor multiple samples make 
it a useful technology. Hybridiza- 
tion data can be verified and multiple 
putative markers can be screened in a 
short period. 

Several other studies have used real- 
time quantitative RT-PCR (TaqMan, 
PE Applied Biosystems, Foster City, 
Calif ). 13 - 62 Real-time PCR is a tech- 
nique that increases the quantitative 
ability of RT-PCR by providing accu- 
rate and reproducible information 
on RNA copy number (FIGURE 3). In 
this method, a fluorogenic probe 
(labeled at the 5' end with a reporter 
fluorochrome and at the 3' end 
with a quencher fluorochrome) is 
annealed to 1 strand of the target 
cDNA sequence between the forward 
and reverse PCR primers. As Taq poly- 
merase extends the forward primer, its 
intrinsic 5' to 3 # nuclease activity dis- 
places and degrades the dual-labeled 
probe, releasing the reporter fluoro- 
chrome from the quencher label and 
allowing the detection of a fluorescent 



©2001 American MedicalAssociation. All rights reserved. 



Downloaded from www.jaina.com at U.S. Patent and Traderaajk Office, on June 6, 20O6 



DNA MICROARRAYS 



signal that is proportional to the 
amount of PCR product generated in 
each cycle.* 1 

Northern blot analysis is also com- 
monly used as a confirrnational tech- 
nique, as it is a standard specific and 
semiquantitative method. 15 '"* 61 For 
mRNA expressed at modern te-tq-high 
levels, and for which cDNA probes arc 
available, Northern blot analysis works 
well, but it is not well suited for low- 
copy mRNA. 64 * 5 Furthermore, only a 
small number of genes can be ana- 
lyzed with this conventional method. 

Methods at the Protein Level 

DNA microarray technology is limited to 
the study of gene expression at the 
mRNA level However, it has been es- 
tablished that mRNA levels do not nec- 
essarily correlate with protein levels. 
Moreover, the level of expression or even 
presence of a protein is not tightly linked 
to physiological consequences. An in- 
vestigation conducted by Winzeler et al, M 
for example, provides a cautionary tale. 
Their study demonstrated that genes up- 
regulated in yeast growing in minimal 
medium did not prove to be more im- 
portant for growth than genes that were 
not upregulated." They found only 2 of 
8 genes required for yeast growth in 
minimal medium to be induced. The les- 
son to be learned is that genes that are 
not differentially expressed may be of 
equal functional importance in disease 
states compared with those that are. 

Furthermore, the regulation of some 
genes maybe at the translattonal rather 
than the transcriptional level, which 
would preclude detection by DNA mi- 
croarrays. Posttranslational modifica- 
tion of proteins is also an important 
mode of regulation that cannot be de- 
tected by DNA microarrays. Protein ac- 
tivity, particularly receptor activity, is 
heavily dependent on phosphoryla- 
tion, for example. DNA and mRNA re- 
veal nothing about whether a given pro- 
tein is active, and can be deceptive when 
used to speculate about quantities of 
proteins. It has been demonstrated that 
the correlation between mRNA and pro- 
tein abundance is less than 0.5, rtT em- 
phasizing that ideally, mRNA expres- 
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sion studies should be accompanied by 
analyses at the protein level. 39 Radio- 
immunoassay and immunohistochem- 
istry have been used in a number of 
studies. 13 * 8 * 9 These techniques, how- 
ever, are not well suited to detecting low 
levels of expression, and they require 
the availability of an antibody specific 
for the protein to be studied. 

The field of proteomics, the large- 
scale parallel analysis of the proteins that 
are present in a cell, is developing rap- 
idly, but has problems of its own. Pro- 
teins vary in abundance by many orders 
of magnitude within a given cell, and 
there is no PCR equivalent for the ampli- 
fication of proteins. Moreover, proteins 
fold in many known (and unknown) 
ways that affect their function. The fea- 
sibility of the microarray analysis of pro- 
teins has begun to be explored. Antibod- 
ies attached to microarrays can be used 
to bind to and quantitatively detect pro- 
teins that have been tagged with fluo- 
rescent dyes. 70 Skeptics doubt the plau- 
sibility of identifying thousands of 
unknown proteins in this manner. 70 The 
diverse chemistry of various proteins 
poses serious difficulties, and it will be 
challenging to find antibodies for every 
protein. Thus, although it is important 
to incorporate protein analyses into 
expression profiling studies, current plat- 
forms are technically limiting. 

Functional Studies 

Confirming the role of a gene initially 
identified in a microarray experiment in 
animal models with transgene or knock- 
out studies provides a particularly pow- 
erful alternative platform. Transcript 
function, rather than mere presence, is 
addressed. However, this approach is ill- 
suited for high-throughput conditions. 
It may be ideal for an in-depth investi- 
gation of 1 or 2 genes of interest, but it 
is not practical for confirming Urge quan- 
tities of profiling data. 

Confirrnational studies are useful to 
corroborate the biological signifi- 
cance of differential gene expression de- 
termined by microarray analysis. While 
improved databases and more reliable 
statistical models will help to lend 
greater authority to array data, alter- 
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native platforms can be used to assess 
the relevance of genes first identified by 
array comparisons. It should be real- 
ized, however, that the alternative tech- 
nologies are not Intended for large- 
scale analyses. Realistically, only 
selected sequences from the array data 
can be confirmed with other plat- 
forms in the short-term, a retreat from 
the initial purpose of the genome- 
scale investigation by microarray. 

CONCLUSIONS ~~ 

Microarrays can be expected to prove ex- 
tremely valuable as tools for the study of 
the genetic basis of complex diseases. The 
ability to measure expression profiles 
across entire genomes provides a level of 
information not previouslyattainable. Al- 
though complicated issues must be re- 
solved, the potential payoff is big. Mi- 
croarrays make it possible to investigate 
differential gene expression in normal vs 
diseased tissue, in treated vs nontreated 
tissue, and in different stages during the 
natural course of a disease, all on a ge- 
nomic scale. Gene expression profiles 
may help to unlock the molecular basis 
of phenotype, response to treatment, and 
heterogeneity of disease. They may also 
help to define patterns of expression that 
will aid in diagnosis as well as define sus- 
ceptibility loci that may lead to the iden- 
tification of individuals at risk. Finally, 
as specific genes are identified and their 
functional roles in the development and 
course of disease are characterized, new 
targets for therapy should be identified. 

Despite the problems of defining nor- 
mal, understanding tissue heterogene- 
ity, making arrays comparable, analyz- 
ing and archiving massive quantities of 
data, and performing confirrnational 
studies in alternative platforms, expres- 
sion profiling with microarrays stands 
as a truly revolutionary technology. As 
we continue to delve into the possibili- 
ties, we will surely progress in our un- 
derstanding of current issues and com- 
plications. No doubt the ride on the 
high-throughput highway will be ex- 
hilarating. 

Author Contribution: Ms King and Dr SInha both con- 
tributed to the conceptualization and writing of thii 
article. 
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13 

Gene Expression Analysis 
Using Microarrays 

Sophie E. Wildsmith and Fiona J. Spence 



13.1 Introduction 

Microarrays are of increasing interest to both industry and academia as tools 
for 'gene hunting' and also as quantitative methods for routine analysis of large 
numbers of genes. Techniques such as real-time polymerase chain reaction (RT- 
PCR) TaqMan™ and SybrMan™ are generally considered to be more accurate, 
robust, larger in dynamic range and less capital intensive, but for rapid, large- 
scale gene expression analysis using limited mRNA, microarrays and gene chips 
are preferred. 



13.2 Microarray experiments 
Platforms 

Global gene expression platforms are now available in multiple formats, 
including cDNA arrays, oligonucleotides spotted onto slides or in situ 
synthesised oligonucleotide arrays manufactured using .photolithography. 
Commercial sources for these include Stratagene (La Jolla, CA), Memorec 
(K8ln, Germany) and BD Biosciences (Oxford, UK) for cDNA microarrays, 
Mergen Ltd (San Leandro, CA) for spotted oligomers and Affymetrix (Palo 
Alto, CA) for oligoarrays synthesised in situ. Purchasing from a supplier is more 
expensive than generating microarrays in-house, although the latter is beneficial 
in labour-intensive institutions or when proprietary gene information is utilised. 

Molecular Biology in Cellular Pathology, Edited by John Crocker an'd Paul G. Murray 
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'Off-the-shelf microarrays may also have the advantage of rigorous quality 
control and standardised protocols. 

It is possible to produce oligonucleotide spotted arrays in-house, by design- 
ing oligonucleotide sequences that match genes of interest and then purchasing 
purified oligonucleotides to spot down on glass or other substrates. Alternatively 
there are new systems such as that available from CombiMatrix (Mukilteo, 
WA, USA) for computer-aided design and in situ synthesis of oligonucleotides. 
However, production of cDNA microarrays is currently the most affordable and 
popular method and is now well established. Numerous sources of informa- 
tion on cDNA microarray fabrication are available in the literature and on the 
internet (Bowtell, 1999; Cheung etaL, 1999; Wildsmith and Elcock, 2001 and 
http://cmgm.Btanford.edu/pbrown). Thus, this chapter will focus on the 
implementation of experiments and analysis of data ftojn cDNA microarrays. 
The experimental procedure differs slightly according to the number of fluo- 
rophores (or channels) and the type and manufacturer of the array. We have 
attempted to describe a generic process, indicating where possible the different 
options. Figure 13.1 demonstrates the procedure for a two-colour hybridisation. 



Sample 1 Sample 2 




Figure 13.1 The microarray experimental process for two-colour hybridisations 
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RNA Extraction 

First, RNA is extracted from the tissue or cells of interest. The quality of the 
RNA extracted is paramount to the overall success of the microarray experi- 
ment, as impurities in the sample can effect both the probe labelling efficiency 
and also stability of the fluorescent label (Hegde et al. f 2000). Snap-freezing 
of tissue in liquid nitrogen, immediately after harvesting, is used to preserve 
RNA integrity. Any ftirther sectioning of the tissues should be carried out under 
RNAse-free conditions (Fernandez et aL, 1997). Total RNA can be extracted 
using kits such as TRIzol® (Invitrogen, Paisley, Scotland) and Rneasy (Qia- 
gen, GmbH, Hilden, Germany). Some researchers perform a further extraction 
of mRNA; this results in a purer starting material but has the disadvantage of 
lower yields. Affymetrix recommend between 5 and 40 \ig of total RNA is 
required for their GeneChips™ and 10 pig or less is the required amount of 
starting material for cDNA microarrays (Hegde et ai 9 2000). 



Sample .Labelling 

The mRNA is transcribed in vitro, with the concomitant inclusion of labelled 
nucleotides. The labels may be fluorescent or radioactive. In the case of dual 
channel/colour hybridisations, two samples will be labelled with dyes that 
fluoresce at different wavelengths, with different emission spectra. Example 
fluorophores, available coupled to nucleotides, are Cy3, Cy5, fluorescein 
and lissamine. Wildsmith et al (2001) have demonstrated that AlexaFluor 
546dUTP™ (Molecular Probes, Leiden, The Netherlands) gives a significantly 
higher signal than Cy3dCTP (Amersham Biosciences, Piscataway, NJ, USA). 
When performing two-colour hybridisations the control sample and 'test' sample 
are labelled with different fluorophores and the subsequent cDNA is then mixed 
together and hybridised simultaneously (Nuwaysir et a/., 1999). An advantage of 
simultaneously hybridising control and treated sample is that it obviates the need 
to control for differences in hybridisation conditions or between microarrays. A 
specific example of the huge impact this technique has had includes its use 
in the first published account of gene expression data of the entire genome of 
Sacchawmyces cerevisiae (DeRisi et aL t 1997). In two-colour hybridisations, 
one would assume that the properties of the two fluorescent dyes being used 
are equivocal. In fact, for Cy5 and Cy3 this is not the case as Cy5 has been 
reported to give higher background fluorescence and also is more sensitive to 
photobleaching than Cy3 (Van Hal et a/., 2000). In addition, there is evidence 
from several independent sources that the combination of Cy3 and Cy5 dye 
labelling can affect data in certain genes. That is to say, when experiments 
are repeated and the dye combination for the two probes reversed, inconsistent 
results are obtained with certain genes (Taniguchi et at., 2001). Despite these 
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Treated : Cy 3-532 nm Control : Cy 5-635 nm 




Figure 13.2 Two-colour fluorescent scan of human gene cDNA array. The probe mix con- 
sists of DNA from HepG2 control cells and cells treated with buthionine sulfoximine for 6 h. 
A colour version of this figure appears in the colour plate section 

facts, two-colour hybridisations are widely accepted throughout the microarray 
community and an example of an image is shown in Figure 13.2. 

Hybridisation and Processing 

After labelling, the cDNA is purified (to remove unincorporated nucleotides), 
mixed with a hybridisation buffer and then applied to a cDNA microarray slide! 
The sample and the slide are heated prior to hybridisation in order to separate 
double-stranded DNA. A coverslip is applied (Shalon et al % 1996), or prefer- 
ably a hybridisation chamber is used to avoid evaporation and enable an even 
hybridisation. The hybridisation and subsequent wash steps are carried out at a 
buffer stringency and temperature that enables hybridisation of complementary 
strands of DNA but reduces non-specific binding. 

Image Capture and Image Analysis 

After hybridisation the microarray slides are scanned, using either a laser or a 
phosphorimager (depending on the type of label used). There are many different 
suppliers and models of fluorescence scanner, for example the ScanArray 5000 
(Perkin Elmer Life Sciences, Zaventem, Belgium), GenePix 4000B (Axon GRI, 
Essex, UK) and the GeneArray® (Aflymetrix, Santa Clara, CA). The choice of 
scanner is determined by sensitivity, resolution, flexible wavelength, file size 
generated, throughput and technical support available. 

Images are analysed using software that measures the intensity of the sig- 
nal from the hybridised spotted genes (spots), which provides a measurement 
of the amount of cDNA bound. Thus the initial concentration of messenger 
RNA is inferred. Early software packages 'drew* grids around the spots and 
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integrated across the whole area of the grid. This overcame problems associ- 
ated with accurate location of the spots, which is problematic, especially if the 
spots on the printed arrays are poorly aligned. More recent versions of software 
'draw' circles around the spots themselves and perform measurements within 
and outside of this boundary. For example, the background may be calculated 
from a region outside the spot boundary. The intensity of the signal from the 
spot may be calculated using median, mode or mean values of the pixels within 
the spot. Researchers differ in their preferences regarding using median or mean 
values (Hegde et al. f 2000) and this is likely to depend upon the protocols and 
software used. 

Image analysis software commonly is supplied with scanners or can be 
bought from the same supplier. This has the advantage of being opti- 
mised for that specific type of microarray and the benefit of upgrades 
and technical support. Software for microarray analysis is available 
from BioDiscovery (http://www.biodiscovery.com), Imaging Research 
(http://imagingresearch.com), GenePix Pro (Amersham Biosciences, Pis- 
cataway, NJ), arraySCOUT 2.0 (http://www.iionbioscience.com), NIH 
(http : //www . nhgri . nih . gov/D I R/LCG/l 5 K/ HTML/ img ^analysis . html), 
Stanford University (http://rana.stanford.EDU/software) Media Cyber- 
netics (Silver Spring, MD, USA) and TIGR (http//www . t igr . org/sof t lab). 
Important criteria for image analysis software include speed, ease of use, 
automation and the ability to distinguish artefact from real signal (Wildsmith 
and Elcock, 2001). 

As the technology has evolved and more experience gained, it has become 
more and more apparent that the most significant issues facing microarray users 
are the processing of the vast quantities of data generated and deciding exactly 
what tools are the most appropriate for data analysis. Because of the enormity 
of this, we have dedicated a complete section to describing the current status of 
this area. 



13.3 Data analysis 

It is important to be cognisant of the fact that the practical laboratory aspects of 
using microarrays are only part of gene expression analysis. Many researchers 
generate vast volumes of data, without a clear understanding of how to manage 
and interpret them. Furthermore, the variability in microarray data confers addi- 
tional problems for analysis. In some cases the purpose of the experiment will 
be a gene-hunting exercise, in which case a cursory indication of potential gene 
biomarkers is sufficient analysis. In other instances, such as pathway mapping 
and screening studies, it is paramount that results are statistically meaningful 
and valid. The next few sections detail some relatively simple analysis meth- 
ods and recommendations for the benefit of researchers with minimal statistical 
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Figure 13*3 The ideal micro array experimental design and process 

training. There are also suggestions for more advanced analysis for those who 
have the assistance of a statistician or specialist data analyst. 

Most of the steps before, during and after performing a microarray experi- 
ment are optimally conducted with regard for statistics and data analysis. Careful 
planning before implementation facilitates the downstream analysis and inter- 
pretation of data. The following model summarises the entire microarray process 
with integration of the biological and data analysis components (Figure 13.3), 

Hypothesis Generation 

Any study is conceived for the purpose of investigating or obtaining supporting 
evidence for a biological hypothesis. Giving time at this early stage to consider 
downstream implications will pay dividends later. It is helpful if, rather than 
simply stating the aims of the experiment, the researcher asks the question 'What 
results do I expect?* or 'what answer will validate/invalidate my hypothesis?'. 
This 'reverse-engineering' proves useful in focusing the project, assessing the 
feasibility of the work, providing early preparation for data management and 
analysis and, importantly, in managing expectations with regard to outcomes. 

A good example of careful experimental planning is demonstrated by Golub 
etal. (1999) in the classification of acute leukaemias in order to distinguish 
between acute lymphoblastic leukaemia (ALL) and acute myeloid leukaemia 
(AML). Distinguishing between ALL and AML using conventional techniques 
is known to be a difficult task. The researchers maximised their probability of 
success by choosing an easier, more defined model (normal kidney vs. renal 
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cell carcinoma), on which to validate their analytical methods. In doing so they 
established that their techniques were suitable for classifying tissues according 
to disease and gained confidence in their approach before using the samples of 
real interest. 

Optimisation Experiments 

Although microarrays are becoming increasingly accessible to all, using these 
tools requires experience and it is unlikely that successful experiments will be 
conducted immediately. It is usual that some time is given to optimising a sys- 
tem for any specific application, for example for a given tissue or cell type. 
Additionally, the requirements for a given system may warrant some modifica- 
tions. The standard approach for a scientist to take is to vary one parameter, 
whilst keeping all others constant. This is time-consuming and does not take into 
account the interactions between different factors. Well-designed, multifactorial 
experiments (Box et aL 9 1978), provide a faster route for optimisation, with a 
statistical measure of confidence. An example of this technique is in the opti- 
misation of microarray experimental conditions for preparation of fluorescent 
probes from rat liver tissue (Wildsmith et a/., 2001). When a major source of 
variation is revealed this can be investigated further with a view to minimising 
it or providing sufficient replicates to account for it. 

Design of Experiments 

Once confidence in the experimental procedure has been obtained the researcher 
is likely to have gained an insight into the reproducibility of the system. This 
assists in the design of the experiments, in particular in determining the mini- 
mal number of replicates necessary. Replication can be implemented at many 
stages - from biological samples through to microarray slides. 

Owing to the enzyme-catalysed transcription reactions, a large amount of 
variation occurs during the probe-making stages in microarray experiments. Our 
work indicated that replicates should be made at this step and a minimum of six 
replicate probes are made for microarray experiments (Wildsmith et a/., 2001). 
These can be pooled or hybridised separately onto six microarray slides. 

Lee et al. (2000) have examined the effect of the different location of cDNA 
spots on the glass slides and concluded that replicates are essential to provide 
meaningful data and to enable reliable inferences to be drawn. 

With regard to commercially available gene chip systems, such as that avail- 
able from Affymetrix (see Figure 13.4), the variation between chips, within a 
batch, is likely to be low due to stringent quality control and highly automated 
manufacture. The use of an automated wash station also reduces variability in 
intensities between chips. However, using a multi-step approach in the probe 
preparation and subsequent antibody binding steps may lead to variation between 
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Figure 13.4 The GeneChip® Instrument System. From left to right, the hybridisation sta- 
tion, scanner and workstation. Image courtesy of Affymetrix. A colour version of this figure 
appears in the colour plate section 



replicate, samples prepared on different days. Pooling of reagents within an 
experiment, and analysing controls together with treated samples, will both 
reduce the variability within a given experiment. 

Conduct of Experiment 

At this stage some attention may be required for verifying and validating pro- 
cesses. For example, checking that the imaging instruments give consistent 
results across the slide, on repeat use and from day to day. If two imagers 
are used it is important to verify that the results from both machines are com- 
parable. Some laboratories read fluorescence of one channel and then adjust the 
laser intensity of the second channel in order to obtain comparable readings. 
This is a method of normalising for the difference in intensities of fluorophores. 
It is important to be aware that this approach has a number of drawbacks. The 
arbitrary value of the second laser intensity setting will vary from experiment 
to experiment; thus comparisons of this channel cannot be made across experi- 
ments. Also the response of the fluorophore may not be linear across the laser 
intensity settings and this can lead to additional errors. 

Another area for investigation prior to running the study itself is the image 
analysis component. Depending on the software used, the image analysis pack- 
age may process the data to some extent, for example automatic background 
subtraction. Full understanding of the software is required so that it is clear at 
what point the data are 'raw', and the extent of inherent, inseparable manip- 
ulation. Effort may be required to determine the optimum settings for any 
software parameters. 

As data are generated it is important to be aware of the data integrity - for 
example ensuring that all data are collected, so that there are no missing data that 
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Figure 13.5 The relative fluorescence units (RFUs) of 1248 genes on seven microar- 
ray slides that were, hybridised with cDNA made from, the liver of -a .rat. treated with 
acetaminophen. Note the gene outliers at approximately gene number 480 

can complicate analysis later. The researcher may be intuitively aware of any 
spurious results and should be alert for anything extraordinary that could indi- 
cate problems, for example hybridisation intensities appearing inconsistent from 
sample to sample. Data analysis at this point can be a rapid indicator of dubious 
results. For example, Figure 13.5 shows a plot of the fluorescent intensities of 
1248 genes that were hybridised with probe derived from acetaminophen-treated 
rat liver tissue. The data appear consistent, with the exception of peaks in inten- 
sity on one slide at around gene 4800. Further investigation of the microarray 
revealed a large artefact that had been missed by the image analysis process 
(Figure 13.6). 

Raw Data Generation and Storage 

One issue that arises when carrying out microarray analyses is how much data 
to store and in what form. For Good Laboratory Practice (GLP) purposes, often 
required in industry, storage of the raw data is necessary. This could be construed 
as the microarray image. Storing the image analysis results requires far less 
storage space and is easier to visualise, but it has the drawback that image 
analysis cannot be redone should superior software be available in the future. 
In reality, the methods used for microarrays are continually changing and the 
.likelihood of revisiting old images on which the analysis has been performed, 
using outdated protocols is quite small. 



278 GENE EXPRESSION ANALYSIS USING MICROARRAYS 



p ■ • > ■ O O 

» "O ' Oi.j i*0.; • ■ 

;w : ...... O ■ G O 



Figure 13.6 Portion of scanned image showing region where artefact occurred that caused 
very high signals, which were classed as outliers 



Pre-processing 

A number of pre-processing steps are often used in microarray analysis. These 
include filtering, log transformation, normalisation and background subtraction. 
Filtering may be used before or after transformation in order to extract data 
from preferred regions of interest, or in order to remove outliers (see the 
above example relating to image analysis artefact). One example of filtering 
is the removal of individual gene replicates that lie outside a given number (for 
example 5) of standard deviations from the mean. Alternatively, data points that 
lie in the top/bottom few percentiles (e.g. 0.1%) of the data can be removed. This 
method of removing outliers is also called 'trimming*. It is acceptable if there 
is a large volume of data where only a small proportion of data is removed and 
if the same method is applied consistently across all data. Care must be taken 
in the way in which this is carried out in order not to delete genuine data. For 
example, if one gene is consistently high or low in expression across replicates, 
then it is unlikely to be an outlier. 

Another method of detecting outliers is to plot all genes (see the section 
'Conduct of experiment 1 above) or to perform PCA analysis (see the section 
'Multivariate analysis' below) to detect replicate outliers. The use of a PCA 
plot to detect outliers is shown in Figure 13.7. 

Log transformation of data is* accepted universally because the fluorescence 
data that are generated from microarrays tend to be skewed towards lower val- 
ues. There are scientifically valid reasons why ratios of raw expression values 
should not be used (Nadon and Shoemaker, 2002). When using two-colour 



DATA ANALYSIS 279 



1 

0.5 
0 

<0 
CO 

d -1 

-1.5 
= ■-2.1 



x10 7 


Score plot: class2 




i 


i i.i I r 












40 












6 












1 J-. 1 1 l 





-1 -0.5 0 0.5 
PC1-47.3738% 



1.5 
x10 7 



Figure 13.7 PCA plot of data used in Figure 13.4 showing one microarray (number 2) as 
an outlier 



hybridisations it is common to express the ratio of treated to control as a loga- 
rithm in base 2 (Quackenbush, 2001). Thus genes up-regulated by a factor of 2 
have a log 2 (ratio) of 1, and genes down-regulated by a factor of 2 have a log 2 
(ratio) of — 1. 

Normalisation and background subtraction techniques are methods of data 
manipulation and their use is more subjective and often debated. The purpose of 
these techniques is to reduce the error (variability) that occurs between replicates 
and thus enable a comparison of data across samples. 

The theory behind background subtraction is that during hybridisation there 
will be non-specific binding to the slide. This will effectively 'darken* the 
image and give falsely high readings of fluorescent intensity. Correcting for 
the non-specific hybridisation should reduce error due to background staining. 
Background subtraction often occurs automatically in microarray image analy- 
sis packages. The software may circle the spot of interest and use the region 
beyond the periphery as the measurement of background. In cases of uneven 
hybridisation this method enables locally high background to be subtracted on 
a regional basis. One criticism of this approach is that the slide surface beyond 
the periphery is not similar, in chemical terms, to that where the nucleic acid 
has been deposited, and therefore cannot act as a real control for non-specific 
binding. A more accurate measurement of non-specific binding can be gained 
from using a region where spotting chemicals have been deposited, but no target 
is present. This concept is the basis of a method using local 4 blank spots' (Wu 
etaL, 2001). 

A number of methods exist for normalisation of data. These include normal- 
ising to total signal or to a 'known' spot or gene, standardisation, or proprietary 
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methods. Normalising to total signal is the simplest approach, whereby the gene 
intensity is expressed as a percentage or proportion of the signal intensity for the 
entire array. This method works best when the total intensities for the microar- 
rays are similar and the number of changes is small compared with the number 
of genes. However, we often find, when using arrays of around 1000 genes, that 
pathological disease can up-regulate a large number of genes simultaneously. In 
this case, when total signal normalisation is applied, highly up-regulated genes 
will appear less up-regulated and genes that do not change from the control will 
appear down-regulated. 

Normalisation to a control value is a more popular technique. A control value 
can be obtained from using a gene known to remain constant under the condi- 
tions of the experiment. DeRisi et ai (1997) used a panel of 90 housekeeping 
genes for. normalisation, but found considerable variation in their gene expres- 
sion. Unfortunately it is very difficult to know with certainty that a gene will 
not change and there is evidence to suggest that so-called 'housekeeping genes' 
are variable (Savonet et ai t 1997). Other control genes can be derived from an 
alternative species; these should not be expected to hybridise. We have used 
yeast and Arabidopsis genes as negative controls for hybridisations of rat tis- 
sues. No orthologs were known to the genes selected; however, in , most cases 
non-specific binding occurred. 

If two or more microarray replicates appear to be different, but they are 
expected to be the same, then they can be standardised. An example of this 
might occur if the total intensity of one microarray is greater than another, but 
the genes are proportionally equally up- or down-regulated. If the microarray 
sample spot data are assumed to be drawn from a normal distribution, then the *z- 
transform' can be used. This requires that the mean and the standard deviation of 
the intensity values for each microarray are determined. The mean is subtracted 
from each individual gene value and the remainder is divided by the standard 
deviation. The intensity values from each microarray will then have the same 
mean and standard deviation. This has the advantage of facilitating comparison 
of microarrays with different dynamic ranges as well as total intensities. If the 
data are not normally distributed, then alternative non-parametric methods can 
be used, such as normalising to the median. 

Univariate Analysis 

Univariate methods of analysis involve examining one variable, or gene, at a 
time. This can be a very laborious task when examining a large volume of 
data, yet it is the preferred method of biologists. The simplest technique is to 
compare control and treated values and express the result as a 'fold-change' 
ratio. Typically, when examining small volumes of data, fold changes greater 
than 2 and less than 0.5 are considered meaningful (Quackenbush, 2001). This 
cut-off is essentially arbitrary and has the distinct drawback that microarray data 
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are not homoscedastic; that is, there is more variation about the mean at low 
values than there is at high values (Draghici, 2002). 

A second method for finding up- or down-regulated genes uses the standard 
deviation (SD) of the replicate gene data. Thus if changes greater than, say, 2 
SD from the log mean ratio are considerably greater than changes associated 
with 'noise \ then they are considered significant. This technique means that 
when looking at a large number of genes that are normally distributed there will 
be up- and down-regulated genes, regardless of whether there are (biological) 
changes (Draghici, 2002). 

Rather than using arbitrary cut-off values it is far more meaningful to express 
the fold-change in terms of either confidence intervals, or a 'p-value* (that 
is, the probability of the value occurring by chance). Thus, a fold-change of 
1.1 may be associated with a p-value of 0.001 and thus the probability that 
the gene is not up-regulated is 1 in 1000. Naturally, such small fold-changes 
may then be queried in terms of biological significance. One must then ask 
the question: Are large fold-changes more important (biologically) than small 
ones? Simple calculation of p-values for two data sets can be obtained using 
/-test functions in standard spreadsheet software. A number of replicates are 
necessary" for this approach, and the data must be normally distributed. We 
have recently developed a method for calculating p-values for fold-changes that 
is not influenced by the distribution of the data or outliers and applied it to 
TaqMan™ and microarray data. Other complex and computationally intensive 
methods for calculating p-values are described in Draghici (2002) and Nadon 
and Shoemaker (2002). 

Multivariate Analysis 

Multivariate analysis of gene expression data is becoming increasingly popu- 
lar in the microarray community and in other biological domains where large 
volumes of data are generated. Multivariate analysis methods include princi- 
pal component analysis (PCA), factor analysis, multivariate analysis of variance 
(MANOVA) and cluster analysis. Currently, cluster analysis is the most widely- 
used method in the microarray community but PCA is growing in popularity 
(Crescenzi and Giuliani, 2001; Konu et a/., 2001). 

Quackenbush (2001) provides a good review of clustering tools that is rather 
unique in the regard that different clustering algorithms and linkage methods 
are presented. Clustering methods are unsupervised, and they are powerful tools 
for gaining insight into huge data sets. They enable the data to be partitioned 
in order to facilitate interpretation; however, they do suffer from subjectiv- 
ity. This is because the user selects various parameters, such as the algorithm 
used, linkage type, distance metric and, sometimes, cluster size. Whatever the 
data, clusters will always be identified, thus there is also a tendency to over- 
interpret the data - trying to attach meaning to clusters that may have no bio- 
logical relevance. 
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Software available for cluster analysis includes Cluster, the output of which 
is viewed in Treeview; both available from http://rana.stanford.edu/ 
software. This tool is particularly useful for clustering genes to identify genes 
that are co-regulated. 

The PCA is a visualisation tool that enables complex, high-dimensional data 
to be represented in two or three dimensions. It facilitates identification of groups 
of similar data, thus enabling inferences to be made about the samples. 

An example is shown in Figure 13.8. The figure shows the gene data for one 
sample (control rat liver) that was hybridised to seven microarrays according 
to the method used in Wildsmith et aL (2001). Each microarray contained two 
replicate gene sets; thus there were 14 replicate gene sets in total. The gene sets 
comprised 1248 genes (with controls). All the data (14 x 1248 data points) was 
input into the analysis and the PCA plot displays the 14 replicates individually. 
The two axes are principal components 1 and 2. Principal component 1 (PCI) 
accounts for 65.5% of the variation in the data, whereas PC2 represents only 
13%. This means that the model accounts for 78.5% of variation in the data. 

The first principal component (PCI) accounts for as much as possible of 
the variation in the.original data and subsequent components (e.g. PC2) are of 
decreasing importance. Thus, samples 8 and 9 are -very different from samples 1 
and 2. In terms of interpreting the PCA plot, it is immediately clear that there are 
three or four distinct clusters of data. These are marked by circles. Datapoints 
tend to cluster in pairs; for example replicates 1 and 2, 3 and 4, 5 and 6, 
etc. These are the duplicate gene sets on the same microarray. This indicates 
that the variation within the microarray is lower than the variation between 



Figure 13.8 PCA plot of the microairay results from seven slides (2690, 2692, 2723, 2869, 
2876, 2879, 2880), each with two replicate spot sets (labelled 1-14), after hybridisation with 
control rat liver. See text for explanation 
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replicate microarrays. However, given that the same sample is applied to all 
the microarrays, we must ask why we get further separation of the replicates. 
The answer lies in the associated data. The four-digit numbers associated with 
the clusters are the microarray slide numbers that indicate when they were 
printed. Numbers that are more similar seem to be more closely related, and thus 
we can hypothesise that there were some differences between the slides, such 
as differences in slide backgrounds or changes during the printing process or 
change-over of batches, between slides in the 2600-2700 region and the 2800s. 

The PCA provides a clearer overview of the data than does cluster analy- 
sis. It is a rapid method for gaining an insight into the results, in particular 
where biological meaning can be attached to the components (Crescenzi et al. y 
2001). There are a number of packages for multivariate data analysis, including 
SIMCA-P (Umetrics, AB, Umea, Sweden) and The Unscrambler (Camo ASA, 
Norway), both of which are useful for PCA. 

Other tools for data visualisation include software packages such as Spot- 
fire.net (Spotfire Inc., Cambridge, MA, USA) and GeneSpring (Silicon Graphics, 
San Carlos, CA, USA). Spotfire is particularly useful for visualisation of mul- 
tidimensional data and for visualisation of temporal data. It is possible to use 
these tools to identify genes Giat are co-ordinately expressed over time. 

Biological Interpretation 

After developing a sound experimental strategy, ensuring that the results are 
statistically valid, and after analysis of the data, it is down to the biologist to 
assemble the pieces of information that have been obtained. This intertwined 
information may include unexpected results that are contradictory to intuition or 
to published literature. One way to untangle the data is to map the relevant genes 
onto existing pathways and known functions. The Kyoto Encyclopedia of Genes 
and Genomes (KEGG), available at http://www.genome.ad.jp/kegg/, is a 
. useful source of information, especially where the gene products are enzymes. It 
enables visualisation of the position of up- or down-regulated genes in metabolic 
pathways. 

The gene expression data obtained may differ from protein expression data, 
or information on gene product activity or location. When initiating a study 
it is useful to consider additional endpoints that can assist in the interpreta- 
tion of the data. For in vitro studies, these might include cytotoxicity endpoints, 
metabolites, key signalling molecules or perhaps protein expression. Waring 
et al. (2001) used tetrazolium dye reduction (MTT) as a measure of hepatocyte 
cell viability for their studies of gene expression in response to hepatotoxic 
insult. For in vivo studies, expression information on the tissue of interest could 
be supported by pathology, histology and blood chemistry measurements. Gene 
expression results could be confirmed by in situ hybridisations or protein activity 
assays. 
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13.4 Recent examples of microarray applications 

One area of rapid progress using microarray technology is the increased under- 
standing of cancer. Molecular pathologists are subgrouping cancers of tissues 
such as blood, skin and breast, based on differential gene expression patterns. 
For example, within a small group of breast cancer tissue samples, Perou ex al. 
(2000) distinguished two broad subgroups representing those expressing or alter- 
natively lacking expression of the oestrogen receptor-ot gene. The work was not 
conclusive, but never has progress in this field been so rapid when compared 
with the previous methods of gene identification. 

Another example of the impact of this technology is in the identification of 
two biomarkers for prostate cancer, namely hepsin and PIM1 (Dhanasekaran 
era/., 2001). 

Microarray technology has also accelerated the understanding of the molecu- 
lar events surrounding pulmonary fibrosis. Specifically, two distinct clusters of 
genes associated with inflammation and fibrosis have been identified in a dis- 
ease where, for years, the pathogenesis and treatment have remained unknown 
(Katsuma etal, 2001). 

13.5 Conclusions 

Important factors in gene expression experiments include sensitivity, precision 
and reproducibility in the measurement of specific mRNA sequences (Schmittgen 
et al, 2000). These quality metrics can be maximised by using, or fabricat- 
ing, high-quality microarrays, and by optimising each step of the microarray 
process. From conception to conclusion it is important to bear in mind the 
original hypothesis. 

Having considered the complexity of the microarray experiment, the value 
obtained from a meticulously designed experiment should not be underesti- 
mated. As the number of high-quality gene expression studies increases, we hope 
that the literature will contain increasingly detailed information that will help 
interpret complex gene expression changes, and thus elucidate the mechanisms 
of disease. 
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13.9 Useful websites 

Protocols: 

ht tp : / /cmgm . Stanford . edu/pbrown 

http : / /cmgm . stanf ord. edu/pbrown/protocols . html 

Image analysis software: 

http://www.genome.ad.jp/kegg/ 
http : / /rsb . info . nih . gov/nih - image 
http r //rana . stanf ord. edu/sof tware 

http : //www . nhgri . nih . gov/DIR/LCG/15K/HTML/img_analysis . html 
http : //rana . Stanford . EDU/sof tware 
http//www. tigr . org/sof tlab 
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We have characterized comprehensive transcript and 
proteomic profiles of cell lines corresponding to normal 
breast (MCF10A), noninvasive breast cancer (MCF7) and 
invasive breast cancer (MDA-MB-231). The transcript 
profiles were first analysed by a modified protocol for 
representational difference analysis (RDA) of cDNAs 
between MCF7 and MDA-MB-231 cells. The majority of 
, genes identified by RDA showed nearly complete con- 
cordance with microarray results, and also led to the 
identification of some differentially expressed genes such 
as lysyl oxidase, copper transporter ATP7A, EphB6, 
RUNX2 and a variant of RUNX2. The altered transcripts 
identified by microarray analysis were involved in cell-cell 
or cell-matrix interaction, Rho signaling, calcium home- 
ostasis and copper-binding/sensitive activities. A set of 
nine genes that included GPCR11, cadherin 11, annexin 
Al, vimcntin, lactate dehydrogenase B (upregulated in 
MDA-MB-231) and GREB1, S100A8, amyloid $ pre- 
cursor protein, claudin 3 and cadherin 1 (downregulated in 
MDA-MB-231) were sufficient to distinguish MDA-MB- 
231 from MCF7 cells. The downregulation of a set of 
transcripts for proteins involved in cell-cell interaction 
indicated these transcripts as potential markers for 
invasiveness that can be detected by methylation-specific 
PCR. The proteomic profiles indicated altered abundance 
of fewer proteins as compared to transcript profiles. 
Antisense knockdown of selected transcripts led to 
inhibition of cell proliferation that was accompanied by 
altered proteomic profiles. The proteomic profiles of 
antisense transfectants suggest the involvement of pepti- 
dyl-prolyl isomerase, Raf kinase inhibitor and 80 k Da 
protein kinase C substrate in mediating the inhibition of 
cell proliferation. 

Oncogene (2006) 25, 2328-2338. doi:10.1038/sj.onc.l209265; 
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Introduction 

The transformation of a normal cell into a cancer cell 
has been correlated to altered expression of a variety of 
genes (Perou et al, 2000; Becker et al, 2005). The 
expression of some of these genes is a direct result of 
sequence mutation, whereas other changes occur due to 
alterations in gene products that participate in specific 
pathways. The changes in gene expression have been 
routinely characterized by classical subtraction hybridi- 
zation and differential display approaches (Cerosaletti 
et al, 1995; Alpan et al, 1996). With the availability 
of the human genome sequence and sequences for a 
number of other model organisms, traditional methods 
have largely been replaced by gene microarrays 
(Khan et al, 2001). These analyses have been used to 
characterize the molecular basis of a variety of diseases 
including cancer. A comprehensive analysis of a large 
number of cancer cell lines allowed clustering of genes 
into groups based on their expression patterns in 
phenotypically related cell lines (Khan et al, 2001; 
Dan et al, 2002; Rosenwald et al, 2002; van't Veer 
et al, 2002). The results of profiling experiments 
indicated expression of specific gene clusters in cell lines 
that have the same origin or have arisen from the same 
organ (Ross et al, 2000). A complementary approach 
that has been used in limited ways is proteomics. 
Proteomics scores for changes in different proteins 
and peptides in cells with characteristic pheno- 
typic differences. However, a comparative analysis of 
transcripts and proteins to establish a relationship 
between transcript changes and protein levels has not 
yet become routine. > 

Although expression profiling of tumor tissue and its* 
comparison with normal tissue, in principle, is most 
appropriate to obtain the genetic signatures of a tumor 
type, such comparisons have not been free of attendant 
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complications. These complications arise due to hetero- 
geneity of tumor specimens wherein any cell type- 
specific changes are likely to be masked by other cell 
types that constitute the tumor specimen. For this 
reason, well-characterized cell lines established 
from tumor tissue may prove more informative and 
have been considered useful by cancer researchers. 
Comparing gene profiles between cell lines has the 
potential to reveal genes that could be causative for 
the phenotype and other genes that can serve as tumor 
biomarkers. 

Our investigations are aimed at designating a subset 
of transcripts that could distinguish a normal breast cell 
from a breast cancer cell and help to predict tumorigenic 
or metastatic potential of a transformed cell. We 
describe here transcript and proteomic profiles of a 
normal breast cell line, a tumorigenic but noninvasive 
breast carcinoma cell line and an invasive breast 
carcinoma cell line, and summarize them as a set of 
candidate biomarkers or targets for therapeutic inter- 
vention. The comparison of transcript profiles with 
proteomic profiles demonstrated that altered proteins 
were not always represented in the microarray desig- 
nated profiles and vice versa. Furthermore, we have 
targeted five transcripts that were upregulated in MCF7 
cells for investigating their role in cell proliferation 
pathways. The proteomic profiles have revealed that 
inhibition of cell proliferation by antisense knockdown 
was mediated by a specific set of proteins. 



Results 

Representational difference analysis 
As described in the Materials and methods section, 
RDA was performed by using cDNAs from MCF7 and 
MDA-MB-23 1 as tester/driver or driver/tester combina- 
tion. The difference product in the first case represents 
the genes that are either upregulated in or specific to 
MCF7. On the other hand, the difference product of 
MDA-MB-23 1 (tester) and MCF7 (driver) hybridiza- 
tion resulted in the isolation of cDNAs that are either 
upregulated in or specific to MDA-MB-23 3. The initial 
linkers used in this protocol had internal Bglll sites. 
One strand of the linker was used to amplify both the 
tester and driver cDNAs after linkers had been ligated 
to cDNAs. After removal of linkers from amplified 
cDNAs by digestion with BglU, a dephosphorylated 
BgM adaptor was ligated to tester DNA. The BgUl 
adaptor had an internal EcoKl site. The difference 
product was digested with EcoKL and cloned in pBlue- 
Script vector. The cloning efficiency of the difference 
product was very low (5 x 10 4 c.f.u.//zg of DNA). The 
low efficiency of cloning is attributed to a substantial 
fraction of amplified DNA product that is refractory to 
restriction digestion. The sequencing of a set of 100 
clones each from the difference libraries revealed 50 
different kinds of clones. The majority of these 
sequences were short fragments and represented either 
3' regions or internal fragments of transcripts. A 
summary of these clones is presented in Table 1. The 
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involvement of the majority of these cDNAs is well 
characterized either in tumorigenesis or in metastasis. 
The phenotypic characteristics of MCF-7 and MDA- 
MB-23 1 ideally match with the biological significance of 
these genes. The alterations in transcripts for Rho 
signaling proteins, Ca 2 + binding/requiring proteins, 
tight junctions/anchoring junctions/gap junctions, cop- 
per binding or sensitive proteins, and RUNX2 are 
particularly noteworthy. 

The differential expression of a representative number 
of RDA clones was validated by semiquantitative PCR. . 
As shown in Figure 1, these transcripts were either 
specific to or upregulated in the cell line that was used as 
a tester. Such analyses demonstrated that more than 
90% of the clones were differentially expressed. The 
abundance of transcripts and the results of RT-PCR 
were also confirmed by Northern blotting (Figure 2). 
The pattern of hybridization clearly indicates that all 
these transcripts showed differential expression in 
MCF7 and MDA-MB-23 1 cells that were used as 
driver/tester combinations for the RDA. 



Gene microarrays 

After obtaining preliminary molecular signatures of 
MCF7 and MDA-MB-23 1 cells by RDA, we used 
Affymetrix gene micoarrays to expand the above 
analysis to identify a comprehensive set of transcripts 
that is deregulated in invasive breast carcinoma cells. 
The comparisons of cell lines on the basis of transcripts 
that are either present or absent as shown in Figure 3 
revealed that a set of .123 genes distinguishes MDA-MB- 
23 1 cells from MCF7 and MCF10A. These genes can be 
classified by their involvement in functional classes such 
as transcription, signal transduction, cell adhesion, cell 
cycle, metabolism, transport, response genes and devel- 
opment (Figure 4). The majority of these genes 
participated in the process of signal transduction 
followed by transcription, cell adhesion and metabo- 
lism, respectively. A few transcripts in these classes were 
tested by real-time RT-PCR to confirm their altered 
abundance. The selected transcripts showed changes 
ranging between two- and 10-fold, 11- and 20-fold and 
greater than 20-fold, and were in close agreement with 
the results of microarray analysis. The qualitative 
pattern of change observed in microarrays analysis 
was readily reproduced by real-time or semiquantitative 
RT-PCR for all transcripts tested. 

The number of altered transcripts was over 1000 
based on a change of twofold or greater, and a majority 
of these genes show changes varying between two- and 
fourfold (Figure 5). Interestingly, with all comparisons 
combined, there were 21 genes downregulated more 
than 50-fold and 55 genes that were upregulated more 
than 18-fold when specific cell line pairs were compared 
(Figure 5). The transcripts that represent the extremes of 
upregulated arid downregulated scale can allow distinc- 
tion between MCF7 and MDA-MB-23 1 cells. These 
transcripts include GPCR11, cadherin 1 1, annexin 
A'j\ vimentin, lactate dehydrogenase B (upregulated 
in MDA-MB-23 1) and GREB1, S100A8, amyloid 
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Table 1 Differentially expressed transcripts identified by RDA 



Upregulated in MDA-MB-231 



Downregulated in MDA-MB-231 



Extracellular matrix /matrix- cross linking proteins 
LOX 

Laminin /? 1 
Collagen VI-a 1 

Calcium-binding proteins 
Reticulocalbin 1 
S100A8 
Cullin 5 

Transcription factors/promoter-binding proteins 

RUNX2 

c-Jun 
. Fra-I 

Cell-cell adhesion/ cell-surface receptor proteins: 

Cdhll 
CYR61 

MHC class II antigen y chain 
Pro tease-activated receptor- 1 
Protease-activated receptor-4 



ATPaseJGTPase and signal transduction proteins 
ATP7a 
Caveolin 2 

AXL receptor tyrosine kinase 
Rho GEF 3 
Rho/Rac GEF 18 
P21-Rac2 

Met a I lop rot eases and MMP inhibitor proteins 
TIMP-2 
MT1-MMP 

Stress-response proteins 

Dual specificity phosphatase (DUSP) 

Cytoskeletal component and binding proteins 
Moesin 
Vimentin 
Filamin B 

Cell-cycle regulation and growth) differentiation! apoptosis proteins 
Cyciin B 1 
Cyclin E 
Cyciin A2 
Bcl2-like 1 protein 

Secreted proteins and growth factors 
Milk fat globule protein 
TGF-ot 

SMAD-specific E3 ub ligase 2 

Miscellaneous 
Prion protein 



Calcium-binding proteins 
Calgranulin B 

Transcription factors/promoter-bindirtg proteins: 
Chromosome 4 ORF 
Estrogen receptor 1 
RUNX2 variant (exon 8 deleted) 

Cell-cell adhesion f cell-surf ace receptor proteins 
Claudin 3 

Amyloid /? precursor protein 
Triose phosphate isomerase 
Plakoglobin 
Cdh I 
Cdh 3 

Annexin A9 
RAR-a 
Connexin 31 

ATPasejGTPase and signal transduction proteins 

RhoD 

RhoB 

TGF-£ Rl 
Stress-response proteins 

Protein kinase H 1 1 

Cytoskeletal component and binding proteins 
Keratin 18 
Tubulin 8 1 

Micro tubule- associated protein t 

Cell-cycle regulation and growth /differ en tiaiionjapoptosis proteins 
S100A13 
S100C 

Aurora kinase AIFC2 
Nucleosidediphosphate kinase 

Secreted proteins and growth factors 

Trefoil factor 3 (TFF3) 

Trefoil factor 1 (TFF1) 

Four and a half LIM domain 1 

Solute carrier family 16 SLC16A6 
DNA replication 

DNA replication complex GINS-PSF2 
Miscellaneous 

Serine protease inhibitor type 1 (SPINTI) 

Human homolog of Xenopus protein XAG 

Hypothetical protein FLJ22222 

Hypothetical protein 20171 

Hypothetical protein MGC3265 



precursor protein, claudin 3 and cadherin 1 (down- 
regulated in MDA-MB-231). The distinction between 
MCF7 and MCF10A may be made based on keratin 19, 
serine protease, amyloid /? precursor, neuropeptide Y 
receptor Yl (upregulated in MCF7) and caldesmon, 
annexin ;A1 3 epithelial membrane protein 1, S100A2, 



keratin 15 (downregulated in MCF7). Likewise, 
MDA-MB-231 cells differ from MCF10A in vimentin, 
epithelial membrane protein 3, cadherin 1 1, GPCR 
116, collagen type XIII a 1, Bc]2-associated athanogene 
2 (upregulated in MDA-MB-231) and keratin 15, 
cystatin A, cadherin 1, CD24, calcium-activated chlor- 
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ide channel, SI OOP, GPCR 87 (downregulated in 
MDA-MB-231). Thus, a small subset of transcripts 
may serve as an accurate signature of these cell lines. 
Several of these gene products have been shown to 
participate in tumorigenesis and invasiveness of breast 
carcinoma cell lines. 

The invasiveness phenotype of MDA-MB-231 cells 
specifically relates to changes in the following functional 
classes: (a) cell adhesion molecules, (b) Ca 2+ requiring, 
Ca 2+ binding or Ca 2+ regulatory genes, (c) copper- 
sensitive or copper-transporting proteins and (d) specific 
regulatory proteins of Rho signaling. Among these 
functional groups, 23 transcripts involved in cell-cell 
or cell-matrix interactions are underexpressed in MDA- 



MB-231 cells and 21 transcripts were overexpressed in 
this cell line (Table 2).. The comparison of Ca 2+ - 
requiring/binding genes indicated downregulation of 
26 transcripts and upregulation of 26 transcripts in 
invasive cells as compared to noninvasive cells (Table 3). 
While Ca 2+ homeostasis is extensively investigated in 
human cancers, copper homeostasis is an underexplored 
area. The alterations in copper homeostasis in breast 
carcinoma cells were reflected by changes in trans- 
cripts corresponding to a variety of copper-binding or 
copper-sensitive proteins/enzymes (Tables 1 and 4). The 
deregulation of Rho signaling was evident from changes 
in various proteins involved in this pathway (Table 4). 
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Figure 1 Semiquantitative evaluation of selected transcripts, (a) 
RNA was isolated from confluent culture dishes containing 
MCF10A (lanes 1 and 4), MCF7 (Janes 2 and 5) or MDA-MB- 
231 (lanes 3 and 6) cells. The amount of RNA was first determined 
spectrophometrically. Equal amounts of RNA, as determined by 
absorbance at 260 nm, were amplified with primers specific to actin 
gene for 18 cycles (lanes 1-3) or 20 cycles (lanes 4-6). The lane 
containing size markers is labeled as M. (b) A set of primers 
corresponding to caveolin 2 (lanes 1 and 2), TGF-oc (lanes 3 and 4), 
Moesin (lanes 5 and 6), LOX (lanes 7 and 8), Axl receptor (lanes 9 
and 10), RhoD (lanes 11 and 12), S100A13 (lanes 13 and 14), TFF3 
(lanes 15 and 16) and Claudin 3 (lanes 17 and 18) were amplified 
for 32 cycles. Lanes 1, 3, 5, 7, 9, 12, 14, 16 and 18 represent 
amplified products from MCF7 and lanes 2, 4, 6, 8, 10, 11, 13, 15 
and 17 represent MDA-MB-231 cells. The lanes containing PCR 
products from MDA-MB-231 cells are marked with an asterisk. 
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Figure 2 Analysis of selected transcripts by Northern hybridiza- 
tion. The probes specific to CD74 (a), CYR61 (b), SPINT1 (c) and 
DUSP (d) were labeled with a 3Z P nucleotide and hybridized to 
blots containing RNA from MCF10A (lane 1), MCF7 (lane 2) 
and MDA-MB-231 (lane 3). The blots were washed stringently and 
developed as described. The amounts of RNA loaded were 
normalized as in Figure 1 . 
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Figure 3 Comparison of cell lines based on the presence or absence of transcripts. The absence or presence of a transcript in the 
Affymetrix chip was scored by the fluorescence read-out as described in the Materials and methods section. 
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Gene Ontology 



03 Absent in MCF7 - Present in MDAMB231 & MCF10A ■Present in MCF7 - Absent in MDAMB231 & MCF10A 
; Present in MDAMB231 - Absent in MCF10A & MCF7 □ Absent in MDAMB231 - Present in MCF10A & MCF7 

Figure 4 Functional classes of transcripts that differentiate a cell line pair. The transcripts identified as present or absent were 
classified based on their functional importance. 



Gene Expression Levels 




Downregulated 



Upregulated 



No. of Folds 



| □ MCF7 vs 1 0A □ MDAMB231 vs 1 0A ■ MP AMB231 vs MCF7 1 

Figure 5 Distribution of altered number of transcripts as a 
function of fold change. The altered transcripts were categorized in 
groups based on the magnitude of change in their abundance. 



Table 2 Cell adhesion molecules altered in breast carcinoma cells 
Upregulated in MDA-MB-23] Downregulated in MDA-MB-231 



Cadherin 4 
Cadherin 11 
Catenin 
Integrin ot 6 

Transmembrane anchor protein 

Eph B2 

DystoHm 

Laminin /? 1 

Lam in 

Filamin B 

Filamin C 

Tail in 1 

Butyrophilin 

Spectrin-a 

Spectrin-/? 

Thrombospondin 

Plastin 3 

Adducin 3 y 

Lamin B receptor 

Laminin p 2 

Lamin A/C 



CIaudin3 
Cadherin- 1 
Cadherin- 3 
Cadherin- 18 

Cadherin, LAG seven pass receptor, 

Down syndrome cell adhesion 

Catenin-(52 

Eph A4 

Ephrin A4 

Annexin A9 

Ankyrin 3 

Sarcoglycan 

Keratin 8 

MAP-7 

MAP-t 

Plakoglobin 

Plakophilin 

Discoidin domain receptor 
Zona occludens 3 
Periplakin 
Protocadherin <x 9 
Laminin y 2 
Laminin a 3 



Proteome analysis 

To identify altered abundance of proteins and relate it to 
transcript profiles, we characterized the protein profiles 
of MCF-10A, MCF-7 and MDA-MB-231 cells. Typi- 
cally, > 300 protein spots could be visualized in silver- 
stained gels, and there were far fewer, protein spots in 
gels that were stained with Coomassie blue. The 
comparison of MCF7 or MDA-MB-231 proteins with 
MCF-10A revealed that MCF-7 had li unique protein 
spots, while MDA-MB-231 had 15 spots that were not 
seen in MCF-10A. These proteins were either specific to 
or upregulated in these cell lines. The identity of these 
proteins is shown in Table 5. Out of these 26 protein 
spots, only 25 yielded amino-acid sequence. As shown in 
the table, the list includes proteins involved in stress 



response, protein- tagging activities, calcium-binding and 
calcium homeostasis proteins and some regulatory 
proteins. Prominent among these changes were proteins 
involved in calcium homeostasis such as crocalbin, 
calreticulin, calcyclin and reticulocalbin. The changes in 
signaling pathways between the two cell lines were 
indicated by altered levels of Rho GDP dissociation 
inhibitor 1, an apoptosis/differentiation regula- 
ting protein galectin, Myc expression regulator far 
upstream binding protein- 1 and the microtubule reg- 
ulator protein ^stathmin. The translation initiation 
factors 5A and 4H were also selectively upregulated in 
MDA-MB-231 cells. 
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Table 3 Calcium binding or sensitive transcripts 
Upregulated in MDA-MB-23J Downregukted in MDA-MB-231 
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Reticulocalbin 
Dystonin 
FolJistatin like 1 
Cullin 

Annexin A5 ■ 
EF hand domain containing-2 
Hippocalcin like-2 
LDLR 

Steroid sulfatase 
MT-actin crosslinking factor 
Inositol- 1,4,5- triphosphate 
receptor 3 
So rein 

Guanine nucleotide- binding 

protein-)! 

GAS 6 

SWP-70 protein 
Jagged 1 

EGF-containing fibrulin-like 
ECM 

Transglutaminase 2 

Thrombin receptor-like 1 

Plastin 3 

FYN oncogene 

PKC-a 

Calmegin 

Calpain 

HEG homolog 

Cyr61 



S100A8 
S100A7 
S100A13 

Tumor associated Ca 2 + 
- signal transducer 
Notch homolog 3 
PKD2 

Adenylate cyclase 
Phospholipase-C 
Chemokine Jigand 12 
Ubquitin-specific protease 
?KC-n 

Ret-pro tooncogene 

Signal peptide-CUB domain 

Mannosidase a 

Bradykinin receptor B2 

Solute carrier family 24 

Ca 2+ channel voltage-dependent 03 

Regulator of G protein signaling 17 

Dystrobevin-a 

Synaptogamin 1 

Matrix gla protein 

EF hand domain family member Dl 

PK-cyclic AMP dependent 

ATPase-Ca 2+ transporting 

Calmodulin 1 

CaM kinase 



Table 5 Proteomic profiles of MCF7 and MDA-MB-231 cells as 
compared to MCF10A cells 



MCF7 cells 



MDA-MB-231 cells 



Cell-cell adhesion/cell-surface 
receptor proteins 
Trio se-phosp hate isomerase 

Stress-response proteins 
Hsp27 

Superoxide dismutase 
Peroxiredoxin 2 



Cyioskeletal component /binding 
proteins 
Stathmin 

Cell- cycle regulation and growth/ 
differentiation/ apoptosis proteins 

Nucleoside diphosphate kinase A 

S100C 

Secreted proteins and growth factors 
Macrophage migration inhibitory 
protein 

Miscellaneous 
Cyt c oxidase VIb 
Pep tidy 1- prolyl cis- trans isomerase 
Ubiquitin 



Calcium-binding proteins 
Calcyclin 
Calreticulin 
Crocalbin 
Reticulocalbin 

Transcription Factors/ 
Promoter-binding proteins 

Far upstream element 

binding protein- 1 

Far upstream element 

binding protein-2 

Cell-celt adhesion/ cell-surface 
receptor proteins 
Galectin 

ATPase/GTPase/signal trans- 
duction/ trafficing proteins 

Rho GDP dissociation 

inhibitor 1 

Stress-response proteins 
Hsp 70 

Peroxiredoxin 2 

Cyioskeletal component/ 
binding proteins 
Stathmin 



Table 4 Altered transcripts involved in copper homeostasis and Rho 
signaling 



Copper-binding proteins Rho signaling proteins 



LOX 


Rho3 


LOX-1, LOX-2 


Rho/Rac GEF 18 


SCO cyt oxidase-deficient 


Rho GEF 12 


homolog 2 


Ras related C3 botulinum toxin 


COX 17 homolog 


substrate 2 


Metallothionein IE, IF and 2a 


. Cdc 42 effector protein 3 


Ring finger protein7 


Rho GEF 3 


Am iloride- binding protein 1 


Rho GDP dissociation inhibitor /? 


Neurotrypsin/motopsin 


RhoD 



Changes in proliferation characteristics and protein 
profiles in response to transfection with antisense 
constructs of selected transcripts 

We had observed significant upregulation of transcripts 
for DNA replication complex protein GINS PSF2, 
trefoil factor 3, aurora kinase AIK2, protein kinase HI 1 
and secreted protein XAG in MCF7 cells. We reasoned 
that antisense knockdown of the above genes in MCF7 
cells might indicate pathways involved in tumorigenesis 
and invasiveness. 

MCF7 cells were transfected with empty vector 
pCDNA3.1 or antisense constructs of the above genes. 
A semiquantitative amplification of pCDNA marker 
gene by RT-PCR confirmed the presence of the 
transfected construct in a significant proportion of the 
cell population. The transfected cells also showed a 
decrease in the target transcripts as observed by RT- 



Miscellaneous 

Heterogeneous nuclear 
ribonucleo protein H 
eIF4H (translation) 
eTF5A (translation) 



PCR. The effects of antisense transfections were scored 
by growth characteristics of the transfectants. The cell 
proliferation was reduced between 15 and 40% when 
antisense transfectants were compared to cells trans- 
fected with empty vector. 

In order to relate decreased proliferation of transfec- 
tants to altered proteins, proteomic profiles of transfec- 
tants were compared with vector controls. The 
comparison of protein profiles of cells transfected with 
empty vector or antisense construct revealed alterations 
in several proteins for each transfectant (Table 6). These 
proteins included stress-response proteins, calcium- 
regulating proteins, translation factors, ubiquitin, pro- 
teins of electron transport chain and oxidative phos- 
phorylation, signaling proteins, cytokeratins, actin and 
actin regulating proteins and general regulatory factors. 
The number of altered proteins varied between 5 and 15 
for various transfections. Peptidyl prolyl cis-trans 
isomerase, calcium-regulating proteins, SOD, galectin, 
histidine triad protein and PKC substrate were promi- 
nent among altered proteins. We performed database 
searches to identify interactors for all proteins that were 
altered in transfected cells and identified nearly '350 
proteins (data not shown). A significant number of these 
interacting proteins are involved in transcriptional 
regulation. * 
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Table 6 Altered proteins in MCF7 cells after antisense knockdown of specific transcripts 



After transfection with as- trefoil factor 3(TFF3) 



After transfection with as-protein kinase HI J 



Calcium-binding proteins 

Calmodulin 
Cell-cell adhesion/cell-surface receptor proteins 

Retinoic acid-binding protein II 
ATPase-GTPasejsignal transduction/ trafficing proteins 

Raf kinase inhibitor 
Stress-response proteins 

Hsp27 

Peroxiredoxin I 
Cytoskeletal component /binding proteins 

Cofilin-nonmuscle isoform 
Cell-cycle regulation and growth! ' differentiation/ apo ptosis proteins 

Chromobox protein homolog 1 

Chromobox protein homolog 3 

Translationally controlled tumor protein 
Miscellaneous 

40S ribosomaJ protein S12 

ATP synthase D chain 

Cancer-associated Sm-like protein 

Cyt c oxidase polypeptide Va 

eIF5A (translation) 

Ferritin heavy chain 

His triad nucleotide- bin ding protein 

Histone H2B.n 

Peptidyl-prolyl isomerase 

Proteosome subunit /J- type 6 
fc RN A- binding protein 8 A 

Thioredoxin 

Ubiquitin crossreactwe protein 

After transfection with as-aurora kinase AIK2 

Calcium- binding proteins 

Calgranulin B 

Calgranulin A 
Stress-response proteins 

Hsp27 

Superoxide dismutase 
Miscellaneous 

60S acidic ribosomal protein P2 
Peptidyl-prolyl isomerase 
Ubiquitin-cross reactive protein 



Cell-cell adhesionj cell-surface receptor proteins 
Galectin 

Retinoic acid-binding protein II 
A TPase- G TPasefsign al transduction/ trqfficing pro terns 

PFCC substrate 
Stress-response proteins 

Hsp27 

Cytoskeletal component (binding proteins 

Actin-a, skeletal muscle 

Cytokeratin 18 
Miscellaneous 

30H-acyl Co A dehydrogenase II 

ATP synthase A chain 

Cyt c oxidase peptide Va 

Enhancer of rudimentary homolog 

Thioredoxin 

UbiquinoUcyt C reductase 
No match 

After transfection with as-DNA replication complex GINS PSF2 
Calcium Binding proteins: 
Calreticulin 

ATPase-GTPase/ signal transduction/trafficing proteins 

14-3-3 n 

PKC substrate 
Cytoskeletal component (binding proteins: 

Cytokeratin 1 

Cytokeratin 18 

Cell- cycle regulation and & growth/ differentiationfapoptosis proteins 

Chromobox protein homolog 1 

Chromobox protein homolog 3 

MAP/MT affinity regulator 
Miscellaneous 

ATP synthase D chain 

Peptidyi-prolyl isomerase 

TJbiquinol-cyt c reductase 

After transfection with as-human homolog of XAG 
Calcium- b inding proteins 

Calreticulin 
Stress-response proteins 

Hsp27 
Miscellaneous 

ATP synthase A chain 

Peptidyl-prolyl isomerase 

Ubiquitin crossreactive protein 

No match 



Discussion 

The results presented here validate the gene profiles 
obtained from different expression platforms ranging 
from subtractive hybridization to gene microarrays and 
proteomic analysis. The RDA protocol is powerful 
enough to yield important genes that show significant 
alterations in their expression between cell lines, and can 
lead to isolation of full-length cDNAs by using 
appropriate modifications (Baskaran et al, 1996; Jacob 
el al, 1997). The detection of RUNX2, variant of 
RUNX2, EphB6, prion protein, lysyl oxidase and a 
copper transporter ATP7A transcripts by RDA warrant 
specific mention. RUNX transcription factors bind 
specific motifs on target gene promoters and regulate 
gene expression leading to cell growth, proliferation and 



differentiation (Pratap et al, 2003). RUNX2 and its 
variant have differential repression activity toward the 
promoter of the cyclin -dependent kinase inhibitor 
(p21 C1P1) (Westendorf et al, 2002). The loss of EphB6 
expression due to methylation of its promoter is related 
to invasiveness of MDA-MB-231 (Fox and Kandpal, 
2004; unpublished observations). Lysyl oxidase, a 
copper-sensitive enzyme, causes oxidative deamination 
of lysine and hydroxy lysines of collagen to aldehyde 
forms to stabilize collagen fibrils (Siegel, 1976) that are 
found in invasive breast carcinoma cells (Akiri et al, 

2003) . The activation of LOX is dependent on cjopper 
that is internalized and then transported to trans golgi 
network by copper transporter ATP7A (Pase efal, 

2004) , a protein mutated in Menkes disease (Moller 
et al, 2005). Prion protein has also been repbrted as a 
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chaperone for copper (Jones et al. 9 2004). Thus, EphB6 
can serve as a marker for invasiveness, and LOX and 
ATP7A may be exploited as relevant targets for 
therapeutic intervention. 

The downregulation of junctional proteins along with 
inactivation of TIMPs as shown here is in agreement 
with other reports describing their relationship with 
invasiveness of carcinoma cells (Johnson, 1991; Kousi- 
dou et al, 2004; Shao et al, 2005) and promoter 
methylation (Costa et aL, 2004). As several transcripts 
coding for junctional proteins are downregulated in 
invasive cells, we postulate that methylation-specific 
PCR can be exploited to use these transcripts as 
biomarkers of tumor cells in general and invasiveness 
in particular. The changes in cell-cell interaction 
correlate to cell phenotypes because such interactions 
influence Rho/Ras signal transduction pathways and 
vice versa (Malliri and Collard, 2003; Nagaraja and 
Kandpal, 2004; Ridley, 2004), and lend credence to the 
significance of altered transcripts for Rho and Rho 
GEFs as presented here. 

Early changes in calcium homeostasis as measured by 
calcium excretion have been reported in breast cancer 
(Campbell et al, 1983), and altered calcium signaling has 
been shown in invasive lung carcinoma cells (Amuthan 
et al, 2002). Prominent among calcium-binding proteins 
are S-100 protein, a group of intracellular messengers 
that respond to transient changes in calcium concentra- 
tion by binding to specific receptors (Marenholz et al, 
2004) and regulate cell growth, differentiation and 
motility, transcription and cell cycle. The S-100 proteins 
detected in the present study map to chromosome lq21, 
a region of genome that is frequently altered in human 
breast cancer cells (Bieche et al, 1995). Calcium ions act 
as a second messenger in specific signaling pathways in a 
variety of cancers (Missiaen et al, 2000) and are known 
to alter calcineurin to activate transcription factors such 
as NFATc (Luo et al, 1996). 

As dictated by post- transcriptional regulation, protein 
profiles showed far fewer changes as compared to 
transcript profiles, and the knockdown of five selected 
genes in MCF7 cells produced interesting changes in 
protein profiles. These genes, namely, XAG (secretory 
Xenopus laevis protein), trefoil factors 3, human aurora2 
kinase AIK2, protein kinase HI 1 and DNA replication 
complex GINS PSF2, have been shown to be estrogen 
responsive, oncogenic or involved in tumorigenesis (Y u 
et al, 2001; Fletcher et al, 2003; Katoh, 2003; Warner 
et aL) 2003; Takayama et al, 2003). The antisense 
constructs of these genes appeared to work as siRNAs 
as suggested by the reduction in the transcript detected 
in RT-PCR of RNA isolated from the transacted cells. 
The involvement of the above transcripts in invasive 
potential is apparent from the observed upregulation of 
calcium-binding proteins in transfected MCF7 cells, 
which is comparable to their levels in MD A- MB -2 31 
cells. The proteins that appear to mediate inhibition of 
proliferation in antisense-transfected cells include PKC 
substrate, Raf kinase 'inhibitor, histidine triad nucleo- 
tide-biriding protein and peptidy-prolyl isomerase 
(Pinl). We believe histidine triad protein effects are 



most likely mediated via its interaction with ATM 
protein. Raf kinase inhibitor (Keller et aL, 2004) and 
ATM (Hall, 2005) have been conclusively linked to 
transformation of cells, and the activity of Pinl has been 
related to p53-mediated signaling pathways (Mantovani 
et aL, 2004; Berger et aL, 2005). In this context, p53 
activation has also been hypothesized by Cu-SOD 
prion-like enzyme (Wiseman, 2005). Thus, alterations 
in copper homeostasis and p53-mediated signaling may 
be considered as a significant regulatory mechanism in 
tumorigenesis. 

In summary, we have presented here a set of 
candidate genes that can serve as biomarkers for 
tumorigenesis and invasiveness, and some of these 
markers may be used to develop DNA-based diagnostic 
tests. The alterations in transcripts for copper home- 
ostasis genes suggest copper chelation or inhibition of 
copper transporter ATP7A as potential targets for 
therapeutic application. The modulation of RUN3C2 
splicing variants by chemicals that affect splicing 
machinery may also be explored as a therapeutic 
modality. The changes in EphB6 expression, if con- 
firmed in tumor specimens, may have prognostic 
significance. 



Materials and methods 

Breast cancer cell lines 

We used MCF-10A, a cell line established from normal breast, 
and two breast carcinoma cell lines MCF-7 and MDA-MB- 
231 that vary in their in vitro and in vivo invasiveness. All cells 
were cultured at 37°C/7% C0 2 . MCF-10A cells were grown in 
1:1 DMEM:F12 media (Gibco) with 5% horse serum (Gibco), 
20mM HEPES, lOng/ml EGF (Invitrogen), 10 ml/1 PenStrep- 
Glutamine (lOOOOU/ml penicillin, 10000^g/ml streptomycin 
and 29.2mg/ml L-glutamme), 10^g/ml insulin (Invitrogen), 
0.1/ig/ml Cholera Toxin (Sigma) and 500ng/ml hydrocorti- 
sone (Sigma). MCF-7 and MDA-MB-231 cells were grown in 
DMEM (Gibco) supplemented with 2mM L-glutamine (Gib- 
co), 1 mM sodium pyruvate (Gibco), 5 ml/1 penstrep (5000 U/ 
ml penicillin and 5000 ^g/ml streptomycin), and 10% fetal 
bovine .serum (Hyclone). 

Total RNA isolation 

RNA was isolated from 85 to 95% confluent 10 cm tissue 
culture dishes using TRI reagent (Molecular Research Center 
Inc.) with slight modifications to the recommended protocol. 
Approximately 10 million cells were mixed with 1.0 ml Tri 
Reagent, the mixture was extracted with chloroform and the 
aqueous phase containing RNA was separated. The RNA was 
precipitated with isopropanol, the pellet washed sequentially 
with 80 and 100% ethanol, then dried and resuspended in 
DEPC-treated water. RNA was stored in aliquots at -70°C. 
The quality of RNA was visualized by running on a 
formaldehyde gel. The appearance of ribosomal RNA bands 
indicated that RNA was not degraded during the procedure. 
The amount of RNA was determined by its absorbance at 
260 nm. 

BNAase treatment of total RNA 

To remove DNA contamination, 20 fig of RNA (quantified 
specito photometrically) was treated with 500 ng DNAase I, 
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80 U RNasin (Promega) and 1 mM MgCl 2 in Tris buffer in a 
total volume of 50^1. The reaction was carried out at 37°C for 
1 h and the DNAase inactivated by heating to 65°C for 30 min. 

Representational difference analysis (RDA) 
RDA of cDNAs is a modification of genomic RDA (Lisitsyn 
et ai, 1993). We performed RDA in the following two ways. In 
one experiment, MCF-7 cDNA was used as a driver and 
MDA-MB-231 cDNA as a tester. In the second experiment, 
MCF-7 cDNA was used as a tester and MDA-MB-231 cDNA 
as a driver. The protocol has been describer previously (Jacob 
et ai, 1997). Briefly, first-strand synthesis was carried put using 
a commercial cDNA synthesis kit as per the manufacturer's 
protocol. A linker with a BgUl site was ligated to the tester as 
well as the driver cDNA. A primer specific to one of the linker 
strands was used to PCR amplify the linker-ligated cDNAs. 
The linkers were then removed by digesting the cDNA with 
Bgffi and the digested cDNA was gel purified. A second set of * 
unphosphorylated Bglil adaptors was ligated to the tester 
cDNA only. The tester and driver DNAs were hybridized in a 
5 fi\ reaction volume at a ratio of 1 :40. After hybridization, the 
ends of the tester homoduplexes were repaired with Klenow 
polymerase and 1 /d of the reaction mixture was diluted to 
100 id. The difference product was obtained by amplifying 1 y\ 
of the diluted mixture using the top strand of the ligated 
adaptor as a primer. The amplified difference product was 
digested with EcoRl and cloned in a pBlueScript vector. 
Individual clones were picked up and sequenced by Sanger's 
dideoxy chain termination method. Representative clones were 
validated by Northern analysis and semiquantitaive RT-PCR. 

Microarray analysis 

The GeneChips, Human Genome U133A 2.0, (Affymetrix, 
Santa Clara, CA, USA) used in this study contained 
approximately 22000 probe sets corresponding to 18400 
transcripts and variants, including 14 500 well-characterized 
human genes. 

Total RNA was converted into double-stranded cDNA by 
using Superscript II (Invitrogen, Carlsbad, CA, USA) and an 
oligo-dT primer containing a heel of the T7 RNA polymerase 
promoter sequence. The reaction mixture containing double- 
stranded cDNA was extracted with phenol-chloroform, 
precipitated with ethanol, and dissolved in 12^1 RNase-free 
water. The cDNA was transcribed in vitro by using a RNA 
transcription labeling kit (Enzo Biochem, Famingdale, NY, 
USA) with 6 {A of double-stranded cDNA in the presence of 
ATP, CTP, GTP, UTP, bio-31-CTP and bio-16-UTP. The 
biotinylated RNA was purified by using an affinity column 
(Qiagen, Valencia, CA, USA) and fragmented randomly, by 
heating to 95°C in the presence of fragmentation buffer, 
between sizes of 35 and 200 bases. The GeneChips were ; 
hybridized overnight at 45°C in hybridization oven in a. 
solution containing fragmented cRNA, control oligonucleo- / 
tide B2, 20 x eucaryotic hybridization controls, herring sperm 
DNA, acetylated BSA and 2x hybridization buffer. The 
GeneChips were washed and stained with strep tavidin- 
phycoerythrin and the antibody in 2 x MES stain buffer, 
acetylated BSA, and optically read at a resolution of 6 urn with 
a Affymetrix GeneChip scanner 3000. Affymetrix MICRO- 
ARRAY SUITE 3 was used for initial data preparation 
(generation of .CHP files). Normalization (quantile method) 
and calculation of signal intensities was performed with the 
software package RMA from the R project (http://www.r- 
project.org/). For every cell line, three replicates were * 
performed with Affymetrix Gene Chips. The Gene Chip data 
were used for further calculations after the raw image and 



MAS 5 analysis revealed a positive quality report. Ratios of 
average signal intensity (log 2) were calculated for the probe 
sets between pairs of cell lines and then converted to an 
average fold change (AFC). Statistical validation was per- 
formed on probe sets as described (Yoshida et ai, 2004). The 
statistical method used to assign P-values to the fold changes 
of gene responses is described by Yoshida et ai (2004) and is a 
two-step procedure based on the Benjamini and Yekutieli 
construction of false discovery rate confidence intervals 
(FDRCI) (Reiner et aL, 2003). Functional annotation of 
proteins was assigned through Gene Ontology (http://www. 
geneontology.org) or Locuslink (http://www.ncbi.nlm.nih.gov/ 
LocusLink) classifications obtained through appropriate pub- 
lic databases. 

Quantitative RT-PCR 

RNA was reverse transcribed with Superscript EI (Invitrogen, 
Carlsabad, CA, USA) RT by priming with oligodT. The 
primers specific to validated genes were synthesized from the 3' 
untranslated region using Primer 3 software. PCR reactions 
were then performed in triplicates in an I-cycler Thermocycler 
with optical module (BioRad, Hercules, CA, USA). The 
amplified products were quantified by reading fluorescence of 
SybrGreen I (Molecular Probes, Eugene, OR, USA). Average 
fpld changes were calculated by differences in threshold cycles 
(Ct) between pairs of samples to be compared. HPRT gene was 
used as a control. 

Semiquantitative RT-PCR 

The spectrophotometrically determined concentration of RNA 
was confirmed by amplifying actin message at different cycles. 
The cycling conditions that yielded proportional increment of 
amplified product was used to normalize the RNA concentra- 
tion. The normalized RNA was used as template to determine 
relative abundance of transcripts corresponding to clones 
identified by RDA experiments. The conditions were standar- 
dized in the range of cycles that yielded a PCR product for at 
least one of the pairs of compared RNAs. Such experiments 
dictated cycles between 30 and 35 to be appropriate to 
compare abundance of selected transcripts in MCF7 and 
MDA-MB-231 cells. 

Northern analysis 

The expression pattern of selected transcripts in cell lines was 
also analysed in Northern blots. RNA (20 /*m), as determined 
by spectrophotometer and confirmed by actin amplification, 
was electrophoresed on a formaldehyde agarose gel. A RNA 
ladder was used as a size marker. The RNAs were transferred 
from the gel to a Hybond nylon membrane by capillary 
transfer. The RNA was fixed onto the membrane by 
irradiation in a Stratalinker. The blot was hybridized at 65°C 
for 12—1 5 h with a radioactive probe and the blot was 
subsequently washed with 0.3 x SSC and 1% SDS at 65°C. 
The hybridized probe was detected by autoradiography. 

Tram feet ion of MCF-7 cells with antisense constructs 
The genes selected on the basis of their upregulation were 
cloned in antisense orientation in pCDNA3.1 vector (Inviro- 
gen). MCF-7 cells were grown to 70-80% confluence. 
Approximately 4/ig of DNA was transfected into MCF-7 
cells by using Lipofectamine 2000. The transfected cells were 
grown in the presence of G418 (400/xg/ml). The transfectants 
were processed for protein isolation. A control set of cells was 
transfected with an empty pcDNA3.1. The proteins were 
analysed by two-dimensional electrophoresis, and altered 
bands were excised for mass spectrometry. 
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Protein Isolation 

The cultured cells were harvested by trypsinization and 
centrifuged at 220 £ for 5min at 4°C. The cell pellet was 
washed once with ice-cold 1 x PBS. The proteins were isolated 
by using a commercial kit (BioRad, Hercules, CA, USA). 
Briefly, pelleted cells (0.05 ml) were mixed with 0.5 ml ice-cold 
CPEB solution containing protease inhibitors cocktail 
(Roche), vortexed and stored on ice for 30 min. The cell 
suspension was passed through a syringe needle (20 gauge) for 
10-20 strokes to ensure complete cell lysis. The cytoplasmic 
protein fraction was collected by centrifugation at 100 # for 
10 min at 4°C. The nuclear pellet was washed once again with 
0.5 ml CPEB solution. The nuclear pellet was resuspended in 
0.75 ml PSB buffer, vortexed briefly and centrifuged at 1000 g 
for 10 min at 4°C, and the supernatant containing nuclear 
protein was collected into a new tube. The samples were 
quantified using 2D Quant kit (Amersham Biosciences), 
aliquoted and stored at -80°C to prevent protein degradation. 
To reduce streaking, background staining and the other gel 
artefacts associated with substances contaminating 2D/IEF 
samples, the samples were cleaned with 2D Clean up kit (Bio 
Rad, Hercules, CA, USA) before running on the gel. 

Two-dimensional gel electrophoresis- 

The protein mixtures were separated based on isoelectric 
points by using commercial pre-cast pH gradient gel strips 
according to the manufacturer's instructions. The protein 
sample (175 /zg) in 185 /d of sample buffer (8M urea, 2% 
CHAPS, 0.2% biolytes, 3/10 ampholytes, 65 dim DTT and 
0.002% bromophenoi blue) was loaded in "the sample loading 
trays at the end of 1 1 cm immobilized rehydrated strips (pH 3- 
10) (Bio Rad, Hercules, CA, USA). Following isoelectric 
focusing, proteins were reduced and alkylated by successive 
15 min treatments with equilibration buffer (6m urea, 0.375 M 
Tris-HQ pH 8.8, 2% SDS, 20% glycerol, 2% DTT) and 2.5% 
(W/V) iodoacetamide, respectively. Proteins were then re- 
solved in the second dimension on 8-16% gradient SDS- 
PAGE gel (Bio Rad, Hercules, CA, USA). The protein spots 
were visualized by staining with either silver stain or 



Coomassie blue stain. The gel images were compared and 
bands showing significant (greater than twofold) alterations in 
intensity were excised and processed for mass spectrometry. 
Comparisons were made between protein lysates from MCF- 
10 A, MCF-7 and MDA-MB-231 cell lines or between MCF-7 
and MCF-7 cells transfected with specific antisense constructs. 

Protein identification by enzymatic digestion followed by mass 
spectrometry 

Prior to performing trypsin digestion, the gel pieces containing 
protein spots were washed sequentially once with water and 
twice with acetonitrile. The gel pieces were then allowed to 
swell in lOOmM ammonium bicarbonate and finally washed 
with acetonitrile. The washed slices were dried in a Speed Vac 
concentrator, and subsequently incubated with 20 pi\ of 
Promega's autocatalysis-resistant trypsin (12.5ng//d in 50 dim 
ammonium bicarbonate and 5mM CaCl 2 , pH 8.0) overnight at 
37°C. The supernatant (5 fil) from tryptic digest was injected 
for peptide sequence analysis using on-line capillary liquid 
chromatography-electrospray ionization- tandem mass spectro- 
metry (LC-MS/MS). The front end HPLC utilized a Dionex 
(San Francisco, CA, USA) Vydac 300 ym inner diameter 
x 15mm C18 column. The linear acetonitrile gradient (3%/ 
min, containing 0.02% TFA) was developed using a Hewlett- 
Packard 1 100 pump operating at 0.1 ml/min, and the flow was 
split before the injector such that the flow rate through the 
column was 3 /il/min. Peptides were detected at 215nM. The in- 
line mass spectrometer was a Therm oElectron LCQ-DECA 
instrument operated in data-dependent MS/MS mode, and 
proteins were identified by searching a non redundant protein 
database using the Sequest program. 
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Identification of androgen-regulated genes in the 
prostate cancer cell line LNCaP by serial analysis 
of gene expression and proteomic analysis 

A common therapy for nonorgan-confined prostate cancer involves androgen depriva- 
tion. To develop a better understanding of the effect of androgen on prostatic cells, we 
have analyzed gene expression changes induced by dihydrotestosterone (DHT) in the 
androgen responsive prostate cancer line LNCaP, at both RNA and protein levels. 
Changes at the RNA level induced by DHT were determined by means of serial analysis 
of gene expression (SAGE), and protein profiling was done by means of quantitative 
two-dimensional polyacrylamide gel electrophoresis. Among 1 23 371 transcripts anal- 
yzed, a total of 28844 distinct SAGE tags were identified representing 16570 genes. 
Some 351 genes were significantly affected by DHT treatment at the RNA level 
(p <0.05), of which 147 were induced and 204 repressed by androgen. In two indepen- 
dent experiments, the integrated intensity of 32 protein spots increased and 12 
decreased at least two-fold in response to androgen, out of a total of 1031 protein 
spots analyzed. The change in intensity for most of the affected proteins identified 
could not be predicted based on the level of their corresponding RNA. Our study pro-, 
vides a global assessment of genes regulated by DHT and suggests a need for profiling 
at both RNA and protein levels for a comprehensive evaluation of patterns of gene 
expression. 
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1 Introduction 

Androgens affect numerous aspects of prostate biology 
including development, growth, and maintenance. They 
also play a critical role in tumorigenesis and progression 
of prostate cancer. Androgen deprivation is an estab- 
lished treatment modality for prostate cancer. However, 
the disease eventually progresses into a hormone refrac- 
tory cancer. Several mechanisms have been identified 
which may contribute to androgen independence: (1) 
Mutations in the androgen receptor (AR) lead to ligand- 
independent activation or promiscuity of the receptor 
[1 , 2]. The ability of the receptor to activate or repress 
downstream genes can also be affected by mutation. AR 
mutation is associated with advanced phases of prostate 
cancer [3, 4]; (2) AR can be activated in a ligand-indepen- 
dent manner by specific growth factors and cytokines [5]; 
(3) AR gene amplification has been found to occur in 
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28-30% of tumors that recurred post androgen-abtation 
therapy [6, 7], Wallen ef a/. [8] have recently shown that 
one-third of locally recurrent hormone refractory prostate 
cancer contain AR gene amplification; (4) Coactivator 
amplification and corepressor down-regulation have 
been shown to increase receptor transactivation [9, 10]. 
Regardless of which pathway(s) is taken by the tumor 
cells, androgen-regulated genes may ultimately be the 
key players in the development of hormone refractory 
cancer. As a step towards a better understanding of the 
effect of androgen on gene expression, we have under- 
taken a comprehensive assessment of gene expression 
changes induced by dihydrotestosterone (DHT) in the 
androgen responsive prostate cancer cell line LNCaP. An 
important feature of our study is the parallel assessment 
of expression changes at both RNA and protein levels. 



2 Materials and methods 

2.1 Prostate cell line and DHT treatment 

LNCaP human prostate cancer eel I line (American Type Cul- 
ture Collection, Rockville, MD, USA) was cultured for 3 d in 
phenol-free RPMI 1640 supplemented with 5% charcoal- 
stripped fetal bovine serum (FBS) at 37°C. Half of the cul- 
tures were then treated with 1 0" 9 m DHT for 24 h. Total RNA 
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and protein were extracted from untreated and DHT treated 
cells using TRIzol (Invitrogen, Carlsbad, CA, USA) reagent 
for RNA and a solubilization cocktail for proteins, consisting 
of 9 m urea, p-mercaptoethanol and 1 % NP-40. For kinetics 
experiments 1 x 10 6 cells were plated in phenol-free RPMI 
medium with 5% charcoal-stripped FBS. On day three, 
cells were treated with 5 mL of media containing 10 -9 m 
DHT and total RNA was extracted at different time points 
from 0-96 h. To determine the direct/indirect effect of andro- 
gen on RNA levels, cells were plated as above and treated 
with 5 (ig/mL of cyclohexamide in the presence or absence 
of 1 0" 9 m DHT for 24 h and total RNA was extracted. 

2.2 Serial analysis of gene expression 

Serial analysis of gene expression (SAGE) was performed 
as described previously [11] with the following modifi- 
cations: ditags were PCR amplified using biotinylated pri- 
mers and digested with A/ /a 1 1 1 enzyme [12]. Concatemers 
were heated for 15 min at 65°C and chilled on ice for 
10 min before separation on an 8% polyacrylamide gel 
[13]. The concatemers were then cloned into the Sphl 
site of the pZero vector (Invitrogen). Concatenated tags 
were screened by PCR using M13 forward and M13 
reverse primers. PCR products with inserts greater than 
500 bp were isolated and sequenced with M13 forward 
primer on an automated 3700 DNA sequencer (Perkin- 
Elmer, Norwalk, CT, USA). For microSAGE, 1 \lq of total 
RNA per tube was used for cDNA synthesis in two tubes, 
with the mRNA Capture Kit (Boehringer Mannheim, 
Indianapolis, IN, USA. cDNAs were cleaved with N!a\\\ t 
ligated to linkers and then digested with Bsm Fl enzyme. 
The released tags were ligated, and processed for the 
rest of steps as with the standard SAGE protocol. 

2.3 SAGE data analysis 

SAGE tags were extracted using the SAGE software 
V 4.12 [11]. Tags were matched to the SAGE reliable 
map (release 10-26-2000) (http://www.ncbi.nlm.nih.gov/ 
SAGE/). Due to the fact that some tags map to multiple 
genes and some genes have multiple tags, SAGE data 
were analyzed in two different ways: (1) exclusion 
method: tags that match to multiple genes were dis- 
carded. Only tags that match to a single gene were tabu- 
lated and composite counts analyzed for their signifi- 
cance; (2) inclusion method: tags that match to multiple 
genes were counted at 100% toward each gene. All tags 
were tabulated and composite counts analyzed for their 
statistical significance. Lists of differentially expressed 
genes fc><0.05) obtained from the exclusion and inclu- 
sion methods were compared, and finally only genes that 
have a p value <0.05 in both lists were considered statis- 



tically significant. The total number of genes identified 
was estimated by N m + (N um - 0.1 N um )/3.5 r where N m is 
the number of genes matched to SAGE tags, N um is the 
number of SAGE tags that do not match to known genes 
or ESTs, with 10% representing the estimated sequencing 
error per SAGE tag and 3.5, the average number of tags 
per gene in the SAGE reliable map (release 10-26-2000). 

2.4 RT-PCR and real-time PCR quantification 

For reverse transcription-PC R (RT-PCR), 1 jig of total RNA 
was reverse transcribed into cDNA. One fortieth of cDNA 
was used for PCR reaction. Samples were collected at 
different cycles and separated on a 2% agarose gel with 
ethidium bromide. Image was captured and quantified 
using NucleoVision 760 Image Workstation (Nucleotech, 
CA, USA). Amplification curves were generated. Sub- 
sequently, RT-PCR was done at cycles within the log 
phase of amplification. 

Real-time quantification was performed in the i Cycler (Bio- 
Rad, Hercules, CA, USA). Briefly, one fortieth of cDNA was 
used in each reaction. Six reactions were carried out for 
each gene and three independent experiments were per- 
formed. PCR mix comprised of 1X PCR buffer, 1.5 itim 
MgCI 2 , 0.1 mM dNTR 200 nM primers (listed below), 0.05 U 
platinum Taq polymerase (Invitrogen) and 0.1 x SYBR green 
(Molecular Probes, Eugene, OR, USA). PCR was carried out 
at 94°C for 2 min, and 40 cycles of 94°C for 30 s, 55°C for 
30 s, and 72°C for 30 s. Primers used for real-time PCR 
quantification: PSA (prostate specific antigen) (Hs.171995, 
5'-GGAAATGACCAGGCCAAGAC-3', 5-CAACCCTG GAC 
C TCACACCTA-3'), SCMH1 (Hs.57475, 5'-GCCTTGACC 
ACATCACTCCAT-3\ 5'-AGGCCTAGGGCTGCAAAAG-3'), 
and cluster in (Hs.75106, 5 '-GCAGGAATACCGCAAAAA 
GC-3\ 5'-GACTCAAGATGCCCCCGTAAG-3'). Standard 
samples (50, 25, 1 0, 5, 2.5, 1 and 0.5 ^LcDNA) and experi- 
mental samples were used in real-time quantification PCR. 
Each sample was run in quadruple reactions. Standard 
curve (Ct = mX +■ B) was obtained, where Ct is threshold 
cycle number, X is log quantity of target molecules, m is 
curve slope and B is Y-axis intercept value. Number of 
fold induction or repression for a given target molecule 
was calculated by Qa/Qb, where Qa is the target quantity 
in experimental sample A and Qb is the target quantity 
in experimental sample B. Cta = mlogQa + B or Qa = ' 
10 (cta-B^ ( therefore, Qa/Qb = 1 rjtcta-ctbjAn 



2.5 2-D PAGE 

The procedure followed was as previously described 
[14]. Cells were solubilized in 200 uL of lysis buffer con- 
taining 9.5 m urea :(Bio-Rad), 2% NP-40, 2% carrier 
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ampholytes pH 4-8 (Gallard/Schlessinger, Carle Place, 
NY, USA), 2% p-mercaptoethanol and 10 mM PMSR 
Aliquots containing approximately 5x10 6 cells, were 
applied onto isofocusing gels. IEF was conducted using 
pH 4 to 8 carrier ampholytes at 700 V for 1 6 h, followed 
by 1000 V for an additional 2 h. The 1-D gel was loaded 
onto the 2-D gel, after equilibration in 125 rrtMTris, pH6.8, 
10% glycerol, 2% SDS, 1% DTTand bromophenol blue. 
For the second dimension separation, a gradient of 
11-14% of acrylamide (Serva, Hauppauge, NY, USA) 
was used. Following 2-D PAGE, proteins were visualized 
by silver staining of the gels or transferred to an Immobi- 
lon-P PVDF membrane (Millipore, Bedford, MA). 



2.6 Protein identification by mass spectrometry 

The 2-D gels were silver stained by successive incuba- 
tions in 0.02% sodium thiosulfate for 2 min, 0.1 % silver 
nitrate for 40 min and 0.014% formaldehyde plus 2% 
sodium carbonate. The proteins of interest were excised 
from the 2-D gels and destained for 5 min in 15 mM 
potassium ferricyanide and 50 mM sodium thiosulfate 
as described [15]. Following three washes with water, 
the gel pieces were dehydrated in 100% acetonitrile for 
5 min and dried for 30 min in a vacuum centrifuge. 
Digestion was performed by addition of 100 ng of trypsin 
(Promega, Madison, Wl, USA) in 200 mM ammonium 
bicarbonate or by addition of 100 ng of the endopro- 
teinase Glu-C (Promega) in 100 mM ammonium bicarbo- 
nate. The Lys-C digestion was performed with 500 ng of 
the endoproteinase Lys-C (Roche, Mannheim, Germany) 
in 100 mM Tris-HCI, pH 9. Following enzymatic digestion 
overnight at 37°C, the peptides were extracted twice 
with 50 ^lL of 60% acetonitrile/1 % TFA. After removal of 
acetonitrile by centrifugation in a vacuum centrifuge, the 
peptides were concentrated by using pipette tips C18 
(Millipore). 

Analyses were performed primarily using a PerSeptive 
Biosystem MALDI-TOF Voyager- DE mass spectrometer 
(Framingham, MA, USA), operated in delayed extraction 
mode. Peptide mixtures were analyzed using a saturated 
solution of cyano-4-hydroxycinnamic acid (CHA) (Sigma, 
St. Louis, MO, USA) in acetone containing 1% TPA 
(Sigma). Peptides were selected in the mass range of 
800-4000 Da. Spectra were calibrated using calibration 
mixture 2 of the Sequazyme peptide mass standards kit 
(PerSeptive Biosystems). The search program MS- Fit, 
developed by the University of California at San Francisco 
(http://prospector.ucsf.edu), was used for searches in the 
NCBI database. Search parameters were as follows: 
maximum allowed peptide mass error of 400 ppm, con- 
sideration of one incomplete cleavage per peptide and 



pH range between 4 and 8. MALDI-TOF mass spectro- 
metry was also used for molecular weight determina- 
tion as described [16]. In some cases, the amino acid 
sequence of some peptides of interest was determined 
by ESI MS analysis. 

3 Results 

3.1 SAGE analysis of the effect of androgen 
on gene expression 

SAGE libraries were generated from LNCaP cells cul- 
tured in the presence or absence of DHT. A total of 
123371 tags were generated of which 62 878 were from 
the LNCaP cell line and 60493 from the LNCaP cells 
treated with DHT for 24 h {Table 1). Sequence analysis 
identified a total of 28844 distinct tags representing 
16570 genes, 1 1 243 from LNCaP and 12203 from DHT 
treated cells. A total of 351 transcripts were differentially 
expressed at a significant level (p<0.05). Eighty-seven 
percent of transcripts matched Gen Bank entries; 79% 
corresponded to known sequences and 8% to ESTs. 
RNA levels for 1 47 genes were increased and 204 genes 
were decreased after DHT treatment (Table 2). Therefore, 
at the RNA level more genes were repressed than 
activated by androgens. Of these androgen-regulated 
genes, 149 were changed >: five-fold by DHT treatment 
(Table 3). 



Table 1 . Summary of SAGE analysis in LNICaP cells 



Sample 


Total 

transcripts 


Distinct 
tags 


Number 
of genes 


-DHT 


62 878 


17 050 


11 243 


+ DHT 


60 493 


18510 


12 203 


Total 


123 371 


28 844 


16 570 



- DHT: LNCaP cells without dihydrotestosterone; DHT: 
LNCaP cells treated with 1 0 -9 m dihydrotestosterone for 
24 h 



Table 2. DHT regulated genes in LNCaP cells 



Genes 


LNCaP vs LNCaP + DHT 




(gene #) 






p 0.05 


p 0.01 


Known genes 


277 


147 


ESTs 


29 


8 


No match 


45 


13 


Total 


351 


168 


Up-regulated by DHT (24 h) 


147 


65 


Down-regulated by DHT (24 h) 


204 


103 
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Table 3. List of candidate genes that are regulated by DHT (== 5 fold) 



UGD or Tag 


CAP 


DHT 


Change 


P value 


Description 


283305 


1 


82 


Induced 


0 


immunoglobulin heavy contant alpha 1 


183752 


0 


42 


Induced 


0 


microseminoprotein, beta- 


140 


1 


44 


Induced 


0 


immunoglobulin heavy constant gamma 3 (Gm marker) 


75415 


18 


95 


Induced 


0 


beta-2-microglobulin 


84298 


0 


17 


Induced 


1 .OOE-05 


CD74 antigen (invariant polypeptide of major histocomp ability 
complex, class II antigen-associated) 


9615 


0 


16 


Induced 


1.00E-05 


myosin regulatory light chain 2, smooth muscle isoform 


77443 


0 


16 


Induced 


1.00E-05 


actin, gamma 2, smooth muscle, enteric 


1119 


0 


15 


Induced 


2.33 E-05 


nuclear receptor subfamily 4, group A, member 1 


84 753 


23 


2 


Repressed 


3.00E-05 


KIAA0246 protein 


75 777 


0 


14 


Induced 


3.66E-05 


transgelin 




0 


14 


Induced 


3.66E-05 


immunoglobulin J polypeptide, linker protein for immuno- 


76325 










globulin alpha and mu polypetides 


1852 


1 


17 


Induced 


4.33E-05 


acid phosphatase, prostate 


78465 


0 


13 


Induced 


9.00E-05 


v-jun avian sarcoma virus 1 7 oncogene homolog 


78344 


1 


16 


Induced 


1 .20E-04 


myosin, heavy polypeptide 1 1 , smooth muscle 


83006 


19 


2 


Repressed 


1 .50E-04 


CGI-1 39 protein 


263812 


15 


1 


Repressed 


3.19E-04 


nuclear distribution gene C (A. nidulans) homolog 


TACGGGGATA 


0 


11 


Induced 


4.20E-04 


Novel 


GCCTGGGTGG 


11 


0 


Repressed 


6.63E-04 


Novel 


GACTGACACT 


16 


2 


Repressed 


8.40E-04 


Novel 


284296 


0 


10 


Induced 


8.60E-04 


Homo sapiens SURF-4 mRNA, complete cds 


75105 


13 


1 


Repressed 


1.19E-03 


emopamil-binding proteili (sterol isomerase) 


128075 


10 


0 


Repressed 


1.30E-03 


ESTs 


1 54 1 62 


15 


2 


Repressed 


1.43E-03 


A D P-ri bo s ylati on f a ctor -like 2 


6895 


1 


12 


Induced 


1.49E-03" 


actin related protein 2/3 complex, subunit 3 (21 kD) 


143240 


17 


3 


Repressed 


1 .63E-03 


ESTs 


211 582 


o 


9 


induced 


1.66E-03 


myosin, tight polypeptide kinase 


I VJVf L.UU 


o 


g 


Induced 


1.66E-03 


tropomyosin 2 (beta) 


93 002 


14 


2 




2 53E-03 


ubiquitin carrier protein E2-C 


17883 


14 


2 


Repressed 


2. 53 E-03 


protein phosphatase 1G (formerly 2C), magnesium- 
dependent, gamma isoform 


126023 


14 


2 


Repressed 


2.53E-03 


ESTs, Highly similar to NTC1_HUMAN NEUROGENIC LOCUS 
NOTCH PROTEIN HOMOLOG 1 PRECURSOR 


69469 


2 


13 


Induced 


2.92E-03 


dendritic cell protein 


77899 


2 


13 


Induced 


2.92E-03 


tropomyosin 1 (alpha) 


285501 


2 


13 


Induced 


3.07E-03 


Human rearranged immunoglobulin lambda light chain mRNA 


119209 


0 


8 


Induced 


3.07E-03 


insulin-like growth factor binding protein 7 


173043 


0 


8 


Induced 


3.07E-03 


metastasis-associated 1 -like 1 


75866 


0 


8 


Induced 


3.07E-03 


dimethylarginine dimethylaminohydrolase 1 


171695 


0 


8 


Induced 


3.07E-03 


dual specificity phosphatase 1 


TACGGGGATT 


0 


8 


Induced 


3.34E-03 


Novel 


172 791 


13 


2 


Repressed 


4.19E-03 


ubiquitously-expressed transcript 


13561 


13. 


2 


Repressed 


4.19E-03 


:ESTs, Weakly similar to dJ37E16.5 [H. sapiens] 


98260 


8 


0 


Repressed 


4.34E-03 ' 


.ESTs 


CATAAGACTT 


8 


0 


Repressed 


4.64E-03 


(/ Novel 


TACGGGGACA 


2 


12 


Induced 


5.76E-03 


Novel 


180034 


12 


2 


Repressed 


6.91 E-03 


cleavage stimulation factor, 3' pre-RNA, subunit 3, 77kD 


5753 


12 


2 


Repressed 


6.91 E-03 


inosito(myo)-1 (4)-monophosphatase 2 


278941 


10 


1 


Repressed 


6.94E-03 


PRO0628 protein 


19 500 


0 


7 


Induced 


7.25E-03 


nuclear localization signal deleted invelocardiofacal syndrome 
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Table 3. Continued 



I I f^n nr Tan 

Uuu or i dy 


PAP 


nun 




r vaiue 


Description 


9006 


0 


7 


Induced 


7.25E-03 


VAMP (vesicle-associated membrane protein)-associated 
protein A (33kD) 


26471 


0 


7 


Induced 


7.25E-03 


Homo sapiens clone HQ0692 


AACTGCTGGC 


12 


2 


Repressed 


7.63E-03 


Novel 


GCTGACCGTC 


7 


0 


Repressed 


8.59E-03 


Novel 


CCCCCCTGTC 


7 


0 


Repressed 


8.59E-03 


Novel 


243901 


2 


11 


Induced 


8.82E-03 


Homo sapiens cDNA FU20738 fis, clone HEP08257 


119000 


1 


9 


Induced 


9.12E-03 


actinin, alpha 1 


78596 


1 


9 


induced 


9.12E-03 


proteasome (prosome, macropain) subunit, beta type, 5 


171955 


7 


0 


Repressed 


9.61 E-03 


trophinin associated protein (tastin) 


180545 


7 


0 


Repressed 


9.61 E-03 


Homo sapiens mRNAfor hypothetical protein (TR2/D1 5 gene) 


77719 


7 


0 


Repressed 


9.61 E-03 


gamma-glutamyl carboxylase 


4877 


7 


0 


Repressed 


9.61 E-03 


CGI-51 protein 


22795 


7 


0 


Repressed 


9.61 E-03 


ESTs 


79335 


7 


0 


Repressed 


9.61 E-03 


Homo sapiens SWI/SNF-related, matrix-associated, actin- 
dependent regulator of chromatin D1 (SMARCD1) mRNA 


119177 


9 


1 


Repressed 


1.26E-02 


ADP-ribosylatin factor 3 


CGGGAGCACC 


9 


1 


Repressed 


1 .29E-02 


Novel 


182217 


0 


6 


Induced 


1 .32E-02 


succinate-CoA ligase, ADP-forming, beta subunit 


75106 


0 


6 


Induced 


1 .32E-02 


clusterin (complement lysis inhibitor, SP-40, 40, sulfated gly- 
coprotein 2, testosterone-repressed prostate message 2) 


256311 


0 


6 


Induced 


1 .32E-02 


granin-like neuroendocrine peptide precursor 


103180 


0 


6 


Induced 


132E-02 


DC2 protein 


TACGGGGATG 


1 


8 


Induced 


1 .70E-02 


Novel 


AAACAAATCA 


2 


10 


Induced 


1 .70E-02 


Novel 


136644 


2 


10 


Induced 


1.71 E-02 


CS box-containing WD protein 


8036 


2 


10 


Induced 


1.71E-02 


glioblastoma overexpressed 


74284 


2 


10 


Induced 


1.71E-02 


ESTs, Weakly similar to Similar to S. cerevisiae hypothetical 
protein L3111 


ATGGGTGATC 


6 


0 


Repressed 


1.75E-02 


Novel 


ATCACTGCCC 


6 


0 


Repressed 


1 .75E-02 


Novel 


ACATCATCAG 


6 


0 


Repressed 


1.75E-02 


Novel 


CCAGTCCAAG 


6 


0 


Repressed 


1.75E-02 


Novel 


54842 




8 


Induced 


1 .78E-02 


ESTs 


1526 


t 


8 


Induced 


1.78E-02 


ATPase, Ca++ transporting, cardiac muscle, slow twitch 2 


7911 


1 . 


8 


Induced 


1 .78E-02 


KIAA0323 protein 


227400 


1 


8 


Induced 


1.78E-02 


mitogen-activated protein kinase kinase kinase kinase 3 


77269 


1 


8 


Induced 


1.78E-02 


guanine nucleotide binding protein (G protein), alpha inhibiting 
activity polypeptide 2 


7943 


6 


0 


Repressed 


1.81E-02 


RPB5-mediating protein 


278544 


6 


0 


Repressed 


1 .81 E-02 


acetyl -Coenzyme A acetyltransf erase 2 (acetoacetyl 
Coenzyme A thioiase) 


23111 


6 


0 


Repressed 


1.81E-02 


phenylalanine-tRNIA synthetase -like 


31442 


6 


0 


Repressed 


1.81 E-02 


RecQ protein-like 4 


38628 


6 


0 


Repressed 


1.81 E-02 


hypothetical protein 


77422 


6 


0 


Repressed 


1.81 E-02 


proteolipid protein 2 (colonic epithelium-enriched) 


211973 


6 


0 


Repressed 


1.81 E-02 


homolog of Yeast RRP4 (ribosomal RNA processing 4), 
3'-5'-exoribonuclease 


171075 


6 


0 


Repressed 


1.81 E-02 


replication factor C (activator 1) 5 (36.5kD) 


89781 


6 


0 


Repressed 


1.81 E-02 


upstream binding transcription factor, RNA polymerase I 


154149 


6 


0 


Repressed 


1.81 E-02 


apurinic/apyrimidinic endonuclease(APEX nuclease)-like 2 



protein 
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Table 3. Continued 



UGD or Tag 


CAP 


DHT 


Change 


P value 


Description 


279 772 


8 


1 


Repressed 


2.28E-02 


brain specific protein 


205091 


8 


1 


Repressed 


2.28E-02 


ESTs, Weakly similar to WW domain binding protein 1 1 
{M. muscufus) 


75658 


8 


1 


Repressed 


2.28E-02 


phosphorylase, glycogen; brain 


GGGCAGCTGT 


8 


1. 


Repressed 


2.37E-02 


Novel 


105440 


0 


5 


Induced 


2.43E-02 


hepatocyte nuclear factor 3, alpha 


118244 


0 


5 


Induced 


2.43E-02 


protein phosphatase 2, regulatory subunit B (B56), delta iso- 
form 


82389 


0 


5 


Induced 


2.43E-02 


CGi-1 18 protein 


31638 


0 


5 


Induced 


2.43E-02 


restin (Reed-Steinberg eel I -expressed intermediate filament- 
associated protein) 


12013 


0 


5 


Induced 


2.43E-02 


ATP-binding cassette, sub-family E (OABP), member 1 


12797 - 


0 


5 


Induced 


2.43E-02 


DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 16 


7736 


0 


5 


Induced 


2.43E-02 


hypothetical protein 


274479 . 


0 


5 


Induced 


2.43E-02 


NME7 


153138 


0 


5 


Induced 


2.43E-02 


origin recognition complex, subunit 5 (yeast homolog)-like 


16034 


0 


5 


Induced . 


2.43E-02 


ESTs 


270072 


0 


5 


Induced 


2.43E-02 


ESTs 


187035 


0 


5 


Induced 


2.43E-02 


ESTs 


140452 


0 


5 


Induced 


2.43E-02 


cargo selection protein (mannose 6 phosphate receptor bin- 
ding protein) 


182265 


0 


5 


Induced 


2.43E-02 


keratin 19 


19762 


0 


5 


Induced 


2.43E-02 


ESTs, Weakly similar to unknown (D. melanogaster) 


317 


0 


5 


Induced 


2.43E-02 


topoisomerase (DNA) 


6236 


0 


5 


Induced 


2.43E-02 


ESTs 


GCTGGAGCCT 


5 


0 


Repressed 


3.03E-02 


Novel 


CCAGTGCT.CA 


5 


0 


Repressed 


3.03E-02 


Novel 


ACCCTACATA 


5 


0 


Repressed 


3.03E-02 


Novel 


GGGGAAATCT 


5 


0 


Repressed 


3.03E-02 


Novel 


ACTGGTACTG 


5 


0 


Repressed 


3.03E-02 


Novel 


GCTCCGGTGT 


5 


0 


Repressed 


3.03E-02 


Novel 


ACAGTGGTGA 


5 


0 


Repressed 


3.03 E-02 


Novel 


7869 


1 


7 


Induced 


3.04E-02 


lysophosphatidic acid acyltransferase-delta 


12101 


1 


7 


Induced 


3.04E-02 


hypothetical protein 


266940 


J 


7 


Induced 


3.04E-02 


t-complex-associated-testis-expressed 1 -like 1 


6196 




7 


Induced 


3.04E-02 


integrin-linked kinase 


366 


1 


7 


Induced 


3.04E-02 


6-pyruvoyltetrahydropterin synthase 


173611 


1 


7 


Induced 


3.04E-02 


NADH dehydrogenase (ubiquinone) Fe-S protein 2 (49kD) 
(NADH-ceonzyme Q reductase) 


102469 


1 


7 


Induced 


3.04E-02 


putative nuclear protein 


78825 




7 


Induced 


3.04E-02 


matrin 3 


284465 


] 


7 


Induced 


3.04E-02 


ESTs 


30738 


1 


7 


Induced 


3.04E-02 


hypothetical protein FU10407 


78687 




7 


Induced 


3. 04 E-02 


neutral sphingomyelinase (N-SMase) activation assoviated 
factor 


92381 


1 


7 


Induced 


3.O4.E-02 


nudix (nucleoside diphosphate linked moiety X)-type motif 4 


242039 


5 


0 


Repressed 


3.26E-02 


EST 


4766 


5 


*0 


Repressed 


3.26E-02 


DKFZP586O0120 protein 


283109 


5 


0 


Repressed 


3.26E-02 


hypothetical protein DKF2p762L1710 


192853 


5 


0 


Repressed 


3.26E-02 


ubiquitin-conjugating entyme E2G 2 (homologous to yeast 
UBC7) 


153678 , 


5 


0 


Repressed 


3.26E-02 


reproduction 8 
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Table 3. Continued 



UGD or Tag 


CAP 


DHT 


Change 


P value 


Description 


251317 


5 


0 


Repressed 


3.26E-02 


EST 


279623 


5 


0 


Repressed 


3.26E-02 


selenoprotein X 


150319 


5 


0 


Repressed 


3.26E-02 


ESTs 


102456 




.9 


Repressed 


3.26E-02 


survival of motor protein interacting protein 1 


284250 


5 


0 


Repressed 


3.26E-02 


AD-003 protein 


251871 


5 


0 


Repressed 


3.26E-02 


CTP synthase 


270480 


7 


1 


Repressed 


3.92E-02 


ESTs, Weakly similar to ALU5 HUMAN ALU SUBFAMILY SC 
SEQUENCE 


26655 


7 


1 


Repressed 


3.92E-02 


glucose-6-phosphatase, transport (glucose-6-phosphate) 
protein 1 


8118 


7 


1 


Repressed 


3.92E-02 


leukotriene A4 hydrolase 


CTCCGCCGGC 


7 


1 


Repressed 


4.47E-02 


Novel 


AGGAAATGCT 


7 


1 


Repressed 


4.47E-02 


Novel 


GCTGACCGAGG 7 


1 


Repressed 


4.47E-02 


Novel 


CCCATAGTCC 


7 


1 


Repressed 


4.47E-02 


Novel 



CAP: LNCaP cells without dihydrotestosterone 

DHT: LNCaP cells treated with 1 0" 9 dihydrotestosterone for 24 h 

UGD: unigenelD 

Numbers in CAP and DHT columns are the number of tags observed in respective samples 



3.2 Confirmation of SAGE data 

To confirm the differential expression pattern at the RNA 
level, semiquantitative RT-PCR assay was performed on a 
group of selected genes. One \ig of total RNA was reverse 
transcribed into cDNA with oligo dT. cDNA was amplified by 
PCR for various cycles. PCR products were separated on 
agarose gels and quantified by densitometry. PCR ampli- 
fication curves were plotted for each gene, and data within 
the logarithmic phase of amplification were used for quanti- 
fication (Fig. 1 A, B and C). The reproducibility of gene regu- 
lation by androgen was confirmed in three independent 
experiments by monitoring the induction of PSA in the pres- 
ence or absence of cyclohexamide (Fig. 1 B). The kinetics 
of gene induction or repression was determined for PSA 
(Hs.171995), clusterin (Hs.75106) and SCMH1 (Hs.57475) 
genes. Typical examples of kinetics are shown in Figure 
1C. PSA was induced at 4-6 h, peaked between 6-20 h F 
and gradually declined after 20 h post-treatment of DHT. 
Clusterin was induced within 0.5-1 h and gradually declined 
after 6-12 h. SCMH1 was repressed 2-4 h post-treatment. 
Expression of PSA, clusterin, and SCMH1 was quantified by 
real-time PCR (Fig. 1D). 

3.3 Changes in protein expression induced 
by DHT 

Protein lysates of LNCaP cells cultured in the presence of 
DHT for 72 h and cells cultured in parallel in the absence 
of DHT were subjected to 2-D PAGE (Fig. 2). Following sil- 



ver staining, gels were digitized prior to computer-based 
matching and quantitative analysis, as previously des- 
cribed [17]. Of a total of 1031 protein spots matched 
between 2-D patterns of DHT treated and untreated cells, 
a set of 32 protein spots increased in intensity by at least 
two-fold in two independent experiments. Likewise, a set 
of 1 2 protein spots decreased in intensity by at least two- 
fold in these two independent experiments. A two-fold 
change in integrated intensity by silver staining represents 
on average a three-fold change in amount of protein, 
based on prior q uantitative studies [17]. 

The 44 protein spots that changed in intensity as a result 
of DHT treatment were excised from the gels, digested 
with trypsin, and subsequently analyzed by MALDI-TOF 
MS. The resulting spectra were used to identify the pro- 
teins using the MS-Fit search program. Of the 44 spots 
excised from the gels, 29 were identified without ambi- 
guity and consisted of 21 up-regulated and eight down- 
regulated proteins (Table 4). The identified proteins repres- 
ented a heterogeneous group that included cytoskeletal 
proteins (e.g. tropomyosin, a tubulin), metabolic enzymes 
(e.g. adenine phosphoribosyt transferase, p 1 ,4 galactosyl 
transferase, gafactocerebrosidase), and the products of 
previously described androgen responsive genes (e.g. 
SRY, selenium binding protein) [18, 19]. Specific anti- 
bodies confirmed the identification based on MS for all 
proteins analyzed by Western blotting, which included a 
tubulin, myosin light chain isoforms, nucleoside diphos- 
phate kinase A, glyceraldehyde 3-phosphate dehydro- 
genase (G3PD), and tropomyosin (data not shown). 
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:/* Figure 1. Kinetics and quanti- 

fication of androg en-regulated 
genes. Total RNA from un- 
treated and DHT treated 
LNCaP cells was reverse 
transcribed. cDNA was PCR 
amplified for various cycles. 
PCR amplification curves were 
plotted for each gene. A cycle 
number within the logarithmic 
phase of amplification was 
selected for semi -quantitative 
analysis. cDNA was also used 
for real-time quantitative PCR. 
(A) An example of RT-PCR 
amplification of PSA at cycles 
between 20-30. (B) . RT-PCR 
of PSA in three independently 
treated LNCaP samples, h: 
hours; 0: no DHT; 24: treated 
with 10T 9 M DHT for 24 h; 

without cycloheximide; +: with 5 ng/mL of cycloheximide for 24 h. (C) An example of three independent kinetics experi- 
ments for PSA, clusterin and SCMH1. (D) Comparison of SAGE results with that of real-time quantitative PCR for PSA, 
clusterin and SCMH1 . ]\ induction; J: repression. 



f. 



Corresponding SAGE data were available for most of the 
proteins affected by DHT treatment that were identified. It 
was therefore of interest to determine if the changes 
observed at the protein levels were matched by concor- 
dant changes at the rnRNA level. Remarkably, for most of 
the proteins identified, there was no appreciable concor- 
dant change at the RNA level (Table 4). 



4 Discussion 

We have compared the gene expression profile of 
LNCaP, an androgen responsive prostate cancer cell 
line, with or without androgen (DHT) treatment. Approxi- 
mately 350 of 16570 expressed genes detected at the 
RNA level were affected by dihydrotestosterone at the 
p<0.05 level. The DHT responsive genes included 
known genes, ESTs, and novel genes. As expected, we 
saw an induction of genes that are well known to be 
regulated by androgens. For instance, we found a 1 .7- 
fold induction in kallikrein 3/PSA, a 7.6-fold induction in 
prostatic kallikrein 2, and a 15.7-fold induction in pro- 
static acid phosphatase by DHT in LNCaP cells. We 
also saw a five-fold induction in NKX3.1/NKX3A and 
three -fold in fatty acid synthase; two previously identified 
androgen-regulated genes [20, 21]. Interestingly p-micro- 
seminoprotein, reported to be reduced or lost in prostate 
tumor [22], was. up-regulated more than 40-fold in 



LNCaP cells by DHT. More significantly, our data indicate 
that genes involved in a variety of tumor cell functions 
such as growth, apoptosis, and metastasis, are directly 
or indirectly regulated by androgens. In addition, it is 
noteworthy that both p-actin and G3PD, the two most 
frequently used loading controls, were up-regulated 
approximately two-fold by DHT (p<0.01). Thus, these 
two genes may not be appropriate controls in some 
experiments. 

Serial analysis of androgen-regulated gene expression 
provides us with a list of candidate genes. However, 
many factors such as the dose of DHT and the time of 
cell exposure to DHT and other unknown experimental 
variations will affect the level of gene expression. There- 
fore, it is important to confirm the SAGE results with alter- 
native methods. We performed semiquantitative RT-PCR 
(Fig. 1 B) on approximately 20 genes and real-time quanti- 
tative PCR on 10 genes in at least three independently 
DHT treated LNCaP samples. We noticed experimental 
variations in every gene determined. Some genes such 
as PSA (Fig: 1 B) have smaller variations and some have 
larger variations. In addition, the fold of alteration in 
expression identified by SAGE is different for some genes 
compared to that identified by real-time quantitative PCR. 
This is probably due to the technical limitation of SAGE. 
Serial analysis of gene expression is highly quantitative 
for genes whose tags are detected at large numbers. 



5 



Proteomics 2001 ,7, 1327-1338 



Identification of androgen-regulated genes in LNCaP by SAGE 1335 




Figure 2. 2-D PAGE patterns of 
untreated (A) and LNCaP cells 
treated with DHT for 72 h (B). 
Identified proteins that were 
reduced in levels following DHT 
treatments are shown in (A) and 
induced proteins are shown in 
(B). Numbered spots represent 
either proteins that were chan- 
ged in level that have not been 
identified or isoforms of identi- 
fied proteins. 
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G3PD (457) 
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75 
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Selenium Binding Protein (234) 


7 833 
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Selenium Binding Protein (237) 
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26 
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Numbers in protein are the intensity of protein spots 

Numbers in mRNA columns are the number of SAGE tags detected 

ND: not detected 

f: increase; |: decrease; =: equal . 



Proteomics 2001 , 7, 1327-1338 



Identification of androgen-regulated genes in LNCaP by SAGE 1 337 



When the tag number detected for a given gene is close 
to zero, the quantitative nature of SAGE is compromised. 
For instance, in approximately 60 000 transcripts of each 
sample, the PSA tag was detected 42 times in LNCaP and 
70 times in LNCaP treated with DHT. However, the tag for 
clusterin was detected 0 times and 6 times, and the tag 
for SCMH1 5 times and 0 times in the corresponding sam- 
ples. Therefore, the SAGE data of PSA is more reliable 
than that of clusterin and SCMH1 . Indeed, SAGE data of 
PSA is virtually identical to that of real-time quantitative 
PCR, whereas the SAGE data of clusterin and SCMH1 dif- 
fer from that of quantitative PCR (Fig. 1 D). 

PSA is probably the best-known androgen-regulated 
gene. We were surprised to see only 1.7-fold induction 
by DHT. However, kinetics experiments indicate that PSA 
is induced at 4-6 h, peaked between 6-20 h, and gradu- 
ally declined 20 h post-treatment of DHT. Since the SAGE 
experiment was done in samples treated for 24 h with 
DHT, PSA mRfsIA level was likely to have declined from 
its peak level. Indeed, we could detect 5-10 fold induction 
of PSA at 6-8 h post-treatment. Clusterin was reported as 
an androg en-repressed gene in the rat prostate [23]. 
Recent evidence indicates that clusterin may not be 
directly androgen-repressed, but regulated by apoptotic 
stimuli [24]. Our results suggest that clusterin was 
induced within 0.5-1 h, gradually declined after 6-12 h, 
and after 24 h reduced to a lower level than that of 
untreated cells (Fig. 1C). 

Another important question is how many of the andro- 
gen-regulated genes identified are directiy induced or 
repressed by androgens. In order to address this ques- 
tion, detailed analyses must be done for each gene. As 
the first step, we wilt divide the androgen-regulated 
genes identified in this study into two groups; the cyclo- 
hexamide-sensitive group whose induction or reduction 
of expression by DHT is blocked by the protein synthesis 
inhibitor and the cyclohexamide-insensitive group whose 
induction or reduction is not affected by cyclohexamide. 
We consider that genes in the latter group are directly 
regulated by DHT. Preliminary results suggest that 
approximately 20% of the 20 genes studied by RT-PCR 
are cyclohexamide-insensitive. Experiments are under 
way to determine the cyclohexamide sensitivity of all of 
the 149 genes listed in Table 3. Further characterization 
of androgen-regulated genes may provide some clues 
on the transition from hormone sensitive to hormone 
refractory prostate cancer. 

A relatively small set of genes could be analyzed at the 
protein level, largely due to the limited sensitivity of 2-D 
PAGE. Nevertheless, a substantial number of detected 
proteins (44 of 1031 proteins analyzed, 4.3%) were 
affected by DHT treatment. Some of the proteins in this 



subset that were identified included the products of 
genes that were previously shown to be affected by 
androgens, namely selenium binding protein, brain speci- 
fic creatine kinase, SRY protein, B23 and G3PD. Using 
a subtractive approach, the human selenium-binding 
protein gene was shown to be differentially expressed 
in LNCaP and reversibly down -regulated by exogenous 
androgen in a concentration-dependent manner, in con- 
cordance with our findings at the protein level [1 9]. An 
increase in levels of the brain specific creatine kinase B 
chain in response to androgen has also been described 
[25]. Likewise the SRY gene has been found to be respon- 
sive to androgen stimulation in LNCaP cells [18]. Also, a 
number of glycolytic enzymes including G3PD which 
were affected at the protein level have been found to be 
responsive to androgen stimulation [26]. Androgenic reg- 
ulation of the amount and phosphorylation of the protein 
B23, included in our list of affected proteins, has been 
previously described and found to be related to the early 
changes associated with androgen mediated growth of 
the prostate [27]. 

Corresponding SAGE data was available for most of the 
proteins affected by DHT treatment that were identified. 
Interestingly, for most of the proteins identified, there 
was no appreciable concordant change at the RNA level. 
There are several potential explanations for this lack of 
concordance. A particular gene may be represented by 
more than one protein isoform in 2-D gels. For example, 
in the identified group, three proteins (myosin light chain, 
G3PD and seJenium binding protein) were represented 
by more than one isoform. Thus, a source of discordance 
between RIMA and protein data may be that the protein 
change is limited to a particular isoform of a protein and 
not to overall protein products of a particular gene. 
Nevertheless, a change in a particular isoform is inform- 
ative and biologically meaningful and may not be pre- 
dictable from RNA data. A lack of concordance between 
RNA and protein data may also reflect either translational 
control, post-translational modifications, or changes in 
protein turnover due to DHT treatment. Yet another 
explanation for a lack of concordance could be a lag 
time for changes at the RNA level to be reflected in a 
protein change. It follows from the above considerations 
that monitoring gene expression at both RNA and protein 
levels may provide complementary information that could 
not be ascertained by solely measuring RNA or protein. 

We would like to thank Dr. Ken Kinzler (Johns Hopkins 
University) for providing SAGE software. This study is par- 
tially supported by DAMD1 7-98-1 -8501 , R01CA74927 
and a fund from Karmanos Cancer Institute. 
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Based on biomedical literature databases, we tried a first step for constructing a gene expression 
"data warehouse" specific to human colorectal cancer (CRC). Results of genome-wide tran- 
scriptomic research were available from 12 studies, using various technologies, namely, SAGE, 
cDNA and oligonucleotide arrays, and adaptor-tagged amplification. Three studies analyzed CRC 
cell lines and nine studies of human samples. The total number, of patients was 144. Out of 982 
up- or down-regulated genes, 863 (88%) were found to be differentially expressed in a single 
study, 88 in two studies, 22 in three studies, 7 in four studies, and only 2 genes in six studies. 
Eight large-scale proteomics studies were published in CRC, using 2-D-, SDS- or free-flow elec- 
trophoresis, involving only 11 patients. Out of 408 differentially expressed proteins, 339 (83%) 
were found to be differentially expressed only in a single study, 16 in three studies, 10 in four 
studies, 3 in five, and 1 in eight studies. Confirmation at proteome level of results obtained with 
large-scale transcriptomics studies was possible in 25%. This proportion was higher (67%) for 
reproducing proteome results using transcriptomics technologies. Obviously, reproducibility and 
overlapping between published gene expression results at proteome and transcriptome level are 
low in human CRC. Thus, the development of standardized processes for collecting samples, 
storing, retrieving, and querying gene expression data obtained with different technologies is of 
central importance in translational research. 
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1 Introduction 

Translational research in human colorectal cancer (CRC) is 
applying a large spectrum of molecular biology, cellular bi- 
ology, and advanced validation tools. In particular, genome - 
wide techniques are now applied to decipher modifications 
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in gene expression. In recent years, transcriptomics and 
proteomics tools have been broadly applied in CRC. The 
hope is that the new data obtained will now allow a classifi- 
cation of disease on molecular basis, deep insights into the 
pathophysiology of CRC, prognostic statements, and finally a 
systematic search for diagnostic and therapeutic targets. 

Because of the complexity of the biological system under 
investigation, the most significant contribution of transla- 
tional research in CRC is expected to derive not from the 
analysis of single experiments but from libraries of experi- 
ments. In other words, the results obtained so far by trans- 
lational research tools in different clinical and experimental 
settings need to be compared, contrasted, and if possible 
synthesized. Thus, based on the biomedical literature data- 
bases, we tried a first step for constructing such "data ware- 
house" specific to human CRC. 

i 
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The specific aims of the present study were to screen 
transcriptomics and proteomics studies published in CRC, 
and to elaborate a kind of "meta-analysis of gene expression" 
in CRC. Our hypothesis was that we would be able to deter> 
mine a common set of genes and gene products that are up- 
or down -regulated in human CRC by overlapping results 
obtained by different authors. 



2 Materials and methods 

This is a literature survey of gene expression data published 
in human CRC. A Medline search of reports published in 
English from 1990 to 2004 using the terms "colon cancer" 
and "gene expression" with the limit "human" was performed 
and yielded 1979 articles. All abstracts were reviewed and a 
related article search was performed on appropriate abstracts. 
Articles were selected by a consensus of two reviewers (E.S., 
M.R.) that satisfied these predetermined criteria: sample ori- 
gin (human and/or cell lines) and preparation detailed, tech- 
nology for gene expression studies defined (transcriptomics 
and/or proteomics studies), and quantitative results on over- 
~ expression or underexpression available. Studies concerning 
& single genes or arbitrarily selected genes were discarded. 
Results obtained in animal models were not considered. 

Data were entered into an Excel working sheet (Micro- 
soft, Seattle, USA). Since gene expression data were not 
always obtained with quantitative, but in most cases semi- 
quantitative gene expression analysis technologies, no 
expression values or ratios were entered into the database. 
Only genes reported as being over- or underexpressed were 
entered. No threshold was defined so that some genes 
defined as differentially expressed might have shown only 



marginal differences. Only descriptive statistics are provided, 
which were obtained with the in-built tools of the Excel soft- 
ware. 

Various methods were used for large-scale translational 
research in CRC, including transcriptomics and proteomics 
technologies. Results of genome-wide transcriptomic re- 
search are available from 12 studies, using various technolo- 
gies, namely, SAGE (five studies), complementary DNA 
(cDNA) arrays (five studies), and oligonucleotide array (one 
study), as well as adaptor-tagged amplification (one study). 
Three studies analyzed CRC cell lines and nine other studies 
analyzed human samples. The total number of patients pro- 
filed in all studies was 344. Out of the nine studies with hu- 
man samples, only two studies with a total of only 22 patients 
were performed using purified epithelial cells (one of them 
using cDNA arrays, the other one using SAGE technology). 
An overview of these studies is provided in Table 1. 

Eight large-scale proteomics studies were published in 
CRC using various technologies, namely, 2-D PAGE (six 
studies), SDS-PAGE (one study), and free- flow electrophore- 
sis (one study). In total, all proteome studies in CRC involved 
only 11 patients. An overview of these studies is provided in 
Table 2. 



3 Results 

One thousand two-hundred and forty genes have been 
reported to be dysregulated (up- and/or down-regulated) in 
human CRC, representing about 5% of the 20000-25 000 
human genes. 



* The complete dataset can be consulted: 
http://www.chirurgiebethel.de 



Table 1. Large-scale or genome-wide transcriptomics studies in human CRC, using cell lines or human tissues. All studies together 
involved only 144 patients 



Author 


Year 


Sample 


Preparation 


Method 


Number 
of patients 


Zhang etai. [181 


1997 


Normal mucosa and primary tumors 


Whole tissue 


SAGE 


2 


SAGE-NET [191 


1997 


Primary tumors and cell lines 


Cell lines 


SAGE 




Zhang etai. [181 


1997 


Cell lines 


Cell lines 


SAGE 




Parle-McDermott etai. [20| 


2000 


Cell lines 


Cell lines 


SAGE 




Yanagawa etai [211 


2001 


Paired normal and cancer 


Whole tissue 


cDNA 


20 


Takemasa etai. [221 


2001 


Paired normal and primary tumors 


Whole tissue 


cDNA 


20 


Nottermann etai. [231 


2001 


Paired normal mucosa and primary 


Whole tissue . 


Oligonuc- 


22 






tumors, or adenoma 




leotide array 




Buckhaults etai. [241 


2001 


Normal mucosa, adenoma and 


Purified epithelial 


SAGE 


6 






; primary tumors 


cells 






Birkenkamp etai. [251 


2002 


Paired normal and primary tumors 


Whole tissue 


cDNA 


27 


Lin etai [26] 


2002 


Paired normal mucosa and primary 


Purified epithelial 


cDNA 


16 






tumors, or adenoma 


cells 






Muro era/. [27J 


2003 


Normal and cancer 


Whole tissue 


Adaptor-tagged 


11 


Williams etai [281 


2003 


Paired normal mucosa and primary 


Whole tissue 


cDNA 


20 






tumors, normal and adenoma 
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Table 2. Large-scale or proteomics studies in human CRC, using cell lines or human tissues. AH studies together 
involved only nine patients 



Ai ithor 

^\ LI 11 IUI 


Pi ihlip^t"ir»n 
r uuiiuaiiuii 


O O 1 1 1 pi C 


IVi CUIUU 


M 1 1 m h a r 
IMUIIIUcf 




year 






of patients 


Reymond etat. [91 


1997 


Purified epithelial cells 


2-D PAGE 


1 


Simpson et ai (291 


2000 


Cell lines 


SDS-PAGE 




Lawrie era/. [30J 


2001 


Purified epithelial cells 


2-D PAGE 


4 


Simpson et ai [311 


2001 


Cell lines 


Free-flow electrophoresis 




Medjahed etai [321 


2003 


In silico 


2-D PAGE 




Demalte etai (331 


2003 


Cell lines 


2-D PAGE 




Stierum etai [341 


2003 


Cell lines 


2-D PAGE 




Friedman etai [351 


2004 


Human whole tissue 


2-D PAGE 


6 



The vast majority of dysregulated genes in human CRC 
was found using transcriptomics tools: a total of 982 genes 
was found to be differentially expressed in at least one of 
these 12 transcriptomics studies. Out of these 982 genes, 863 
(88%) were found to be differentially expressed only in a 
single study, in other words these results have not been 
reproduced so far. The other findings could be reproduced in 
two or more transcriptomics studies: 88 genes were found to 
be differentially expressed in two studies, 22 in three studies, 
7 in four studies, no gene in five studies, and 2 genes in six 
studies. The most cited genes are listed in Tables 3 (up-reg- 
ulation) and 4 (down-regulation). 

A total of 408 proteins were found to be differentially 
expressed in human CRC in at least one study. Out of these 408 
molecules, 339 (83%) were mentioned in a single study, in 
other words these results could not be reproduced. Differential 
expression of the remaining 70 proteins were mentioned with 
following frequenc: 40 were mentioned in two proteomics 
studies, 16 in three studies, 10 in four studies, 3 in five studies, 
and a single protein in eight studies. The most cited proteins 
are listed in Table 5. It has to be noted that only a single study 
[1] provided differential display protein expression data 
obtained in the human patient, using whole tissue biopsy. 

It is also difficult to reproduce transcriptomics results 
with proteomics tools. Out of 982 genes found to be differ- 
entially expressed in human CRC by .genome-wide tran- 
scriptomics technologies (Table 6a), only 177 (18%) have 
been confirmed using proteorne technologies. When the 
genes reported to be differentially expressed in three and 
more transcriptomics studies (n = 31) are compared with 
those reported to be differentially expressed in three and 
more proteomics studies (n = 30), only two genes (actin: 
ACTB_HUMAN and creatin-kinase: KCRB_HUMAN) can 
be matched. Thus, the probability of reproducing a gene 
expression result obtained at transcriptome level is low when 
using proteomics technologies. 

In contrast, there is a better reproducibility when prote- 
omics results are verified using transcriptomics tools 
(Table 6b). In fact, when the subset of 30 proteins that are 
consistendy (in three studies or more) reported to be dysreg- 
ulated in CRC is compared with transcriptomics results, 20 

*. 
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genes can be matched, representing about two-thirds. Thus, 
the probability of being able to reproduce a proteomics result 
using transcriptomics tools is more than three times higher 
than the opposite way (67 vs. 18%), However, it has to be 
noted that in 16% of cases, results remain unclear or even 
contradictory. 

Both in the transcriptomics and proteomics studies, 
many genes and factors were found to be differentially regu- 
lated that obviously do not play a causal role in CRC carci- 
nogenesis. 

4 Discussion 

This study was aimed at screening transcriptomics and pro- 
teomics studies published in CRC, and to elaborate a kind of 
meta-analysis of gene expression in CRC in order to generate 
a data warehouse that would be useful in translational 
research. A significant number of translational research 
studies have been published in CRC, starting in 1997 with 
SAGE technology, followed by an increasing number of array- 
based transcriptomics studies since 2001, and more recently 
by several proteomics studies. This has created a significant 
amount of data which we were then able to compile. 

Our starting hypothesis was that we would be able to 
determine a common set of genes and gene products that are 
up- or down-regulated in human CRC by comparing results 
obtained by different authors. This endeavor was not very 
successful; obviously, overlapping between published gene 
expression results both at proteorne and transcriptome level 
is low in human CRC. In fact, more than 80% of results 
could not be reproduced. Several explanations can be pro- 
vided to explain this lack of reproducibility. 

First, the number of patients included in the studies is 
low (alltogether 144 patients in transcriptomics, 11 in prote- 
omics studies); some high impact publications having been 
conducted on samples of only two patients [18]. This is a 
problem because interindividual genetic variability is high in 
human CRC [2]. Thus, it is allowed to hypothesize that some 
results attributed to gene dysregulation might be caused by 
genetic diversity rather than by cancer-specific traits. 
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Table 3. Results of transcriptomics research in human CRC. Most cited up-regulated genes out of 12 genome-wide or large-scale studies 



TRANSCRIPTOMICS 



Accession 

Number Swiss-Prot Entry Name Name 


Yanagawa et al 2001 [21] 


Muro et al 2003 [27] 


£J 

5 
o 
CM 

"ra 

B 
"3 

re 

o 

3 


Birkenkamp et al 2002 [25] 


Takemasa et al 2001 [22] 


Lin et af 2002 [26] 


eT 
<m 

o 

ti 
o 

i 

CD 

a 

£ 

o 

CO 

a. 


Zhang et al 1997 [18] 


SAGE-NET et al 1997 [19] 


CO 

cn 
73 

CD 
OS 

c 
cz 

NI 


Williams et al 2003 [28] 


Notterman et al 2001 [23] 


Paired normal and cancer 


Normal and cancer 


E 

=» 

C- 

-a 

c 

CO 

E 
© 
e 

0) 
T3 
19 
« 

E 

si 


Paired n. and pr.tumors 


Paired n. and pr.tumors 


Paired n.muc and pr.tum or adenoma 


Cell lines 


Norm.muc and Prim.tum 


Primary tumors and cell lines 


Cell line 


Paired n.muc and pr.tum.n and adenoma 


Paired n.muc and pr.tum or adenoma 


cONA 


Adaptor-tagged 


LU 

a 


< 

tz 
o 
u 


< 
z 
o 

u 


< 
o 


tu 
CD 
<t 

to 


SAGE 


SAGE 


SAGE 


cDNA 


Oligonucleotide Array 


AF041260 


075363 


IFM2_HUMAN 


Interferon-inducible protein 1-Bd 


1 




1 




1 






1 










AA232508 


Q9Y397 


SAHH_HUMAN 


S-Adenosylhomocysteine hydrolase AHCY) 






1 


1 










1 


1 






AA007218 


Q96QY8 


BGH3_HUMAN 


Transforming growth factor, beta-induzed,66 kDa 


1 




1 


1 










1 








G01119 




TCPD_HUMAN 


Chaperonin containing TCP-1, subunit 4 (delta) 












1 








1 


1 




X75821 


Q04984 


CA11_HUMAN 


C0L1A1 










1 


1 




1 










U22055 


Q96AGO 


TF3A_HUMAN 


General transcription factor IMA 








1 




- 1 










1 




U14631 


P80365 


GR0_HUMAN 


GROt oncogene (melanoma growth stimulating 
activity, alpha) 








1 












t 


1 




M92439 


P42704 


MDPl.HUMAN 


MPDP4, MDP7 microsomal dipeptidase 






1 


1 








1 










AF029082 


P31947 


MPI2JHUMAN 


M-phase inducer phosphatase 2 






1 




















M86400 


P29312 


PROC.HUMAN 


Pyrroline 5-carboxilate reductase mRNA 






1 










i 

— i— 








L76465 


P15428 . 


SPRC_HUMAN 


Secreted protein, acidic, : cysteine-rich 
(osteonectin) 1 






1 










i 
| 








X57352 


Q0I628 


Q96AG0 


100 kDa coactivator mRN ! A : 
















i 








X03205 


P62888 


RL30JHUMAN 


SOS Ribosomal protein L30" . 
























M95787 


Q01995 


RL23_HUMAN 


60s ribosomal protein L23 ) 
















1 








225821 


P42126 


HS9B_HUMAN 


90-kDa HSP 
















i 








U96132 


Q99714 


RLA2_HUMAI\I 


Acidic ribosomal phosphoprotein P2 mRNA 
























AB028893 


P04643 


CA14_HUMAN 


Alpha-1 chain of collagen IV 
























X79239 


Q02546 


APA1.HUMAN 


Apo lipoprotein A-l 
























M13934 


P06366 


BMP4_HUMAN 


Bone morphogenetic protein-2B (BMP-2B) 


























M60854 


PI 7008, 


CSE1_HUMAN 


CAS chromosome segregation gene homologue 


















t 








M13932 


P08708 


CD81_HUMAN 


CD81 antigen 
















i 








X69150 


P25?32 


TCPZ_HUMAN 


Chaperonin containing TCP1, subunit 6A(zeta1) 


























L06498 


P17075 


CA21_HUMAIM 


Collagen alpha-2 type 1 
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Table 3. Continued 



. TRANSCRIPTOMICS 



Accession 

Number . Swiss-Prot Entry Name Name 


Yanagawa et al 2001 [21] 


Muro et al 2003 [27] 


Buckhaults et al 2001 [24] 


Birkenkamp et al 2002 [25] 


§ 

1 
re 
% 

re 

M 
CO 

S 

4) 
« 


<5T 

oJ 
c 

'J 


Parle-McDermott 2000 [20] 


Zhang et al 1997 [18] 


SAGE-NET et al 1997 [19] j 


CO 

r- 

<n 
m 

"ra 
9 

CH 

c 

(3 
-C 
M 


Williams et al 2003 [28] 


Notterman et al 2001 [23] 


< 
z 

Q 

o 

— 


Adaptor-tagged 


SAGE 


cDNA 


< 
Z 

o 

o 


cDNA 


SAGE 


SAGE 


SAGE 


SAGE 


cDNA 


Oligonucleotide Array 


L04483 


P35265 


cni Lll IMAM 

CQZ_HUIV1AI\I 


Complement component C2 




-j- 






















M64716 


P251 1 1 


nlOA_HUMAlM 


p„_ in—DMA 

Lsa-iymHiMA 




— 






















X77770 


P02383 


R0A1_HUMAN 


0NA binding protein UPI, liver mRNA fragment 


























W52460 


P14798 


TP2A_HUMAN 


DNA topoisomerase II (top2) 


























X55715 


P23396 


iron lit i * a a m 

lr39_HUMAIM 


Eukaryotic translation initiation factor(EIF3)mRNA 


— 
— 
























hlillO'iA 
IVI//ZJ4 


r *HJZ*f 1 


IP9P MIIMAM 


Ci il/arv/nfrif* tranelatinn intttafinn fnftnr *) ctrKunit 0 
CUKdlyUUt LldHolallUII HllUdUUII IdLlUI £., dUUUIHl L 


























M58458 


P12750 


GGH_HUMAN 


Gamma-glytamyl hydrolase 


























D 16992 


P10660 


G3P2_HUMAN 


Glyceraldehyde 3 phosphate dehydrogenase 


— 
























M77233 


P23821 


SYG_HUMAN 


Glycyl-tRNA synthetase 


— 
























F16294 


P09058 


RS8_HUMAN 


H19RNA 


























X61156 


P08865 


HS9A_HUMAN 


Heat shock Protein HSP 90-Alpha 


























(36055 


Q1354I 


HMGL«UMAN 


Hmgl mRNA for high mobility group protein 1 


























AA316619 


P04645 


RS23_HUMAN 


Human homolog of yeast ribosomal protein $23 
(D14530) 


























L14599 


Q15233 


IMD2_HUMAN 


IMP (inosine monophosphate} dehydrogenase 2 


























X79234 


P39026 


IFM1_HUMAN 


Interferon induced transmembrane protein 1(3— 
27| 


























L06505 


P30050 


LDHB.HUMAN 


Lactate dehydrogenase B(LDH-B) 


























X64707 


P26373 


RS2.HUMAN 


LLRep 3 


























X56932 


P40429 


4F2.HUMAN 


Lymphocyte activation antigen 4F2 large subunit 


























L25899 


P39030 


NPL1_HUMAN 


Nucleosome assembly protein 1-like 1 ( 


























X63527 


P14118 


GSHFLHUMAN 


Phospholipid hydroperoxide glutathione per- 
oxidase 


























X89401 


P46778 


GDFF_HUMAN 


Prostate drfferentation factor 


























H59771 


P23131 


RL6_HUMAN 


Ribosomal protein L6 


























M94314 


P38663 


RL18_HUMAM 


Ribosomal protein L18(RPL18| mRNA 


























LI 9527 


P08526 


RUXJ-WMAN 


Ribosomal protein U8a mRNA, complete cds 


























U 14968 


P46776 


RL7_HUMAN 


Ribosomal protein L7 


























U 14969 


P46779 


RL8_HUMAN 


Ribosomal protein L8 


























L38941 . 


P49207 


RS5_HUMAN 


Ribosomal protein S5 


























U 12465 


P42766 


RS19_HUMAN 


S 19 ribosomal protein 


























F19234 


P18077 


MYC.HUMAIM 


V-myc myefocytomatosis viral oncogene 
homolog (avainl 


























X66699 


P12751 


Q99497 


EST|PARK7; DJ1) 


1 


i 























$ V 
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Table 4. Results of transcriptornics research in human CRC. Most cited down-regulated genes out of 12 genome-wide or large-scale studies 



TRANSCRIPTOMICS 



Entry Name Name 


: Yanagawa et al 2001 [21] 


Muro et al 2003 [27] 


; Bucklmults et al 2001 [24] 


Birkenkamp et al 2002 [25] 


Takemasa et al 2001 [22] 


: Un et al 2002 [26] I 


o 

CM. 
C3 

I 

ti 
o 

§ 

o 
o 
o 

?5 

O- 


Zhang et al 1997 [18] 


SAGE-NET et al 1997 [19] 


Zhang et al 1997 [18] 


Williams et al 2003 [23] 


Notterman et al 2001 [23] 


Paired normal and cancer 


Normal and cancer 


B 
3 
*s 

□L 

X3 
(= 

to 
ta 
B 
© 
c 

01 

■a 
re 
(J 

Z3 

B 
z 


Paired n. and pr.tumors 


Paired n. and pr.tumors 


Paired n.muc and pr.tum or adenoma 


Cell lines 


Norm.muc and Prim.tum 


Primary tumors and cell lines 


Cell line 


Paired n.muc and pr.tum.n and adenoma 


Paired n.muc and pr.tum or adenoma 


< 

-ss 
a 
o 


Adaptor-tagged 


SAGE 


< 

o 

u 


< 
z 

a 

u 


< 

o 


UJ 
CD 
<X 

tn 


SAGE 


UJ 

< 
to 


SAGE 


< 
z 
a 

o 


Oligonucleotide Array 


CAH2.HUMAN 


Carbonic anhydrase II 


1 




1 




1 












1 


1 


CEAJ.HUMAN 


EST (BILIARY GLYCOPROTEIN) 






1 


1 


1 












1 


1 


BENE.HUMAN 


BENE 


1 








1 






I 








1 


CAHLHUMAN 


Carbonic anhydrase 1 


1 




1 


















1 


GUAU_HUMAN 


GCAP-II guanylate cyclase activator 2B; Uroguanylin; UGN 


1 






















1 


FABL_HUMAN 


L-FABP liver fatty acid-binding protein 1 


1 








1 
















ACDS.HUMAN 


Acyl-CoenzymeA dehydrogenase C-2 to C-3 short chain 






















1 


1 


ADHG_HUMAN 


ADH gama 2 subunit (aa-1— 375) 










1 














1 


T4S3_HUMAN 


Co-029; transmembrane 4 superfamily member 3 






1 




















DRAJHUMAN 


Colon mucosa-associated (DRA) 
























1 


KCRB_HUMAN 


Creatine kinase 


1 
























GUAN_HUMAN 


GUCA1B guanulate cyclase activator 1 B/guanylin 
























1 


095784 


IgG Fc-binding protein 


1 








1 
















ITMC_HUMAN 


Integral membrane protein 2C 










1 


1 














MK03_HUMAN 


MAPK3 






















1 




FXY3_HUMAN 


MAT8 protein 










1 
















ACTB_HUMAN 


mRNA fragment encoding cytoplasmic actin 


1 












1 


1 










SELP.HUMAN 


Selenoprotein P 










1 










1 






EST1 .HUMAN 


Liver carboxylesterase 1 [Precursor] 


















1 


1 




1 


CFAD_HUMAN 


Adipsin, complement factor DCDA (EST) 
























1 


ATPB_HUMAM 


ATP5B ATP synthase 










1 
















CAHC_HUMAN 


CA 12 










1 
















CANS_HUMAM 


Calcium dependent protease (small subunit) 
















1 










CALM_HUMAISI 


Calmodulin-! (CALM1 ( mRNA, 3' UTR, partial sequence 
















1 






1 




CAH4.HUMAN 


Carbonic anhydrase IV » 








1 
















1 
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Table 4. Continued 

TRANSCRIPTQMICS 







Yanagawa et a! 2001 [21] 


% 
o 

S 


Buckhaults et al 2001 [24] 


Birkenkamp et al 2002 [25] 


to 

% • 

«j 
tn 

2 • 

S 
.v 

CD 
h~ 


tz 


Parle-McDermott 2000 [20] 


Zhang et al 1997 [18] 


SAGE-NET et ai 1997 [1 9] 


j Zhang et al 1997 [18] J 


I 

m 
~a 

E 

.5 

I 


Notterman et al 2001 [23] 


Entry Name 


Name 


< 
z 
o 

o 


Adaptor-tagged 


SAGE 


< 
z 

Q 

O 


«C 

z 
o 

o 


< 
Z 

o 

o 


uu 

£3 
< 


SAGE 


SAGE 


SAGE 


| 

o 


Oligonucleotide Array 


MT1 C UI IRA AM 


r>nMA cimil^r tn nh* MlOQd.9 rrlcl human 

dunm similar io go. ivi iuj i *£_lum nuiiidii 


















1 






1 


U00748 


Ltb z carDoxyiesterase £ 










t 
















CMGA_HUMAN 


LgA 








\ 
















1 


PIHp ill IM AM 


clusterin 
























1 


If a f\f 1 II 11 1 AM 

K1CT_HUMAN 


Cytokeratin 20 


















1 






1 


KDGA_HUMAI\I 


Diacylglycerol kinase 
























1 


nrn Ml IflA A M 

DTD_HUMAN 


DTD sulfate transporter 








1 


) 
















ni A D UI Wh A M 

PLA8_HUMAN 


rpT 1 OOCOjI C 

to) l__b94 b_ 


















1 






1 


A T n A III IAI A M 

ATPA_HUMAN 


Fl-ATPase alpha subunit 










] 
















LEG3_HUMAN 


Galectin-3 (Galactose-specitic lectin 6\ (MAb-/ antigeni 








1 


1 
i 
















ABP_HUMAN 


HP-DA01 (diamine oxidase) 




















1 




1 


K1CS_.HU MAN 


Keratin 19 




















1 






K2C8„HUMAN 


Keratin, type II cytoskeletal B (cytokeratin 8) |K o) ILK o) 




















1 






DHB2_HUMAN 


LI 1708 Estradiol-17 beta- dehydrogenase 2 












] 












1 


MT1 H_HUMAN 


A J| - It ill. ' 4 |J 

Metallothionem 1 H 
























1 


■ ii inn lllfftAAAl 

MUC2_HUMAN 


Mucin 2, intestinal/tracheal 














1 


1 










MEPA_HUMAN 


PPH alpha gene 




















1 




1 


r% A A A 1 11 111 A 

PA2A_HUMAN 


RASF-A PLA2 gene 












\ 










1 




MT1F_HUMAI\I 


RNA helicase-related protein 
























1 


SBP1 HUMAN 


SBP selenium-binding protein 












1 


1 












MT1 INHUMAN 


Serine theonine kinase 39 (STE20/SPS1 homolog,yeast) 












1 












1 


MYL6_HUMAN 


Myosin light polypeptide 6 


















1 




1 




TETN.HUMAN 


TNA tetranectin 












1 












1 


VIPR.HUMAN 


VIPR1 vasoactive intestinal polyoeptide receptor 1 












1 












1 


PLA8_HUMAN 


Placenta-specific gene 8 protein 




















1 




1 


Q9NXM9 


Hypothetical protein FU20151 












I 












1 


T4S INHUMAN 


Transmembrane 4 superfamily member 1 








1 












1 






ALU2.HUMAN 


Alu subfamily SB sequence contamination warning entry 






1 






1 














S116JWMAN 


S100 calcium-binding protein A16 
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C14A„HUMAN 


Dual specificity protein phosphatase CDC14A 




















1 


1 





Second, results have been obtained using heterogeneous 
samples in particular cell lines, whole tissue biopsies, and 
epithelial cells purified from surgical specimens. Results 
obtained in cell lines do not allow accurate comparison be- 
tween normal and cancer cells, and the presence/absence of 
proteins of interest has to be confirmed in biopsies. For 
example, when 2-D PAGE protein patterns of normal human 



colonic crypts were compared with the CRC cell line LIM 
1863, the proteins spots from normal crypts matched only 
75-80% of the cell line spots and the relative expression 
levels of a large number of proteins differed [3J. When clin- 
ical biopsies are examined, results of phenotypic compar- 
isons depend on the type of samples examined (e.g., hetero- 
geneous whole tissue biopsies with inflammatory cells, 
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Table 5. Results of proteomics research in human CRC. Most cited up-regulated (orange), down-regulated (green), or mentioned (yellow) 
proteins out of 12 large-scale proteomics studies 



Accession 

Number Swiss-Prot Entry Name Name 


Simpson et al 2000 [23] 


Simpson et al 2001 [31] 


Demalte et al 2003 [33] 


Reymond et al 1999 [9] j 


Medjahed et al 2003 [32] 


Lawrie et al 2001 [30] 


Friedman et al 2004[35] 


5r 

a 

TO 
OJ 

-£ 


SDS-PAGE 


Free-flow electrophoresis 


2-D PAGE 


2-D PAGE 


2-D PAGE 


2-D PAGE 


H 2-D PAGE 


Ui 
C3 
< 

a. 

a 
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AF041 260 


075363 


ACTB_HUMAN 


mRNA fragment encoding cytoplasmic actin 
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Q9Y397 


ACT6 HUMAN 


Actin, Cytoplasmatic 2 (Gamma-Actin) 


•! : tl : *> 
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Q96QY8 


GTP.HUMAN 


Glutathione S-transf erase M3 (brain) 
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G01 119 


THIO HUMAN 


Thioredoxin (ATL-derived factor) (ADF) 




k \:. 
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Annpvin III ft innrnrtin till 

rM IHCAt II 1 11 \bipUl>UIUII III/ 
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Annpvin IV ft innrnrtin IV) 
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U14631 


P80365 


K1CR HUMAN 


Keratin tvDe 1 cvtoskeletal 18 (cvtokerattn 181 (K181 ICK181 






Ml 






: \: 






M92439 


P42704 


ATPB_HUMAN 


ATP5B ATP synthase 


•If 














:t\ 


AF029082 


*P31947 


PPIA_HUMAN 


Peptidyl-Prolyl Cis-Trans Isomerase A (Cyclophilin A) 




;:. j ^ 


;# 












M864O0 


P29312 


P0A3.HUMAN 


Probable protein disulfide isomerase ER-60 precursor 






IB 


M 








lit 
w 


L76465 
X57352 


PI 5428 


TPIS.HUMAN 


Triosephosphate isomerase (TIM) 


lil 














Q01628 


TCTP„HUMAN 


Translationally controlled tumor protein (TCTP)(p23) 






i . 


M 








X03205 


P10809 


CH60_HUMAN 


Heat shock protein 60 






Ill 












M95787 


Q01995 


VINC_HUMAN 


Vinculin 






111 












Z25821 


P42126 


EF1 G_HUMAN 


Elongation factor 1-gamma mRNA 


= # 










U96132 


Q99714 


TBALHUMAN 


Tubulin alpha-1 chain, brain specific 


















AB028893 


P04643 


CATD_HUMAN 


Cathepsin D (EC 3.4.23„5).{GENE.CTSD}Hamo sapiens 


















X79239 


Q02546 


EFI1_HUMAN 


Elongation factor 1*alpha, mRNA 




1 












M 13934 


P06366 


G3P2_HUMAN 


Glyceraldehyde 3 phosphate dehydrogenase 


















M60854 


PI 7008 


GR78_HUMAN 


78 kDa glucose-regulated protein precursor (GRP 78) 


: 0 
















M 13932 


P08708 


EN0A_HUMAN 


Alpha enolase (2-phospho-D-glycerate hydrolyase) 


■if 
















X69150 


P25232 


CRTC_HUMAN 


Calrecticuium precursor (CR55) (calregulin) . 


111' 
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P17075 


C0F1_HUMAN 


Cofilin, non-muscle isoform 
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L04483 


P35265 


HS7C_HUMAN 


Heat shock Cognate 71 kDa protein 
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P25111 


HBB_ HUMAN 


Hemoglobin beta chain.{GENE: HBB) h.s 












■ f; 






X77770 


P02383 


K2C8_HUMAN 


Keratin, type II cyto skeletal 8 (cytokeratin 8) (K 8) (CK 8) 


■'Vr 
















W52460 


PI 4798 


RM12_HUMAN 


Mitochondrial 60s ribosomal protein L7/L12 precursor 
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X55715 


P23396 


GR75_HUMAN 


Mitochondrial stress-70 protein precursor (GRP 75) 


otX; 
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M77234 


P49241 


S0DC_HUMAN 


Superoxide pismutase (Cu-Zn) 








M 










M 58458 


P12750 


KCRB.HUMAN 


Creatine kinase 


















D 16992 


PI 0660 


143Z_HUMAN 


14-3-3 protein zeta/delta (protein kinase C inhibitor protein- 1) 
















M77233 


P23821 


RL4_HUMAN 


60s ribosomal protein L4 (L1) 


■::iy: 










F16294 


P09058 


ANX! .HUMAN 


Annexin I (lipocortin I) (calpacttnl I) (chromobindin 9) (p35) 














II 




X61156 


P08865 


R0A1_HUMAN 


DNA binding protein UPI, liver mRNA fragment 














L36055 


Q13541 


FABL_HUMAN 


L-FABP liver fatty acid-binding protein 1 

















© 2005 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim 



www.proteomics-jburnal.de 



3074 E. Sagynatiev et al. 



Proteomics 2005, 5, 3066-3078 



Table 5. Continued 



Accession 

Number Swiss-Prot Entry Name Name 


Simpson et al 2000 [29] 


Simpson et al 2001 [31] 


Demalte et al 2003 [33] 


Reymond etal 1999 [9] 


Medjahed et al 2003 [32] 


Lawrie et al 2001 (30] 
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SDS-PAGE 


Free-flow electrophoresis 


2-D PAGE 
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P04645 


PAB1_HUMAN 


Poly(A) binding protein, mRNA 
















LH599 


Q15233 


RL3_HUMAN 


Ribosomal protein L 3 
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P39026 


RS19_HUMAN 


S 19 ribosomal protein 


















L06505 


P30050 


SAHH.HUMAN 


S-Adenosylhomocysteine hydrolase AHCY) 
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X64707 


P26373 


TERA_HUMAN 


Transitional endoplasmic reticulum ATPase (TER ATPase) 


















X56932 


P40429 


GDIR_HUMAN 


Rho GDP-dissociation inhibitor 1 
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AC0N_HUMAN 


Aconitate hydratase, mitochondrial precursor (aconitase) 
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Alcohol dehydrogenase (NADP-f ) (aldehyde reductase) 






ill 












X89401 


P46778 
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Alpha-Actinin 4 


m 
















H59771 


P23131 


NPM_HUMAN 


B23 nucleophosmin 
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CD44 antigen (Phagocytic glycoprotein 1) 
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Cytochrome c oxidase polypepti-de Va, mitochondrial (EC 1.9.3.1) 
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Cytochrome C oxidase polypeptide VB precursor.COX5B 
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Endoplasmatic reticulum protein ERp29 (ERp31) (ERp28|. 
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Fructose-Biphosphate Aldolase A (Muscle-Type Aldolase) 
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Glutamate dehydrogenase 1 precursor (GDH) 
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Heat shock Protein HSP 90-Alpha 
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P32969 
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Heterogeneous nuclear ribonucleoprotein L (HNRPN L) 
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Keratin, type II cytoskeletal 1 (cytokeratin 1) (Kl| (CK1) 


















MB1182 


P2B288 
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Peroxiredoxin 1 (EC 1.1 1.1.-) (Thioredoxin peroxidase 2) 
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Spectrin beta chain, erythrocyte (Beta-I spectrin) 
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Thioredoxin-dependent peroxide reductase, mitochondrial 
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Tropomyosin alpha 3 chain (Tropomyosin 3) 
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Non-meta static cells 1, protein (NM23A|expressed in 
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14-3-3 PROTEIN SIGMA 
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60s ribosomal protein L24(L30) 
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Ml 7886 j 
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AP A INHUMAN 


Apolipopratein A-l 
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DHSA_HUMAN 


SDH2 succinate dehydrogenase flavoprotein subunit 
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Table 6. Reproducibility of transcriptome results using proteomics tools (a) and vice versa (b). Proteomics results have been reproduced in 
67% of cases in transcriptomics studies. Results from transcriptomics studies are more difficult to reproduce using proteomics. 
tools (25%). Only two gene products can be retrieved in three and more proteomics and transcriptomics studies (actin and creatin- 
kinase) 



a. Transcriptomics > Proteomics 
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Table 6. Continued 
b. Proteomics > Transcriptomics 
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Annexin III (Lipocortin III) 
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Elongation factor 1-alpha, mRNA 
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necrosis, blood, stool, etc.), so that standardized sample 
preparation procedures are critical for obtaining reproduc- 
ible results. Several sample preparation methods have been 
described, in particular fluorescence-activated cell-sorting 
(FACS) [4], laser capture microdissection (LCM) [5-8], 
irnmunomagnetic beads separation [9], and cellular frac- 
tionation [10]. Unfortunately, these sample preparation 
procedures were barely applied in CRC. In our experience, 
the beads -based method is characterized by several advan- 
tages when compared with other cell purification proce- 
dures [11]. 

Third, gene expression patterns depend on the arrays 
technology platform. In transcriptomics studies any factors 
may afFect die outcome of a microarray experiment, in par- 
ticular technical, instrumental, computational, and inter- 
pretative factors. In fact, lack of reproducibility and accuracy 
is a major concern in microarray studies [12]. When cross- 
platform comparison was performed, reproducibility was 
insufficient: only four genes from a set of 185 common genes 
selected behaved consistently on three array platforms, and 
agreement of about 30% was found between two brands [1 3]. 

Fourth, in proteomics studies, 2-D PAGE or 2-D DIGE 
have well-known technological limitations. In CRC, even 
after epithelial cell enrichment using magnetic beads, the 
mean CV of repeated 2-D PAGE analysis with silver staining 
was found to lie between 20 and 28%. Only 47% (interrun) to 
76% (intrarun) of spots could be matched within a triplicate 
experiment. Interindividual phenotypic variability was high. 
Thus, even under well-defined experimental conditions, 2-D 
PAGE parallel analysis of paired CRC samples is hampered 
by a significant variability [2]. 

Fifth, the methods applied for generating, formatting, 
storing, retrieving, and querying data are of outmost impor- 
tance to assess methodological and biological variation in 
gene expression analysis. Unfortunately, due to the small 
sample size (number of patients), large number of variables 
examined at once, and absence of double or triple experi- 
ments (arrays and gels are expensive and samples are rare) 
statistical analysis is often not valid. In particular, assessing 
the reproducibility of a variable is necessary (e.g., using the 
intradass correlation coefficient) for comparing multiple 
samples at once. The use of median values instead of mean 
values has been shown to improve data correction [14]. It has 
also been proposed to use housekeeping genes as endoge- 
nous controls [15]. A dedicated society, The Microarray Gene 
Expression Data (MGED) Society, has been formed to facil- 
itate the sharing of gene expression data generated by func- 
tional genomics and proteomics experiments [16]. 

Finally, correlation between results of transcriptomics 
versus proteomics results is low. For CRC, there is no pub- 
lication comparing mRNA and protein expression for a 
cohort of genes. However, extrapolation is reasonable from 
another epithelial cancer (lung adenocarcinoma), where 
such comparison has been performed. Only a subset of the 
proteins (17%) exhibited a significant correlation with 
mRNA abundance [17]. 
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Obviously, many genes and factors found to be differen- 
tially regulated (both in transcriptomics and proteomics 
studies) do not play a causal role in CRC carcinogenesis. For 
example in the studies under investigation, at least 17 
mRNAs encoding ribosomal proteins were identified to be 
dysregulated using cDNA arrays and 39 ESTs using SAGE 
technology. This broad dysfunction of protein synthesis, in 
particular of small molecules synthesis, has been reported 
not only in cancer but also in several other human diseases, 
where etiologies have been linked to mutations in genes of 
the translational control machinery [36]. 

However, some findings might be of particular interest 
in human CRC. For example, a small molecule group found 
to be dysregulated in human CRC is the 14-3-3 proteins 
family. 14-3-3 proteins are ubiquitous within all. eukaryotic 
cells and participate in protein kinase signaling pathways. In 
particular, they are involved in phosphorylation-dependent 
protein-protein interactions that control progression 
through the cell cycle, initiation and maintenance of DNA 
damage checkpoints, activation of MAP kinases, prevention 
of apoptosis, and coordination of integrin signaling and 
cytoskeletal dynamics [37]. Given the prevalence of specific 
14-3*3 isoforms expression in several human epithelial can- 
cers [38, 39], these proteins may be involved in cancer 
tumorigenesis and particular isoforms may be useful as 
therapeutic targets in human CRC. 

In summary, we propose a gene expression data ware- 
house in human CRC that is intended to help researchers 
active in the field to get an overview of the data available. 
However, reproducibility of results obtained in different 
studies was disappointing. 

As a matter of fact, the development of some types of 
unified processes for generating information, formatting, 
storing, retrieving, and querying data, regardless of the tech- 
nology used to generate it, is of central importance for 
building gene expression databases. Such unified processes 
have been proposed (see above) but have found only limited 
recognition so far. This is unfortunate because these metho- 
dological issues must be solved before trying to integrate 
experimental data from different sources into functional 
proteomics studies at the bench or in silico. The present 
meta-analysis of gene expression highlights the need for 
including a sufficient number of patients, analyzing only 
purified epithelial cells, using clinical standards (such as the 
international classification of disease, histopathology, and 
staging), assessing the variability of the technology applied, 
considering common standards for microarray data annota- 
tion and exchange, and finally for developing software 
implementing these standards and promoting the sharing of 
these high quality, well -annotated data within the life sci- 
ences community. 

In the absence of such unified processes, reproducibility of 
results within a laboratory will continue to be low, confirma- 
tion of results between different groups will stay difficult, and 
identification of diagnostic and therapeutic targets will remain 
a lottery. These problems are challenging the whole drug dis- 
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covery process from the very beginning, namely target identi- 
fication, so that target validation, assay development, and 
prioritization of compounds remains a high risk endeavor. 

We are very grateful to G. Nestler for her excellent technical 
assistance. 
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Abstract 

Prostate cancer is the most diagnosed cancer and the second leading cause of cancer death among men in the United 
States. Ability to detect this cancer early and availability of better prognostic markers are critical in order to decrease 
morbidity and mortality of prostate cancer With the recent development in gene expression analysis methodology, 
expression profiles of thousands of genes can be generated in tissue samples and cell lines. Comparison of the global 
gene expression patterns between normal prostate and tumors at different stages may allow us to understand better 
the molecular mechanism of prostate tumorigenesis and progression. Different cancer cell lines and tissues appear 
to have different gene expression patterns that provide a new tool to classify tumors. Molecular classification of 
prostate cancer holds great promise for early detection and prognosis of this disease in the future. In this review, 
we summarize some of the recent mRNA and protein expression profiling studies performed in prostate cancer. 
Further, we discuss the potential benefits and limitations of current profiling technology. 



Introduction 

Prostate cancer is the most frequently diagnosed cancer 
and the second leading cause of cancer death among 
men in the United States. Considerable progress has 
been made in the early detection and treatment of 
prostate cancer over the last two decades. Nonetheless, 
mortality from prostate cancer remains a significant 
health care problem. Two critical issues for reduc- 
tion of mortality and morbidity from this disease 
are early detection and improved prognostic markers. 
Interestingly, a majority of men will develop histo- 
logical prostate carcinoma by their fifth decade [1], 
however, many will never progress to clinically relevant 
disease. Some clinical cancers are more aggressive, and 
many are slow-growing. Early detection of aggressive 
prostate cancer is important, especially in the organ- 
confined stage, as treatment at this phase will likely 
cure the cancer. However, even if the prostate cancer 
is left untreated, many men with clinically-diagnosed, 
less aggressive prostate cancers will not die of this dis- 
ease rather other causes. With an increasing number 
of men with clinically-diagnosed prostate cancer, this 



problem leads to an enormous cost to the health care 
system as well as the emotional trauma and potential 
physiological side effects to the individual. Therefore, 
development of better prognostic markers is critical to 
the effort to identify and treat patients with unfavorable 
prognosis. 

Prostate is a hormone-regulated organ. Androgens 
affect prostatic epithelial cell proliferation and differ- 
entiation, and play a critical role in tumorigenesis. 
A common treatment of prostate cancer involves the 
deprivation of androgen, clinically known as androgen 
ablation. Despite the initial response of patients to hor- 
monal therapy, the vast majority of patients eventu- 
ally relapse. Molecular mechanisms involved in the 
development of hormone-refractory prostate cancer 
are unknown. However, several biological processes 
may contribute to this development. Mutations on 
the androgen receptor (AR) can cause a ligand- 
independent activation^ or promiscuity of the receptor, 
i.e. alteration in the ligand binding specificity such that 
it binds hormones such as progesterone and estradiol 
[2,3]. Another phenomenon observed in 28-30% of 
recurrent hormone-refractory prostate cancer tumors 
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is AR amplification [4-6]. Increased levels of AR 
may allow tumor cells to survive an<J thrive in low 
androgen environments. Specific growth factors and 
cytokines, such as insulin-like growth factor-I (IGF-I) 
[7] and interleukin-6 (IL-6) [8] may also activate 
the AR in a ligand-independent manner. Coregulators 
can affect receptor transactivation depending upon the 
delicate balance between coactivator and corepressor 
expression levels [9,10]. Regardless of the mecha- 
nism leading to androgen independence, downstream 
genes in the androgen pathway play a critical role 
in the development of hormone-refractory prostate 
cancer. 



Identification of differentially expressed genes 

One of the keys to understanding prostate growth 
and tumorigenesis is the identification of differentially 
expressed genes between normal and tumor prostate 
cells, and genes regulated by androgens. Much effort 
has been directed to identify such genes over the 
years. Prostate-specific antigen (PSA) is probably the 
best characterized androgen-regulated gene and a com- 
mon diagnostic marker. Many other genes related to 
growth, adhesion, cell cycle and apoptosis have been 
studied. 

Several lines of evidence suggest that insulin-like 
growth factors (IGFs) may play an important role in 
prostate cancer. IGF-I is a potent serum maker for 
prostate cancer risk [11]. Increased expression of IGF-I 
and its receptor may be associated with tumor progres- 
sion [ 12] . In addition, the AR can be activated by IGF-I 
in a ligand-independent manner [7]. A reduction in the 
cell adhesion molecule E-cadherin and its interacting 
molecule catenin may play a role in prostate cancer 
metastasis [13,14] and is associated with poor progno- 
sis and survival [15,16]. Over the last decade, it has 
become apparent that AR requires accessory factors for 
optimal activation of target genes. Numerous coreg- 
ulators have been identified with diverse structures 
and potential mechanisms of coregulation. Expression 
level of some AR coregulators may correlate with the 
grade of prostate cancer [17]. Many genes involved in 
cell cycle regulation such as P53, RB, P21, P27 and 
PI 6, and in apoptosis such as BCL-2 and BAX have 
been evaluated in prostate cancer. Recently, the tumor 
suppressor PTEN has also been assessed. Progresses 
have been made in understanding the molecular mech- 
anism of prostate cancer development. However, crit- 
ical issues remain, i.e. early detection of aggressive 



tumors and markers for better prognosis of clinical 
disease. 



Comprehensive gene expression analyses 

With the recent development in gene expression analy- 
sis methodology, expression profiles of thousands of 
genes can be generated in cancer cells. Different cancer 
cell lines and tissues appear to have different gene 
expression patterns [18,19], and such differences allow 
for molecular classifications of cancer [20,21]. 

Using the well established Expressed Sequence 
Tag (EST) method, prostate specific cDNA libraries 
were constructed [22], individual cDNA clones were 
sequenced [22,23], and a web-based prostate expres- 
sion database (PEDB) was constructed [24]. ESTs and 
full-length cDNA sequences derived from more than 
40 human prostate cDNA libraries are maintained and 
represent a wide spectrum of normal and pathological 
conditions. Prostate ESTs were assembled into distinct 
species groups using the multiple alignment program 
CAP2 and were annotated with information from the 
GenBank, dbEST and UniGene databases. Differential 
expression of each EST species can be viewed across 
all libraries using a Virtual Expression Analysis Tool, 
the PEDB is a useful addition to the public EST proj ect 
originated and organized by the National Institutes of 
Health (http://www.ncbi.nlm.nih.gov/). Prostate tumor 
tissues are highly heterogeneous. A given tissue often 
contains benign, PIN lesions and carcinoma cells of 
various grades as well as stromal cells. With the devel- 
opment of laser capture microdissection method, cells 
of specific types and from specific areas can be pre- 
cisely dissected. Prostate cDNA libraries have also 
been successfully constructed from microdissected 
tissues [25]. 

Although thousands of genes can be determined, the 
EST method is relatively slow and expensive. Using the 
Serial Analysis of Gene Expression (SAGE) method, 
a large number of genes can be quantified more 
efficiently [26]. An expression profile of androgen- 
regulated genes from LNCaP cells has been generated 
by SAGE [26]. Among 123,371 transcripts analyzed, 
a total of 28,844 distinct SAGE tags were identified 
representing 16,570 genes. Some 351 genes were sig- 
nificantly affected by dihydrotestosterone (DHT) treat- 
ment at the RNA level (p < 0.05), of which 147 were 
induced and 204 repressed by DHT treatment [27]. In 
another study, also conducted using LNCaP cells, a 
total of 83,489 SAGE tags representing 23,448 known 
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genes or ESTs and 1,655 potentially novel genes were 
identified [28], Comparison of transcripts between 
control and R 1881 -treated LNCaP cells revealed the 
induction of 136 genes and repression of 215 genes 
in response to the synthetic androgen (p < 0.05). 
These identified genes aremostlikely comprised of two 
groups; namely those regulated directly by androgen 
and those targeted by androgen-regulated gene prod- 
ucts. Interestingly, both studies suggested that there 
were approximately 300 genes regulated by androgen 
and, somewhat surprisingly, there, were more genes 
repressed than stimulated by androgen treatment in 
LNCaP cells. It is noteworthy that the androgen- 
regulated genes (^300 per study) identified by these 
two studies are different with some overlaps. Such 
discrepancy may be contributed to the use of DHT 
[27] versus a more stable synthetic androgen analog 
R1881 [28] and dosage of KT'M [27] versus 10~ 8 M 
[28]. Induction or repression of androgen-regulated 
genes is also time-dependent. Indeed, PSA was found 
to be induced at 4-611, peaked between 6 and 20 h t 
and gradually declined after 20 h post-treatment of 
DHT [27]. 

Differentially expressed genes between normal and 
prostate tumor tissues have also been determined by 
SAGE. In one study, a total of 133,217 transcripts 
were analyzed, with 35,185 distinct SAGE tags iden- 
tified representing 19,287 genes. Comparison of the 
transcripts in normal and tumor tissue revealed 156 dif- 
ferentially expressed genes (p < 0.05), of which 
88 genes were up-regulated and 68 genes were down- 
regulated in the tumor tissue [29]. Based on SAGE 
data, the transcriptome for human prostate was esti- 
mated to be approximately 37,000 distinct transcripts 
[29]. SAGE data for another study (Library PIG 17 
normal versus Library PR3 17 tumor) is available at the 
Cancer Genome Anatomy Project (CGAP) web site 
and a third independent SAGE experiment (Library 
PrCA-1 versus Library Normal prostate) is ongoing 
(http://www.ncbijilm.nih.gov/SAGE/sagexpsetupxgi). 

A comprehensive SAGE database for many human 
cancers, including the prostate, is accessible through 
the CGAP web site. The levels of gene expression 
can be determined virtually by searching this data- 
base using either a SAGE tag or a UniGene num- 
ber. More importantly, differential expression of genes 
between tissues or cell lines can be explored by 
thexProfiler algorithm (http://www.ncbi.nlmjiih.gov/ 
SAGE/sagexpsetup.cgi). 

The use of DNA microarray for expression profil- 
ing has steadily gained popularity during the past five 



years. Gene expression was measured in normal and 
organ-confined prostate cancer by 588-gene arrays 
[30].Levels of theGSTMl, MCP- 1 , TNFR- 1 , TGF-/J 3 
and ID-1 genes were found to be significantly reduced 
in tumor cells. In another study, expression profiling 
was performed in primary human prostate cancer and 
benign prostatic hyperplasia (BPH) using 6500-gene 
arrays. Gene expression in each of the 16 prostate 
cancer and 9 BPH specimens was compared with 
a common reference to generate normalized mea- 
sures for each gene across all of the samples. Using 
an analysis of complete pairwise comparisons of all 
expression profiles, discernable patterns of overall 
gene expression that differentiate prostate cancer from 
BPH were clearly observed [31]. Further analysis of 
the data identified 210 genes with statistically signifi- 
cant differences in expression between prostate cancer 
and BPH [31]. Microarray analysis conducted by a 
different group identified IGFBP2 as one of the genes 
highly expressed in CWR22R hormone-refractory 
prostate tumor xenograft and hormone-refractory 
clinical tumors [32]. Protein levels of IGFBP2 were 
analyzed by immunohistochemistry on tissue arrays. 
High expression of the protein was observed in 100% 
of the hormone-refractory clinical tumors, and 36% 
of the primary tumors, but not in the benign prosta- 
tic specimens [32]. Another gene of interest, hepsin, 
a transmembrane serine protease, was identified by 
four independent studies [31,33-35]. Expression of 
hepsin was significantly correlated with measures of 
clinical outcome. Two independent SAGE analyses 
and a cDNA microarray study, however, did not iden- 
tify hepsin as a differentially expressed gene [29,36] 
(httpy/www.ncbi.nlrrLmh.gov/SAGE/sagexpsetup.cgi; 
Library PR317 normal versus Library PIG 17 tumor). 
The failure in detecting differential expression of hep- 
sin by SAGE may be due to the difference in tissue 
samples. 

EST, SAGE and cDNA microarray studies reveal 
significant and widespread differences in gene expres- 
sion patterns between normal adjacent prostate, BPH, 
localized, metastatic and hormone-refractory prostate 
cancer. Gene expression analysis of prostate tissues 
should help to delineate the molecular mechanisms 
underlying prostate malignant growth and identify 
molecular markers for diagnostic, prognostic and thera- 
peutic use. Indeed, many novel genes have been cloned 
as the result of gene expression profiling [37^t2]. 
Gene expression patterns will undoubtedly be used 
for molecular classification of prostate cancer in the 
future. 
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Proteomics 

Much effort in gene expression profiling has been at 
the transcript level. However, it is clear that raRNA 
expression data alone are insufficient to predict cellular 
functional outcomes. For instance, raRNA expression 
data provide very little information about activation 
state, post-translational modification or localization of 
corresponding proteins. Moreover, there is evidence 
highlighting the disparity between mRNA transcript 
and protein expression levels, fn a study of androgen- 
regulated genes by SAGE and proteomics, some 
351 genes out of a total of 16,570 genes measured 
were found to be significantly affected by DHT treat- 
ment at the RNA level. Proteomic analysis demon- 
strated that 44 protein spots were affected at least 
two-fold in response to androgen, out of a total of 
1,031 protein spots analyzed. The change in intensity 
for most of the affected proteins identified could not 
be predicted based on the level of their corresponding 
RNA [27]. It is believed that there is a tight coupling 
between transcription and translation in prokaryotes 
and disparity between the levels of mRNA and protein 
increases from prokaryotes to low eukaryotes in mam- 
mals. The lack of correlation between mRNA and cor- 
responding protein is evident even in low eukaryotic 
cells such as yeast [43], Therefore, it will be neces- 
sary to profile both mllNA and protein for a complete 
picture of how cells are altered during malignant 
transformation. 

Compared to mRNA profiling, classical quantita- 
tive analysis of protein expression levels is much more 
time-consuming because proteins are analyzed one at 
a time. Another limitation is the fact that only proteins 
expressed at relatively high levels can be determined. 
Nevertheless, progress has been made in proteomic 
analysis of prostate cancer. A study, involving the col- 
lection of cells from prostate hyperplasia and prostate 
carcinoma, subsequendy subjected to two-dimensional 
gel electrophoresis (2-DE) and computer assisted 
analysis, demonstrated considerable differences in 
expressed protein patterns [44]. Malignant rumors 
. showed significant increases in the expression level of 
proliferating cell nuclear antigen (PCNA), calreticulin, 
HSP 90 and pHSP 60, oncoprotein 18(v), elonga- 
tion factor 2, glutathione-S-transferase pi (GST-pi), 
superoxide dismutase and triose phosphate isomerase. 
In addition, decreases in the levels of tropomyosin- 1 
and -2, and cytokeratin 18 were observed in prostate 
carcinomas compared to prostate hyperplasias [44]. 
In a subsequent study, the decrease in tropomyosin 



expression was confirmed in malignant tissues [45]. 
These studies were done with grossly dissected prostate 
tissues. It has been shown that 2-DE analysis can 
also be performed with laser capture microdissected 
samples [46]. 

The effects of androgen on protein expression 
have also been studied. Protein expression profiles of 
androgen-stimulated prostate cancer cells were gener- 
ated by 2-DE. Mass spectrometric (MS) analysis of 
androgen-regulated proteins in these cells identified 
the metastasis-suppressor gene NDKA/nm23; a find- 
ing that may explain a marked reduction in 
metastatic potential when these cells express a 
functional androgen receptor pathway [47]. Using 
surface-enhanced laser desorption/ionization (SELDI) 
MS analysis, reproducible and discriminatory protein 
biomarker profiles can be obtained from as few as 
25 cells in less than 5 min, from the time of dissection to 
the generation of the protein fingerprint. Furthermore, 
these protein pattern profiles are discriminatory for dif- 
ferent tumor types, and reveal reproducible changes in 
expression as cells undergo malignant transformation. 
Consistent protein changes were identified in microdis- 
sected cells from patient-matched tumor and normal 
epithelium in three out of three malignant prostate 
tissue sets [48]. A reverse-phase protein microarray 
study, in which cell lysates from matched normal 
prostate epithelium, PIN and invasive prostate cancer 
were spotted onto slides, showed that cancer progres- 
sion was associated with increased phosphorylation 
of Akt, a suppressor of apoptosis pathways, as well 
as decreased phosphorylation of ERK [49]. At the. 
transition from histologically normal epithelium to 
PIN, a statistically significant surge was observed in 
phosphorylated Akt and a concomitant suppression of 
downstream apoptosis pathways which proceed the 
transition into invasive carcinoma [49]. 

Conclusion and future directions 

Functional genomics is an exciting and rapidly chang- 
ing field. Array technology has significantly improved 
and proteomic techniques are swiftly evolving. Still, 
molecular profiling of prostate cancer is in its infancy. 
The massive amounts of data generated by mRNA 
and protein profiling require efficient, capable bioinfor- 
matics to extract biologically meaningful conclusions. 
Preliminary studies in prostate and other cancers sug- 
gest that different cancer cell lines and tissues have dif- 
ferent gene expression patterns (see previous section) 
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[18,19], and such differences allow for molecular 
classification of cancer [20,21]. 

Like any emerging technology, many limitations 
must be overcome before it can be applied to clinical 
practice. Although microarrays show great promise, 
this methodology has not matured to the point of 
consistently generating robust, and reliable data when 
used in the average laboratory [50]. It will also 
require the integration of other sources of biologi- 
cal information, such as gene identity, biochemical 
pathway for a given gene and pathology of tissues, 
to understand the cellular program of gene expres- 
sion and molecular basis of cell transformation [51]. 
The industrialization of proteomics demands, repro- 
ducible and high-throughput profiling technologies that 
current 2-DE cannot achieve. New technologies in . 
protein arrays, either on chips or with self-encoded 
elements in solution, hold much promise for deci- 
phering the diverse and immense proteome [52-54]. 
Imaging mass spectrometry is an interesting new tech- 
nology that allows direct mapping of proteins present 
in tissue sections [55], Protein-protein interaction 
mapping [56-58] and protein subcellular localization 
[59,60] should provideuseful complementary informa- 
tion. In molecular profiling of cancer tissues, perhaps 
the most important of aU, is the preparation of tissue 
samples. Profiling results are only as good as the tissues 
isolated for the experiment. Accordingly, laser capture 
microdissection may be the standard for tissue pro- 
curement in the future [61], With the advances of pro- 
filing technology, expression patterning holds great 
promise for prognosis, early detection and molecular 
classification of prostate cancer. 
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MOLECULAR ANALYSIS OF CANCER USING DNA AND PROTEIN 

MICROARRAYS 

Jubd Msdox-Cuipido find Sam KL Hanasb* 

DCTBODUCTTON 

Substantial progress, has bees made Is ov vAderstendrog of cancer as a umbutcp. 
complex* disuse thai involves progressive changes in the gpuns and pioteemc. Venous 
types erf cancers share mmitaritits as well as exhibit difiocsecs in oeJIolar, biachesncs) end 
molecular traits. Mkroanay technologies have the potential of providing valuable insight 
regarding disease processes. the amy fonnat b now an established method for globe) 
analysis of nucleic adds, and tn the pan few years this approach has been adapted for 
protein studies {Table I). MicToamys allow profiling of tumors* genomes; cranscriptomes 
end proteomes el a scale unattainable previously. 
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CANCER PROWLING USING DNA MJCROARRAYS 

Genomics studies, especially profiling gene expression, using DNA array have had a 
or men do as Impact on biomedical research, resulting in well over 1.000 published reports 
in 2002 alone. A substantial number of published (todies dealt with cancer. Disease related 
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rppl I can ons of DNA mioroarrays include uncovering unsuspected associations between 
genes and specific clinical features of disease that are helping devise novel molecular based 
disease classifications. Most published tumor studies using DNA microarrays have either 
examined a pathologically homogeneous set of turners to Identify clinically relevant 
subtypes, for example survivors vs non-nirmors. or pathologically distinct subtypes 
belongjog to the some lineage, for example limited stage vs advanced stage tumors to 
identify molecular correJetes. or rumors of different lineages to identify molecular 
signatures for each lineage. 

One of the landmark studies that have attracted much interest with respect to the 
potential contribution of DMA microarrayi to uncover novel classes of tumors; is an 
analysis of diffuse large tVceil lymphoma, the mast common subtype of non-Hodgtin'a 
lymphoma 14 . Large B-cell lymphoma is a cticicalry heterogeneous disease. Only 4091 of 
patients have a good response to current therapy with a prolonged survival. A systematic 
cJiaracterizaboD of gene expression in this disease using DNA aucr ua ii a ya uncovered a 
• diversity in gene expression that reflected variation in tumor proliferation rate; host 
response and differentiation slate of tho tumor. Two roo) ocularly distinct forms of cSftbse 
large B-ccll lytrrphaxna were ao wvutd which had gene expreokm patterns indicative of 
difTereot stages of B-oeU differentiation. One type expressed genes charecteristic of 
germinal center B ceQs and bad a significantly better overall survival than the second type, 
which expressed genes, normally induced during hi vitro activation of peripheral Wood B 
cells. The enaJytds therefore identified previously undetected and clinically significant 
subtypes of lymphoma. 

Studies to classify breast carcinomas based on gene expression profiles revealed that the 
romori coold be classified Into a basal epithelial* like group, an )ZRBB2- o vej expj tgsiflg 
group aod a normal breast-lite group 11 M . A humna) eplhthal/cstrogen rcccptv-postovc 
group could be divided into at least two subgroups* each with a distinctive expression 
profile. Survival analyses on a subeohort of patients with locally' ctdvanced breast cancer 
uniformly treated, in a prospective ctudY. showed rfptificantly different outcomes for 
poD'ents belonging to the various groups, inducing a poor prognosis for the basal-like 
subtype and o significant difference in outcome for the two estrogen rcccptor-posirive 
groups. In an iodeeenceni study of 38 invasive breast cancers, striking molecular 
differences between ductal carcinooui specimen* were uncovered that ted to a suggested 
new classification for extiogen-receptor negative breast cancer". Similarly, a study of 58 
node-eegBtivc breast carcinomas discordant for BR status also uncovered a list of genes 
which ducriminated tumors seconding to ER status 18 . Artificial neural networks could 
accurately predict ER status even ofter excluding top discriminator genes. Including BR 
itself. Only a small proportion of the ICO most mrportam ER oUscrimmator genes are 
regulated by estradiol in MCF-7 cells. 

An informative approach to analyze DNA microarray data in clinical studies is to divide 
such data into a training set to uncover associations between specific genes and certain 
clinical features of the disease, and o testing set to validate these associations. However 
since both Oruroog and testing sets ere derived from dte same pool of patients whose 
samples were available to the. investigators* the extent to which such associations may 
apply to other patients not included in the study, who may have different characteristics, 
cannot be inferred. Beer el ai. have undertaken a study of lung cancer in which the 
association they observed between a set of genes and patient survival was validated with o 
testing set of tumors they had available and further validated with an independent set of 
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tumors for which rnicroarray daia was collected by another group no: aisociaicd with (hit 
said/". Such eaten rive validation dearly indicated the robustness of the associadoa 
uncovered bttweco o set of genes and survival in lung edoKsca/cinoirta. 

The numerous published stodies using DNA micro arra>3 justify the use of tho 
technology for uncovering paUcms * of • gene expression that are clinically informative 
However it la substantially more difficult to develop an understanding of disease at o 
mechanistic level using DNA microsrreys. For most of the published studies it is unclear 
how well RNA levels reported eorretae with protein levels. A leek of correlation may 
bnpiy that the predicdve propeny of dte gene(s) is rndependeot of geno function. In studies 
of lung cancer, Chen et al. ccflectnd both DNA mlcroarray and 2-D PACE data, which 
allowed them to compare mRNA and protein levels in the same tumors . The integrated 
intensities of 163 protein spma representing protein produces of 98 genes were analyzed in 
76 lung edcirocarctnonas end 9 unaffected lung tissues using 2-D gels. For the same 83 
sample*. oiftNA levels were determined using oUgormcleotido miuoanoys. Oaly 21 of 98 
genes £21.4%) had a suuiiticalr/ significant correlaiion between .protein and mRNA levels 
(r >C2445; ? <D.05). The mRNA/proieio correlation coefficients also varied between 
isoforms of the same protein, indicating potentially isofcrin-specific caechaiuams for the 
regulation of protein abundance. 



PROTCJN M ICHO ARRAYS Iff CAffCER RESEARCH 

Despite the odvanccs In our urtdmtanding of the molecular basis of cancer, substantial 
pips remain bosh m our undemanding of cancer pathogenesis and In the development of 
effective strategies for earl/ diagnosis and for treatment. A proteosnfc approach to 
investigalmB diseases such as cancer' may overooma some of the brmtations of other 
approachs6\ DNA m i ciu ai t ay a have limited utility for the analysis of biological fluids and 
far uncovering directly, in the Moid, osseyobte biorrorfcer*, Ncrnerous alterations may occur 
m protons thai cm not reflected in charges at the RNA level. . 

rjnjiae DNA rrdcroarrays that provide one measure of gene expression, namely RNA 
levels, there is a need to unplement protein mlcroarray strategies that address the many 
different features of proteins including determination of their levels in biological samples, 
and determined on of their selective interactions with other jprotcJna, antibodies, drugs or 
various small ligands. Arrays that incorporate an tib odie s 5 *' or rec ombin a n t proteins 
obtained using cDNA expression libraries** 0 or phnge-display librar ies ha ve been 
utilized for different types of protein based assays. With other types ofrmcruaituya, whole 
tissue-derived samples have been directly arrayed onto slides, to assess the reactivity of 
total protein lysatcs with specific ligand** - ". Two practical applications of protein 
microarrsys were presented by Kbdadek 66 . designated protein function array and pratefn- 
detecting array. With protein function arrays, a large amount of protein is spotted on a solid 
support at a defined location and tested to characterise either a biochemical activity or a 
molecular Interaction. The proiein-dctecting array consists of an arrayed set of protein 
ligands used to profile gene expression and draw signatures mcucanve of the cellular stale. 
The whole process of assembling the protein array reqolres wmrideratioTUi related to the 
nature of the support, the type of urunobilixatioii. as well as die molecular architecture of 
the particle being Birr-nr* four main different supports have been optimized to perform 
assays of this magnitude: chemically modified Qlass slides (poly?Uysioes, porvaldehidea. 
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these samples. Peitems that distinguish between cancer patienU ami norma) object* with 
remarkable accuracy, have been reported for several types of cancer'. The coupling of 
protein arrays with mass spectrometry technologies is likely to become a powerful enafytit 
tool with which to profile protein expression. Such an approach, known as rYotemQup 
(Gphergen Inc. USA), was sue ccssfuJiy applied to study prostate and ovarian canters 
and, more recently, bead and neck cancers 00 . Results from these studies revealed the 
involvement of proteins in carcinogenesis processes and specificelly identified protein 
fingerprints from which cancer biomarkcrs were extracted. The major drawbacks of direct 
analysis of tissues or biological fluids by MAL01 it the preferentiel detection of proteins 
whh a lower molecular mass and the difficulty In Identirying lhe proteins corresponding to 
the masses observed, further technological improvements coord enhance the utility of 
direct mass spec trorne trie analysis of tissues and bioJogtcn] Maids. 




Plgvtv I, Rcpntczitxnrc cspenrarau far optimisation of PQA deprotoctiack. (Reproduced from Ffclteia ft of.. 
2002). 

Cancer tissue profiling studies that hDve u tinted protein raicroafrays are oegrrrning to 
emerge.' As a model to better understand how pauems of protein expression shape the tissue 
rmcroenvironmeflt. ICnexertc el a), analysed protein expression in tissue derived from 
squamous cell csreinomas of the oral cavity through an antibody rmcroarray approach for 
high-throughput proteomic analysis* 9 . Utilising laser capture microdissection to procure 
total protein from specific microscopic cellular populations, they demonstrated that 
quantitative, and potentially qualitative, differences in expression patterns of omttiplo 
proteins within epithelial cells rcproduciMy correlated with oral eerily tumor profession. 
Differesdal aapresslon of multiple proteins was found fas stromal cells surrounding and 
adjacent to regions of diseased cpitheKum that directly correlated with tumor progression of 
the epithelium. Most of the proteins identified In both cell types were involved in signal 
transduction pathways. They hypothesized therefor© that extensive molecular 



SJSofortat mow ho nmfa^taH hu «*wm«K» louf ZTtM© 1T II Q f*/*Ho\ 




X ftUDOZ'CimHBB AKP SM. fUflA&H 



communications involving complex cellular signaling between epithelium and stroma play 
a key lOTOiiiydtrvIng oral canriiy cancer progress on. 

A clinically rrlevnr.t application of protein micro arrays is the identification of proteins 
ihai induce an enribodr response in autoimmune disorders 0 . Microa/rays were produced 
by attaching several hundred proteins and peptide* to the surface of derivftozed gluts slides. 
Arrays were incubaxed with patient serum, end fluorescent labels were used to detect 
autoantibody binding to specific proteins in autoimmune diseases, intludiag systemic lupus 
cryihernatwus and rheumatoid arthritis. Such fnrcrofmys represent a powerful tool to 
study immune responstvio a variety of diseases including cancer. 

A reverse phase protein array approach that immobilizes the whole repertoire of a 
tissue's proteins has been developed 4 . A high degree of sensitivity, precision and tincariry 
was achieved, mating it possible to quantify the phospherylated status of signal proteins in 
human tissue ccD subpopol&zions. Using this approach Faweletrei ol. 4 * have Inngfnidrnafly 
' analyxed the stale of pTOrsarvivaJ checkpoint proteins a) the rJucrostople timnsitioo stag? 
from patient matched bistologiailly norma) prostate epithelium to prostate inotepi thefts! 
neoplasia and to invasive prostate cancer. Cancer progression was associated with mcreased 
phosphorylation of Aid. ropprcssion of apoptosis pathways* as well as decreased 
phosphorylation of ERK, At the transition from histolog^caUjr norma) epithelium to 
Intraepithelial neoplasia, a statistically significant serge fas phosphorylsted Akt and a 
concomitant suppression of dovTOStrearn apoptosis pathways preceding the transition cam 
invasive carcinoma were observed. 

A major challenge In making bxochips for global analysis of protein expression is the 
current lack of comprehensive sets of genome scale capture agents such as antibodies. As a 
result, bloc hips mat target specific classes of proteins such as kinases or cytokines am much 
easier to produce, that would have clinical utility. Another irmwtarrt cortsideration is 
protein oiicroarrBys is dial proteins undergo numerous post-trsnslaiional modifications eg 
phosphorylations, glycosytettons, vfarfeb are highly Important to their functions, as they caa 
determine activity. stability, localization and turnover. To address the need for 
comprehensive analysis of proteins in tftefr modified forma several approaches to the liquid 
based separation of cell and tissue ry tales were investigated in order to obtain protein 
fractions with reduced complexity or pore individual proteins". Thfi separation products 
can be arrayed in a manner thai allows the probing of protein constituents of cells and 
tissues to uncover specific targcia. For example, using a combination of anion exchange 
and reverse phase LC Medna-Gurpide cl at have obtained come 2000 Individual protein 
fractions that have been utilized to produce micro arrays that Interrogate cancer cell 
protcomes. Fractions thai react with specific probes are within the reach of 
chromatographic and gel based separation techniques for resolving their individual protein 
constituents asd of mass sp utum >c trie techniques for identification of their constituent 
proteins. The LC procedures allow sufficient protdn amounts to be resolved for the 
construction of large numbers of microarrays from a given cell or tissue source. 

Protein microarrays of different types arc likely to become commercially available for 
assays of broad sets of proteins end may well rival or at least complement DNA 
microarrays as tools for global expression analysis. 
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SUMMARY 

In conclosion, array-based technologies ftave emerged that contribute to profiling 
tissues at the genomic, oanscriptotnic and pro (comic levels. Analytical toob are needed to 
mine the vast amount of data generaieot Ultimately the molecular analysis of cancer at a 
genome and proteome stole will allow better classification of disease and tailored 
individooJucd therapy for individual patients. 
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Abstract 

h J V< ^ aVC P^^^P^^ive analysis of the expression profiles in 25 adult and 4 fetal human 
tissues by means of * cDNA microarray consisting of 23,040 human genes. This study revealed a number of 
genes that were expressed specifically in each of those tissues. Among the 29 tissues examined, 4,080 genes 
were h.ghly expressed (at least a five-fold expression ratio) in one or only a few tissues and 1 163 " 

tZJr T^r? 1 ^ ( TT tHan a ten " f0 ' d high6r rati °) in fl P^icular tissue. 

Expression of some of the genes m the latter category was confirmed by northern analysis. A hierarchical 
clustermg analyse of gene-expression profiles in nerve tissues (adult brain, fetal brain, and spinal cord) 
lymphoid tissues (bone marrow, thymus, spleen, and lymph node), muscle tissues (heart and skeletal' 
muscle) or adipose tissues (mesenteric adipose and mammary gland) identified a sot of genes that were 
commonly expressed among related tissues. These data should provide useful information for medical 
research, especially for efforts to identify tissue-specific molecules as potential targets of novel drues to 
treat human diseases. & 

Key words: expression profile; normal human tissues; cDNA microarray; tissue-specific expression 



1. Introduction 

The HuruaJi Genome Project and corporate efforts 
achieved a complete draft sequence of the genome 
recently. 1,2 Whole-genome sequencing and large-scale ex- 
pressed sequence tag (EST) projects have revealed that 
the human genome contains 30,000-40,000 genes. 1 " 3 In 
the post-sequencing era, investigating the biological and 
physiological functions of gene products, especially those 
related to diseases, is becoming increasingly important. 
To address this task, it Ls useful to know the expression 
properties of each gene in human tissues and organs. 

cDNA microarray technology has developed dramati- 
cally over the past 5 years, and wo con now obtain expres- 
sion profiles of thousands of genes simultaneously. Mi- 
croarrays have been used to investigate gene-expression 
profiles in yeast, 4 Mycobacterium, 5 cell lines 6 and many 
typos of human cancers. 7 " * Moreover, statistical methods 
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such as cluster analysis have revealed molecular classifi- 
cations based on differential expression of certain genes. 

Some genes that are expressed in a tissue-specific man- 
ner have been screened using a differential cDNA dis- 
play technique, 10 " 12 the "Body Map" system, 13 " 16 or by 
computational subtraction in serial analyses of gene ex- 
pression (SAGE). 17 Obtaining information about genes 
expressed only in a single tissue is a critical step toward 
understanding the mechanism of tissue-specific transcrip- 
tional regulation or for investigating genes responsible 
for hereditary syndromes when the phenotype is specific 
to a particular organ. With that goal in mind, we have 
performed comprehensive expression profiling in 25 adult 
and 4 fetal human tissues using a cDNA microarray con- 
taining 23,040 genes, and evaluated the power of this 
analysis for identifying tissue-specific molecules. 



Material may be protected by copyright law (Title 17, U.S. Code) 



36 Gene Expression Profiles 

2. Materials and Methods 

2.L Prepamtion of samples and Tl-based amplification 
of RNA 

Poly(A) + RNAs isolated from bone marrow, brain, 
heart, kidney, liver, lung, lymph node, mammary gland, 
pancreas, placenta, prostate, salivary gland, skeletal mus- 
cle, small intestine, Hpinal cord, spleen, stomach, testis, 
thymus, thyroid, trachea, uterus, fetal brain, fetal kid- 
ney, fetal liver, fetal lung (Clontech), colon, and ovary 
(Biochain) were used as probes for the cDNA microar- 
ray. From normal mesenteric adipose tissue obtained 
separately with informed consent, total RNA was ex- 
tracted using TRJzol (Invitrogen Life Technologies). Af- 
ter treatment with DNase I, total RNAs were purified us- 
ing inRNA Purification Kits (Amersham Biosciences) ac- 
cording to the manufacturer's instructions. All poly(A) + 
RNAs were pooled from at least two normal donors. 
T7-based RNA amplification was carried out as described 
previously. 18 Using 2 fxg of poly(A) + RNA as starting 
material, we performed one round of amplification to ob- 
tain 100-200 pg of amplified RNA (aRNA) for each tis- 
sue. 

2.2. cDNA microajrays 

We constructed a "genome-wide" cDNA microar- 
ray consisting of 23,040 cDNAs selected from the 
UniGene database (build #131) of the National Center 
for Biotechnology Information (NCBI), as described 
previously. 7 "^ Briefly, the cDNAs were amplified by 
RT-PCR using poly (A) + RNA isolated from various hu- 
man organs as templates; the lengths of the ampli- 
cons ranged from 200 to 1,100 bp excluding repetitive 
or poiy(A) sequences. The PGR products were spot- 
ted on type 7 glass slides using a Microarray Spotter, 
Generation III (Arnersham Biosciences); 4608 genes were 
spotted in duplicate on a single slide. We prepared 
five different sets of slides (i.e., 23,040 genes total), 
on each of which the same 52 housekeeping genes and 
2 negative-control genes were spotted as well. 

2.3. Labeling and hybridization 

A fluorescent cDNA probe was prepared from each 
aRNA in the manner described previously; 9 12.5/Ag 
aliquots of aRNA from individual tissues and a mixture of 
equivalent aRNA from all 29 tissues as a universal control 
were labeled respectively with Cy5-dCTP and Cy3-dCTP 
dyes. Hybridization and washing were performed accord- 
ing to protocols described previously 1 * except that all 
processes were carried out with an Auto Slide Processor 
(Amersham Biosciences). 

2.4- Quantification of signals 

The intensity of each hybridization signal was calcu- 
lated photometrically by the Array Vision computer pro- 
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gram (Amersham Biosciences) and background inten- 
sity was subtracted. Normalization of each Cy3- and 
Cy5-fluorescence intensity was performed using averaged 
signals from the 52 housekeeping genes. A cut-off value 
for each expression level was automatically calculated ac- 
cording to background fluctuation. When both Cy3 and 
Cy5 signal intensities were lower than the cut-off values, 
we assessed expression of the corresponding gene in that 
sample as absent. In cases where only the Cy3 signal was 
lower than the cut-off value, we replaced its signal with 
the cut-ofT value and calculated the relative expression ra- 
tio as Cy5/cut-off. If only the Cy5 signal was lower than 
the cut-off value, we calculated cm>off/Cy3. For other 
genes, we calculated Cy5/Cy3 as the relative expression 
ratio. 

2.5. Cluster analysis of 29 normal tissues 

We applied hierarchical clustering analysis to 
both genes and samples using wcb-available software 
("Cluster and TreeView") written by M. Eisen (http:// 
genome-www5.stanford.edu/MicroArray/SMD/rcstcch. 
html). 19 To obtain reproducible clusters, we excluded 
genes whore both the Cy3 and Cy5 signals were lower 
than the cut-off value, and selected only genes for which 
the Cy5/Cy3 ratio was greater than five in at least one 
sample. Before the clustering algorithm was applied, the 
fluorescence ratio for each spot was log-transformed and 
the data for each sample were centered to remove exper- 
imental biases. The selection criteria left 3,408 genes for 
the clustering analysis. 

2.6. Northern, analysis 

A 0.5/ig aliquot of poly(A) + RNA from each of the 
29 normal human tissues was electrophoresed on a 1% 
agarose gel containing 1 x MOPS buffer and 2% formalde- 
hyde, and was transferred to a nylon membrane. Hy- 
bridization with random-primed 32 P-labelled probes was 
carried out according to the manufacturer's instructions 
(Clontech) using the same RT-PCR products as those 
spotted onto the cDNA microarray slides. 

3. Results 

3. 1. Classification of genes 

We analyzed gene-expression profiles in 25 adult and 
4 fetal human tissues on our cDNA microarray, using a 
mixture of poly (A) + RNAs from all 29 tissues as a con- 
trol to calculate the relative expression ratio (Cy5/Cy3) 
of each of the 23,040 genes on the array (see Materi- 
als and Methods). The numbers of genes that showed 
Cy5/Cy3 ratios of 5 or higher or those of 10 or higher 
in individual tissues are summarized in Fig. 1. Of the 
4.080 genes that were highly expressed in only one or 
a few tissues, 1,163 were expressed with a ratio of > 10 
in only one tissue. Liver, placenta, skeletal muscle, und 
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Figure 1. Distribution of genes higlily expressed in 29 norma) human tissues. 



testis each contained more than 100 genes that showed 
at least a tenfold higher level of expression there than in 
other tissues. 

Table 1 lists the 30 genes showing the biggest differ- 
ences in expression levels in a given tissue as compared 
with the mixture of mRNAs from all 29 tissues. The list 
includes genes that were previously reported to be related 
to or responsible for a disease. For example, among the 
30 genes highly expressed in kidney, SLC3A1 (solute car- 
rier family 3 member l f with a Cy5/Cy3 ratio of 26.0). 
is involved in the transport of dibasic and neutral forms 
of cystine, and its mutations cause cystinuria type l. 20 - 21 
In heart, mutant forms of TNNI3 (cardiac troponin I). 
TNNT2 (cardiac troponin T2), ACTC (alpha cardiac 
actin), and MYH7 (beta cardiac myosin heavy chain) are 
capable of causing hypertrophic cardiomyopathy. 22-25 
Their Cy5/Cy3 ratios were 56.1, 43.3, 31.4 t and 18.6, 
respectively. A comparison with data from the Body 
Map database, which also contains ESTs from human 
tissues, indicated that many of the tissue-specific genes 
listed here were also indicated by Body Map to be ex- 
pressed highly in mesenteric adipose tissue, adult liver, 
adult lung, skeletal muscle, fetal brain, fetal Jivcr, and fe- 
tal lung; those genes are shown in bold letters in Table 1. 



3.2. Clustering analysis of gene- expression patterns in 
normal human tissues 

We applied an algorithm for hierarchical clustering of 
the expression profiles of all 29 tissues, using 3,408 genes 
whose expression ratios were 5 or higher in at least 
one tissue (Fig. 2)- The colored bars in Fig. 2 indi- 
cate clusters of genes that were coordinately expressed 
in a tissue-specific manner. Fetal brain, adult brain, 
and spinal cord were clustered into the same terminal 
brand:, Indicating that a large number of genes are com- 
monly expressed in those neuronal tissues. In this clus- 
ter we found genes encoding neuronal cytoskeleton pro- 
teins, neurofilament components (NEFL and NEF3), mi- 
crotubule components (TUBB, TUBB1, and TUB A3, 
tubulin polypeptides), MAPT (microtubule associated 
protein tau), neuroendocrine-related proteins (SGNE1, 
RTNl. RTN3, CADPS), and proteins involved in neu- 
ronal vesicle trafficking (KIF5C, SH3GL2, STXBP1). 
Also included in this cluster were GPM6B, GPM6A, NR- 
CAM, ASTN, and CTNND, which encode neural cell ad- 
hesion molecules, and neurotensin ission-related proteins 
GRM3 (glutamate receptor) and GAD1 (glutamate de- 
carboxylase 1) (Fig. 3a). 

Spleen, lymph node, thymus and bone marrow con- 
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Figure 2. Hierarchical clustering of genc-cxpression profiles. Of 23,040 genes on the microarray, 3/iOB were selected for clustering 
analysis by the criteria described in Materials and Methods. The dendrogram at the left lists the tissues studied and provides a 
measure of the reiatedness of gene expression in each sample. In the matrix, each row represents a single cONA on the microarray, 
' whereas each column corresponds to a separate sample. Red and gretn colors indicate genes with high and low expression respectively, 
mRNA compared to the reference mixture of equivalent aRNA from all 29 tissues. Black and gray indicate unchanged expression 
and no expression, respectively. Colored bars oa the right of the matrix indicate clusters of tissue-specific genes of Muscle (heart and 
skeletal). CNS (adult brain, fetal brain, spinal cord), and Lymphoid tissues (bone marrow, lymph node, spleen, thymus). 



stituted another cluster, which included cell-adhesion 
molecules or receptors (SELL, ITGA4. CD37, CXCR4, 
IL2RG, CD53, HEM1, and LAIRl), and also PLCB t 
ARHGDIB, and HCLSl whose products play impor- 
tant roles in signal transduction. In addition, we 
found the COROlA and WASPIP genes, encoding 
leukocytic cytoskelcton-rclated proteins, in this cluster 
(Fig. 3b). Striated-muscle tissues (skeletal muscle and 
heart) also clustered together. Genes encoding my- 
ofibrillar component-proteins (ACTN2, MYH6, MYH3, 
MYH7, TTN T MYL2, TCAP, MYOM1, and MYOM2) 
were included in this cluster. ATP2A, CASQ2, and 



HRC, encoding calcium-regulating proteins, showed high 
expression in ooth types of muscle. Genes encoding 
enzymes playing important roles in muscles, -PFKM, 
COX6A2, and CKM, were included also (Fig. 3c). 

Furthermore, we identified 371 tissue-specific genes 
that had Cy5/Cy3 ratios >5 in a single tissue but 
ratios of < 2 in ail other tissues examined (Table 2). 
In the case of brain, kidney, liver, and lung, we se- 
lected -tissue- specific genes according the following cri- 
teria: Cyo/Cy3 ratios > 5 in either or both of the fetal 
and adult tissues and ratios < 2 in the remaining 27 tis- 
sues. Very few genes were expressed in both fetal and 
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Figure 3. Gene clusters related to tissue characteristics. Daia arc the Banie as in Fig. 2 X with genes involved in each cluster listed 
here: a, CNS cluster including adult brain, fetal brain, and spina: cord; b, lymphoid cluster including lymph node, spleen, thymus, 
and bone marrow; c, Muscle cluster including heart and skeletal muscle. All of the Hs. numbers were retrieved from the Unigene 
Database (build #131, NCBI). 
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Parentheses indicate numbers of qualifying genes of unknown 
function in the given tissue. 

adult tissues. 

Many of tlie tissue clusters were distinguished by spe- 
cific expression of metabolically related groups of genes. 
For example, in the muscle cluster, 14 genes showed 
Cy5/Cy3 ratios > 10 in both skeletal muscle and heart. 
This set included genes known to play critical roles 
in muscle, including tropomyosin, proteins of the tro- 
ponin complex, and myosin heavy polypeptide. In the 
adipose- tissue cluster, perilipin (PLIN) showed high ex- 
pression in both mesenteric adipose tissue and mammary 
gland; testis, placenta, skeletal muscle, liver, and brain 
tissues each expressed more than 80 genes with Cy5/Cy3 
ratios > 5. 

3 'J. Verification of cDNA microarray data 

To verify the reliability of our cDNA microarray anal- 
ysis, we randomly selected ten genes (one of known func- 
tion and the other nine unknown) which had shown 
Cy5/Cy3 ratios of more than 10 in a single tissue, and we 
performed northern blotting using those genes as probes. 
As Fig. 4 shows, each of these ten genes was confirmed 
to be expressed in a tissue-specific manner, a result that 
supported the reliability and rationality of our strategy 
to identify tissue-specific genes on cDNA microarrays. 



4. Discussion 

Wc examined a total of 666,160 measurements of gene 
expression in 25 adult and 4 fetal tissues, by means of a 
cDNA microarray. This is the first study to apply that 
technique for analyzing gene expression profiles of nor- 
mal human tissues on a large scale. By doing this, we 
established an expression database for sets of genes ex- 
pressed specifically in a certain tissue(s). Among them 
we found many genes that were related to various dis- 
eases, an indication that genes expressed exclusively in 
one particular tissue or predominantly expressed tn only 
a few tissues may play crucial roles in tissue-specific 
function. For example, as shown in Table 3, a defect 
of the PYGM (muscle glycogen phosphorylase) gene, 
which showed skeletal muscle-specific expression in our 
experiment (Cy5/Cy3 ratio — 34.1), causes a typical 
metabolic myopathy known as glycogen storage disease 
V (McArdle's disease). 26 Similarly, a defect in AMPD1 
(adenosine monophosphate deaminase), which was highly 
expressed (Cy5/Cy3 ratio = 30.3) only in skeletal muscle, 
is responsible for myoadenylate deaminase deficiency. 27 
In addition, we found that three genes correlated 
with hypertrophic cardiomyopathy, TNNIT3, TNNT2, 
and ACTC, 22 " 24 were heart-specific (Cy5/Cy3 = 56.1, 
43.3, and 31.4, respectively). Moreover, the MYH7 
(beta-myosin heavy-chain) gene, included in the muscle 
cluster defined by our clustering analyses (Fig. 3c), is re- 
sponsible for hypertrophic cardiomyopathy (HCM); mu- 
tations in the MYH7 gene arc associated also with skele- 
tal muscle changes characteristic of central core disease. 25 
SFTPB (pulmonary surfactant protein B), a lung- 
specific gene according to our results (Cy5/Cy3 ratio = 
10.3 and 11.2 iu adult and fetal lung, respectively), is re- 
sponsible for congenital alveolar proteinosis. 28 Similarly, 
we detected specificity of TPO (thyroid peroxidase) to 
thyroid tissue (Cy5/Cy3 ratio = 24.1). TPO is a key en- 
zyme in the synthesis of thyroid hormones; defects of the 
TPO gene can lead to congenital hypothyroidism due to 
a total iodide organification defect (TIOD)." 29 Recently, 
Blackehaw and coworkers identified genes that were spe- 
cific to or highly enriched in rods using gene-expression 
profile by the SAGE technique, and they demonstrated 
that many of candidate retinal disease genes were iden- 
tified on the basis of these expression profiles and inte- 
grating these data with in silico mapping. 30 These results 
suggest that our study may be useful for cloning genes 
associated with disease involved in specific organs where 
gene-specific expression of genes is observed. 

It is important to identify tissue-specific genes in the 
interest of cancer research as well. Using a difTeren- 
tial display method, Ozaki et al. demonstrated that a 
pancreas-specific gene, pancpin. was down-regulated or 
mutated in primary pancreatic cancers. 11 All these lines 
of evidence indicate that obtaining sets of genes expressed 
in specific tissues may reveal genes that have not yet been 
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identified as responsible for human disease. 

'Transcriptional regulation of genes in a given tis- 
sue is controlled by a combination of several mecha- 
nisms; e.g. changes in chromatin structure mediated by 
the association of histone acetylases and deacetylases, 31 
binding of transcriptional regulators to an enhancer 
and/or a suppressor, 32 binding of transcription factors 
to a promoter region, and methyiation of CpG Islands 
in promoter sequences. 33 However, the precise mecha- 
nisms of cooperation among these elements are not well 
understood. 34 - 35 To address this issue, we identified sev- 
eral potential regulatory elements of liver-specific tran- 
scription on the basis of tissue-specific expression (data 
not shown). Comparison of promoter sequences among a 
large number of tissue-specific genes would identify their 
common cis-elements whose core sequence might be use- 
ful for identifying novel transcription-related factors in 
each tissue and for use as promoters in vectors designed 
for gene therapy 36 



Using a clustering algorithm, we found that three of 
the fetal tissues examined (kidney, liver, and lung) were 
clustered in the same branch. This implies that gene ex- 
pression patterns among the three tissues might be sig- 
nificantly similar during fetal life. Sell et al. reported 
that the albumin gene was expressed in both kidney and 
lung of fetal rat; 37 we confirmed a relatively high level 
of expression of albumin mRNA in human fetal kidney 
and fetal lung (data not shown). However, in our anal- 
ysis, expression profiles of genes in adult and fetal lung 
tissues fell into different branches an indication that fetal 
lung, with no air respiration, functions quite differently 
from adult lung. We conclude that hierarchical cluster- 
ing of normal tissues on the basis of gene expression can 
elucidate characteristic features of each tissue. 

Finally, our information should have relevance to can- 
cer therapy. We have already carried out expression 
profiling in many typce of cancer including ovarian, 1 
hepatocellular, 9 colorectal, 38 and esophageal 39 carcino- 
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QenetD Cy5/Oy3 Symbol 



Gene Name 



Function 



Related dlsaaso 



SKqfptal muaclft 

A6921 34.1 PYGM muscle glycogen phsphoiyiation aliosteric enrymo In carbohydrate metabolism Gtyoogen strage disoase V 26 



A34S5 30.3 AMPDt adenosine monophosphate deaminase energy metaboilsm. 



C4166 56.1 

A7154 43.3 

B2563 31.4 

A0S40 1B.6 



TNN13 cardiac troponin ! 

TNKT2 perdtac troponin T2 

ACTC atpha cardiac acttn 

MYH7 beta-myosin heavy chain 7 



tropomyosin-blnding subunft of troponin 
tropomyosin-blnding subunlt o! troponin 
ceil motility 
musde contraction 



myoadenylate deaminase deficiency 27 

Hyperttophic cardiomyopathy 22 

Hypertrophic cardiomyopathy 23 

Hypertrophic cardiomyopathy 24 

Hypertrophic cardiomyopathy 25 



Una 

A1739 10.8 SFTPB putmonary surfactant protein B 



promote alveolar stability 



congenital aJveotarproelnosis 



T yrold 
A3240 24.1 



TPO thyroid peroxidase 



(odination end coupling of the hoononooenic ^ dafact 
tyrosines In thyrogiobulin ■ 



Gene ID represents the ID in our laboratory. Ref indicates a reference concerning each related disease. 
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mas using "genome-wide'* cDNA mieroarrays. Those ex- 
pression profiles provide useful information for identify- 
ing genes that are specifically, expressed in cancer cells 
and whose products would be good candidates as molec- 
ular targets for novel drugs. Furthermore, if these candi- 
date geues are not expressed in vital tissues, such drugs 
may be less likely to cause adverse effects. Thus, the 
gene expression profiles we have now established in nor- 
mal tissues, especially those of tissue-specific genes, will 
serve as references for estimating the risk of adverse ef- 
fects and should help to guide the selection of molecular 
targets for the development of novel therapeutic drugs. 
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We have determined the relationship between mRNA and protein expression levels for selected genes 
expressed m the yeast Saccharomyces cerevisiae growing at mid-log phase. The proteins contained in total yeast 
cell lysate were separated by high-resolution two-dimensional (2D) gel electrophoresis. Over 150 protein spots 
were excised and identified by capillary liquid chromatography-fcndem mass spectrometry (LC-MSMS). 
Protein spots were quantified by metabolic labeling and scintillation counting Corresponding mRNA levels 
were calculated from serial analysis of gene expression (SAGE) frequency tables (V. E. Velculescu, L. Zhang, 

S^d^/'iuoff M */; BaSnii ' D ' R BaSSCtt ' Jn ' P ' me!er ' B ' Vogelstein, and K. W. Kinder, Cell 
88.-43-251, 1997). We found that the correlation between mRNA and protein levels was insufficient to predict 
protein expression levels from quantitative mRNA data. Indeed, for some genes, while the mRNA levels were 
of the same value the protein levels varied by more than 20-fold. Conversely, invariant steady-state levels of 
certain proteins were observed with respective mRNA transcript levels that varied by as much as 30-fold. 
Another interesting observation is that codon bias is not a predictor of either protein or mRNA levels. Our 
results clearly delmeate the technical boundaries of current approaches for quantitative analysis of protein 
expression and reveal that simple deduction from mRNA transcript analysis is insufficient " 



The description of the state of a biological system by the 
quantitative measurement of the system constituents is an es- 
sentia! but largely unexplored area of biology. With recent 
technical advances including the development of differential 
disp!ay-PCR (21), of cDNA microarray and DNA chip tech- 
nology (20, 27), and of serial analysis of gene expression 
(SAGE) (34, 35), it is now feasible to establish global and 
quantitative mRNA expression profiles of cells and tissues in 
species for which the sequence of all the genes is known. 
However, there is emerging evidence which suggests that 
mRNA expression patterns are necessary b\it are by them- 
selves insufficient for the quantitative description of biological 
systems. Tin's evidence includes discoveries of posttranscrip- 
tional mechanisms controlling the protein translation rate (15), 
the half-lives of specific proteins or mRNAs (33),, and the 
intracellular location and molecular association of the protein 
products of expressed genes (32). 

Proteome analysis, defined as the analysis of the protein 
complement expressed by a genome (26), has been suggested 
a<; an approach to the quantitative description of the state of a 
biological system by the quantitative analysis of protein expres- 
sion profiles (36). Proteome analysis is conceptually attractive 
because of its potential to determine properties of biological 
systems that are not apparent by DNA or mRNA sequence 
analysis alone. Such properties include the quantity of protein 
expression, the subcellular location, the state of modification, 
and the association with ligands, as well as the rate of change 
with time of such properties. In contrast: to the genomes of a 
number of microorganisms (for a review, see reference 11) and 
the transcriptome of Saccharomyces cerevisiae (35), which have 
been entirely determined, no proteome map has been com- 
pleted to date. 

The most common implementation of proteome analysis is 
the combination of two-dimensional gel electrophoresis (2DE) 
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(isoelectric focusing-sodium dodecyl sulfate [SDSJ-poIyacryl- 
amide gel electrophoresis) for the separation and quantitation 
of proteins with analytical methods for their identification. 
2DE permits the separation, visualization, and quantitation of 
thousands of proteins reproducibly on a single gel (18, 24). By 
itself, 2DE is strictly a descriptive technique. The combination 
of 2DE witl) protein analytical techniques has added the pos- 
sibility of establishing the identities of separated proteins (1, 2) 
and thus, in combination with quantitative mRNA analysis, of 
correlating quantitative protein and mRNA expression mea- 
surements of selected genes. 

The recent introduction of mass spectrometric protein anal- 
ysis techniques has dramatically enhanced the throughput and 
sensitivity of protein identification to a level which now permits 
the large-scale analysis of proteins separated by 2DE. The 
techniques have reached a level of sensitivity that permits the 
identification of essentially any protein that is detectable in the 
gels by conventional protein staining (9, 29). Current protein 
analytical technology is based on the mass spectrometric gen- 
eration of peptide fragment patterns that are idiotypic for the 
sequence of a protein. Protein identity is established by corre- 
lating such fragment patterns with sequence databases (10, 22, 
37). Sophisticated computer software (8) has automated the 
entire process such that proteins are routinely identified with 
no human interpretation of peptide fragment patterns. 

In this study, we have analyzed the mRNA and protein levels 
of. a group of genes expressed in exponentially growing cells of 
the yeast S. cerevisiae. Protein expression levels were quantified 
by metabolic labeling of the yeast proteins to a steady state, 
followed by 2DE and liquid scintillation counting of the se- 
lected, separated protein species. Separated proteins were 
identified by in-gel tryptic digestion of spots with subsequent 
analysts by mtcrospray liquid chrornatography-tandem mass 
spectrometry (LC-MS/MS) and sequence database searching. 
The corresponding mRNA transcript levels were calculated 
from SAGE frequency tables (35). 

This study, for the first time, explores a quantitative com- 
parison of mRNA transcript and protein expression levels for 
a relatively large number of genes expressed in the- same met- 
abolic state. The resultant correlation is insufficient for predic- 
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lion of protein levels from mRNA transcript levels. We have 
also compared the relative amounts of protein and mRNA 
with the respective codon bias values for the corresponding 
genes. This comparison indicates that codon bias by itself is 
insufficient to accurately predict either the mRNA or the pro- 
tein expression levels of a gene. In addition, the results dem- 
onstrate that only highly expressed proteins are detectable by 
2DE separation of total cell lysates and that therefore the 
construction of complete proteomc maps with current technol- 
ogy will be very challenging, irrespective of the type of organ- 
ism. 

MATERIALS AND METHODS 

Yeast strain and growth conditions. The source of protein and message tran- 
scripts for all experiments whs YPH499 (MAT* ura3-52 Jys2SQl ddc2-J01 
Uu2-M his3-CaOO trpi-&63) (30). Logarithmically growing cells were obtained by 
growing yeast cells to early log phase (3 X 10* cafe/ml) in YPD rich medium 
(YPD suppiemeated with 6 mM uracil, 4.8 mM adenine, and 24 nuM tryptophan) 
at 30*C (35). Metabolic labeling of protein was accomplished in YPD medium 



exactly as described elsewhere (4) with the exception that 1 ml of cells was 
labeled with 3 mCi to offset methionine present in YPD medium. Protein was 
harvested as described by Carrels and coworkers (12). Harvested protein whs 
lyophiiized, resuspendec in isoelectric focusing gel" rehydration solution, and 
siored at -80°C 

2DK Soluble proteins were run iu lhe firs; dimension by using a commercial 
flatbed electrophoresis system (Multiphor U; Pharmacia Biotech). Immobilized 
poiyacrylamide gel (IPG) dry strips with nonlinear pH 3.0 to 10.CI gradients 
(Amersham-Pharmacia Biotech) were used for the first-dimension separation. 
Forty micrograms of protein from whole-cell lysates was mixed with IPG strip 
rehydration buffer (8 M urea, 2% Nonidet P-40 ; 10 mM diihiothreitoi), and 250 
to 380 u.1 of solution was added to individual lanes of an IPG strip rehydration 
tray (Amersham -Pharmacia Biotech). The strips were allowed to rehydraie at 
room temperature for 1 h. The samples were run at 300 V-10 mA-5 W for 2 h, 
then ramped to 3,500 V-10 mA-5 W over a period of 3 h, and then kept at 3,500 
V-10 mA-5 W for 15 to 19 h. At the end of the first-dimension run ((50 to 70 kV • 
h). the IPG strips were reequilibrated for 8 min in 2% (wt/vol) dithiothreitol in 
2% (wt/vol) SDS-4i M urea-30% (wt/vol) glycerol-0.05 M Tris HQ (pH 6.8) and 
for 4 min in 2.5% iodoaceiamidc in 2% (wtA-ol) SDS-6 M urea-30% (wt/vol) 
«tycero)-0.05 M Tris HQ (pH 6.8). Following rccquilibration, the strips were 
transferred and apposed to 10% poiyacrylamide second-riime.usion gels. Poiy- 
acrylamide gels were poured in a casting stand with 10% acrylamide-2,67% 
piperazine diacryiam>de-0.375 M Trisbase-HCl (pH R8>-0.1% (wt/vol) SDS-0.05% 
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(wt/vol) ammonium persu!faie-*U)5% TEMBD (Ayv//V v ^tetramethyiethyl- 
enedismine) in Milli-Q water, The apparatus used to run second- dimension gels 
was a noncommercial apparatus from Oxford Glycosciences, Inc. Once the IPG 
strips were apposed to the second-dimension gels, they were immediately run at 
50 mA (eorjstant)-500 V-S5 W for 20 min, followed by 200 mA (cons;am)-500 
\Mi5 W until the buiFer front line was 10 to IS mm from the bottom of the gel. 
Gels were removed and silver stained according to the procedure of Shevchenko 
et al. (29). 

Protein identification. Gels were exposed to X-ray film overnight, and then the 
silver staining and film were used io excise 156 spots of varying intensities, 
molecular weights, and isoelectric focusing points. In order "to increase the 
detection limit by mass spectrometry, spots were cut ou; and pooled from up to 
four identical cold, silver-stained gels. In-gcl tryptic digests of pooled spots were 
performed as described previously (29). Tryptic peptides were analyzed by aii- 
crocapillary LC-MS with automated switching to MS/MS mode for peptide 
fragmentation. Spectra were searched against the composite OWL protein se- 
quence database (version 30.2; 250,514 protein sequences) (24ii) by using the 
computer program Sequest (8), which matches theoretical and acquired taadem 
mass spectra. A ptoteiu match was determined by comparing the number of 
peptides identified and their respective cross-correlation scores. All protein 
identifications were verified by comparison with theoretical molecular weights 
and isoelectric points. 



mRNA quantitation. Velculescu and coworkers have previously generated 
frequency tables foi yeast mRNA transcripts from the same straiu grown under 
the same stated conditions as described herein (35). The SAGE technology is 
based oa two main principles. First, a short sequence tag (15 bp) that contains 
sufficient information uniquely to identify a transcript is generated. A single tag 
is usually generated torn each mRNA transcript in the cell which corresponds to 
15 bp at the 3'-mosr cutting site for .NIaiiL Second, many transcript tags can be 
concatenated into a single molecule and then sequenced, revealing I he identity of 
multiple tags simultaneously. Over 20,000 transcripts were sequenced from yeast 
strain YFM499 growing at mid-log phase un glucose. Assuming the previously- 
derived estimate of 15,000 mRNA molecules per cell (16), ibis would represent 
a 1. 3-fold coverage even for mRNA molecules present at a single copy per cell 
and would provide a 72% probability of delecting such transcripts. Computer 
software which took for input the gene detected, examined the nucleotide se- 
quence, iCad performed the calculation as described by Velculesctt and coworkers 
(35) was written. In practice, we found that for 21 of 128 (16%) genes examined 
viable mRNA levels from SAGE data could not be calculated. This was because 
(i) no CATG site was found in the open reading frame (ORF), (>i) a CATG site 
was found but the corresponding 10-bp putative SAGE tag was not found in the 
frequency tables, or (ui) identical putative SAGE tags were present for multiple 
genes (e.g., TDH2_ YEAST and TPH3 YEAST). 
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TABLE 1. Expressed genes identified from 2D gei in Fig. 2 TABLE I — Continued 



Mol wt 


Pi 


Spot no. 


Vpn „,.,,„ 
TrlJ gCIIC 


Protein 
abundance 

\L*J COpJCli/ 

cell) 


uiRNA 
abundance 
(cupics/ccil) 


Cod on 

was 


17,259 


6.75 


133 


CPR1 


15.2 


61.7 


0.769 


18,702 


4.80 


83 


EGD2 


20.1 


5.2 


0.724 


18.726 


4.44 


147 


YKL056C 


61.2 


88.4 


0.831 


18,978 


5.95 


135 


YER067W 


3.7 


6.7 


0.118 


19,108 


5.04 


130 


YLR1U9W 


94.4 


9.7 


0.680 


19,681 


9.08 


136 


ATP7 


11.0 


NA**-" 


0.246 


20,505 


6.07 


111 


GUK1 


16.5 


3.7 


0.422 


21,444 


5.25 


148 


SARI 


5.4 


10.4 


0.455 


21,583 


4.98 


95 


TSA1 


110.6 


40.1 


0.S45 


22,602 


4.30 


80 


EFB1 


66.1 


23.8 


0.875 


23,079 


6.29 


112 


SOD2 


12.6 


2.2 


0.351 


23,743 


5.44 


137 


HSP26 


NA J 


0.7 


0.434 


24,033 


5.97 


96 


ADK1 


17.4 


16.4 


0.656 


24,058 


4.43 


143 


YKLU7W 


29.2 


10.4 


0.339 


24.353 


6.30 


140 


TFS1 


8.1 


0.7 


0.146 


24,662 


5.85 


99 


URA5 


25.4 


6.0 


0.359 


24.808 


6.33 


97 


GSPI 


26.3 


5.2 


0.735 


24,908 


8.73 


122 


RPS5 


18.6 


NA C 


0.899 


25,081 


4.65 


81 


MRP8 


9.3 


NA C 


0.241 


25.960 


6.06 


116 


RPE1 


5.8 


0.7 


0.372 


26,378 


9.55 


127 


RPS3 


96.8 


NA ,: 


0.863 


26.467 ' 


5.18 


100 


VMA4 


10.5 


3.7 


0.427 


26,661 


5.S4 


98 


TP11 


NA rf 


NA f; 


0.900 


27,156 


5.56 


93 


PRE8 


6.9 


0.7 


0.129 


27,334 


6.13 


115 


YHR049W 


18.4 


2.2 


0.520 


27,472 


5.33 


92 


YNLOIOW 


31.6 


3.7 


0.421 


27,480 


8.95 


123 


GPM1 


10.0 


169.4 


0.902 


27,480 


8.95 


124 


GPM1 


231.4 


169.4 


0.902 


27,480 


8.95 


125 


GPM1 


7.5 


169.4 


0.902 


27,809 


5.97 


139 


HOR2 


5.7 


0.7 


0.381 


27,874 


4.46 


78 


YST1 


13.6 


52.8 


0.805 


28.595 


4.51 


41 


PUP2 


4.4 


0.7 


0.147 


29,156 


6.59 


114 


YMR226C 


14.5 


2.2 


0.283 


29.244 


8.40 


120 


DPMI 


5.0 


11.2 


0.362 


29,443 


5.91 


48 


PRE4 


3.4 


3.7 


0.162 


30.012 


6.39 


138 


PRB1 


21.2 


1.5 


0.449 


30,073 


4.63 


77 


BMH1 ■ 


14,7 


28.2 


0.454 


30,296 


7.94 


121 


OMP2 


67.4 


41.6 


0.499 


30.435 


6.34 


89 


GPP'l 


70.2 


11.2 


0.703 


31,332 


5.57 


88 


TLV6 


13.9 


3.0 


0.402 


32.159 


5.46 


113 


IPP1 


63.1 


3.7 


0.752 


32,263 


6.00 


149 


rasi 


22.4 


4.5 


0.232 


33,311 


5.35 


84 


SPE3 


15.1 


6.7 


0.468 


34,465 


5.60 


129 


ADE1 


8.7 


5.2 


0.305 


34,762 


5.32 


85 


SEC 14 


10.9 


6.0 


0.373 


34,797 


5.85 


42 


URA1 


49.5 


8.9 


0.237 


34,799 


6.04 


90 


BEL1 


103.2 


S1.0 


0.S75 


35 5 556 


5.97 


43 


YDL124W 


6.4 


4.5 


0.206 


35,619 


8.41 


59 


TDH1 


69.8 


32.7 e 


0.940 


35,650 


5.49 


68 


CARl 


5.2 


3.0 


0.339 


35.712 


6.72 


117 


TDH2 


49.6 


473.0 C 


0.982 


35,712 


6.72 


154 


TDH2 


863.5 


473.0* 


0.982 


35.712 


6.72 


155 


TDH2 


79.4 


473.0' : 


0.982 


36>72 


4.85 


128 


APA1 


8.7 


0.7 


0.425 


36,358 


5.05 


75 


YJR105W 


17.6 


17.1 


0.522 


36,358 


5.05 


76 


YJR105W 


27.5 


- I7.I 


0.522 


36,596 


6.37 


79 


ADH2 


58.9 


260.0 0 


0.711 


36.714 


6.30 


102 


ADHl 


746.1 


260.0 


0.913 


36,714 


6.30 


. 103 


ADHI 


17.6 


260.0 


0.913 


36,714 


6.30 


104 


ADH1 


61.4 


260.0 


0.913 


36,714 


6.30 


105 


ADHl 


52.7 


260.0 


0.913 


37,033 


6.23 


44 


TALI 


44.8 


3.7 


0.701 


37.796 


7.36 


57 


IDH2 


29.4 


6.7 


0.330 


37,886 


6.49 


106 


ILV5 


76.0 


4.5 


0.892 


38.700 


7.83 


55 


BAT1 


30.9 


11.2 


0.469 


38,702 


6.24 


46 


OCR2 


NA' ; 


2.2 


0.326 



Mol wt. 




Spf)t TU>. 


YPD gene 
nam a" 


Protein 
abundance 
(Jlr copies/ 
rcll) 


mRNA 
abundance 
(copies/cell) 


Codun 
bias 


39,477 


5.58 


86 


FBA1 


1 7.8 


183.6 


0.935 


39,477 


5.58 


87 


FBA1 


427.2 


183.6 


0.935 


39,540 


6.50 


150 


HOM2 


60.3 


4.5 


0.592 


39,561 


6.12 


156 


PSA1 


96.4 


27.5 


0.718 


41,158 


6.01 


49 


YNLI34C 


14.9 


1.5 


0316 


41,623 


7.18 


58 


BAT2 


19.0 


8.9 


0,250 


41,728 


7.29 


110 


ERG10 


24.1 


4.5 


0.543 


41,900 


5.42 


74 


TOM40 


22.3 


2.2 


0.375 


42,402 


6.29 


45 


CYS3 


6.7 


8.9 


0.621 


42,883 


5.63 


67 


DYS1 


15.8 


5.2 


0.526 


43,409 


6.31 


107 


SER1 


10,5 


1.5 


0.292 


43,421 


5.59 


91 


ERG6 


2.2 


14.1 


0.408 


44,174 


7.32 


56 


YBR025C 


13.1 


6.0 


0.684 


44,682 


4.99 


72 


TIF1 


2.9 


39.4 


0.S34 


44,707 


7.77 


108 


PGK1 


23.7 


165.7 


0.897 


44,707 


7 77 


109 - 


PGK1 


315.2 


165.7 


0.897 


46,080 


6.72 


30 


CAR2 


15.4 


NA' : 


0.495 


46,383 


8.52 


53 


IDP1 


7.7 


0.7 


0.436 


46,553 


5.98 


47 


IDP2 


32.4 


NA C 


0.197 


46,679 


6.39 


50 


ENOl 


35.4 


0.7 


0.930 


46,679. 


6.39 


51 


KNOT 


6.6 


0.7 


0.930 


46,679 


6.39 


52 


ENOl 


2.2 


0.7 


0,930 


46,773 


5.82 


63 


EN02 


15.5 


289.1 


0.960 


46,773 


5.82 


64 


EN02 


635.5 


289.1 


0.960 


46,773 


5.82 


65 


ENG2 


93.0 


289.1 


0.960 


46,773 


5.82 


66 


EN02 


3t.fl 


289.1 


0.960 


47,402 


6.09 


126 


CORl 


2.5 


0.7 


0.422 


47,666 


8.98 


54 


AAT2 


11.7 


6.0 


0.338 


48,364 


5.25 


73 


WTM1 


74.5 


13.4 


0.365 


48*530 


6.20 


61 


MET! 7 


38.1 


29.0 


0.576 


48^904 


5.18 


69 


LY r S9 


16.2 


3.7 


0.463 


48,987 


4.90 


153 


SI.JP45 


29.6 


11.9 


0.377 


49*727 


5.47 


70 


PR02 


1.3.6 


5.2 


0.297 


49,912 


9.27 


62 


TEF2 


558.5 


282.0 


0.932 


50,444 


5.67 


35 


YDR190C 


4.8 


22 


0.228 


50*837 


6.11 


32 


YEL047C 


3.S 


1.5 


0.387 


50^891 


4.59 


151 


TUB2 


11.2 


7.4 


0.404 


51,547 


6.80 


27 


LPD1 


18.9 


2.2 


0.351 


52,216 


7.25 


29 


SUM 2 


19.7 


7.4 


0.722 


52,859 


5.54 


37 


YFR044C 


30.2 


6.7 


0.442 


53,798 


5.19 


71 


HXK2 


26.5 


7.4 


0.756 


53,803 


6.05 


145 


GYP6 


4.4 


0:7 


0.147 


54,403 


5.29 


39 


ALD6 


37.7 


2.2 


0.664 


54,403 


5.29 


40 


ATJD6 


6.6 


2.2 


0.664 


54,502 


6.20 


31 


ADE13 


6,3 


1.5 


0.417 


54,543 


7.75 


25 


PYK1 


225.3 


101.8 


0.965 


54,543 


7.75 


26 


PYK1 


39.8 


101.8 


0.965 


55,221 


6.66 


146 


YEL071W 


16.3 


3.0 


0.244 


55,295 


4.35 


134 


PDTi 


66/2 


14.1 


0.589 


55,364 


5.98 


24 


GEK1 


22.6 


6.0 


0.237 


55,48 1 


7.97 


118 


ATP I 


21.6 


2.2 


0.637 


55,886 


6.47 


28 


CYS4 


22.2 


NA C . 


0.444 


56,167 


5.83 


33 


AROS 


14.3 


3.0 


0.324 


56^167 


5 : 83 


34 


AR08 


9.1 


3.0 


0324 


56,584 


6.36 


20 


CYB2 


18.9 


NA C 


0.259 


57,366 


5.53 


60 


FRS2 


2.3 


0.7 


0.451 


57,383 


5.98 


144 


ZWF1 


5.6 


0.7 


0.215 


57,464 


5.4y 


36 


THR4 


21.4 


3.7 


0.508' 


57,512 


5.50 


7 


SRV2 


6.5 


NA*"' 


0.260 


57,727 


4.92 


152 


VMA2 


33.7 


8.9 


0.546 


58,573 


6.47 


17 


ACH1 


4.4 


1.5 


0.327 


58,573 


6.47 


IS 


ACT1 


5.4 


1.5 


0327 


61,353 


5.87 


21 


PDC1 


6.5 


200.7 


0.962 


61,353 


5.87 


22 


PDC1 


303.2 


200.7 


0.962 


6J T 353 


5.87 


23 


PDCI 


16.3 


200.7 


0.962 


6i,649 


5.54 


'38 


CCTS 


2.2 


1.5 


0J27I 
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Mol wt 




Spot no. 


YPD gene 
n:ime a 


X I U LCI 11 

abundance 
(10 3 copies/ 
cell) 


uiRNA. 
abu ndar.ee 
(copies/ceil) 


Cod on 
bias 


fi 1,902 


6.21 


101 


PDC5 


4.2 


NA C 


0.828 


62,266 


6.19 


16 


TCL1 


20.1 


NA"" 


0.327 


62,862 


S.02 


19 


1LV3 


5.3 


4.5 


0.548 


63,082 


6.40 


119 


PGM2 


2.2 


3.0 


0.402 


64,335 


5.77 


5 


PAB1 


30.4 


1.5 


0.616 


66,120 


5.42 


8 


stu 


6.7 


0.7 


0.313 


66,120 


5.42 


9 


STU 


6.4 


0.7 


0.313 


66,450 


5.29 


141 


SSB2 


7.1! 


NA< 


0.880 


66,450 


5.29 


142 


SSB2 


2.3 


NA' 


0.8S0 


66,456 


5.23 


10 


SSB1 


64.5 


79.5 


0.907 


66,456 


5.23 


11 


SSB1 


59.0 


79.5 


0.907 


66,456 


5.23 


12 


SSBI 


13.7 


79.5 


0.907 


68,397 


5.82 


82 


LEU4 


3.1 


3.0 


0.407 


69,313 


4.90 


13 


SSA2 


24.3 


18.6 


0.892 


69,313 


4.90 


14 


SSA2 


77.1 


18.6 


0.892 


74,378 


8.46 


15 


YKL029C 


2.8 


3.7 


0.353 


75,396 


5.82 


6 


GRS1 


5.5 


7.4 


0.500 


85,720 


6.25 


1 


MET6 


2.0 


NA C 


0.772 


85,720 


6.25 


2 


MET6 


10.9 


NA C 


0.772 


85,720 


6.25 


3 


MET6 


1.4 


NA 1 


0.772 


93,276 


6.11 


131 


EFT1 


17.9 


41.6 


0.890 


93,276 


6.11 


132 


EFT1 


5.7 


41.6 


0.890 


102,064* 


6.6r 


94 


ADE3 


4.8 


5.2 


0.423 


:i07,482 e 


5.33 c 


4 


MCM3 


2.7 


NA C 


0.240 



• YPD gene names arc available from the YPD website (3 l J). 
6 NA, calculation could not be performed or was not available. 
f mRNA daia inconclusive or NA. 

J No methionines in predicted OKF; therefore, protein concentration was not 
determined. 

" Measured molecular weight or pi did not mulch theoretical molecular weight 
cr pi. 



Protein quantitation. [^SJmeibionine- labeled gels were exposed to X-ray film 
overnight, and then the silver stain and ftim were used to excise 156 spots of 
varying intensities, molecular weights, and pis. The excised spots were placed in 
0.6-uil luicrocenuifnge lubes, and sciatilhmori cocktail (100 jj,:) was added. The 
samples were vortexed and counted. In addition, two para Net geis were elect; o- 
blolted to polyvinylideue djfliioride membranes. The membranes were exposed 
to X-ray film, and four inUiase single spots were excised from each membrane 
and subjected to amino acid analysis. For these four spots, a mean of 209 ± 4 
cpm/pmol of protein/me thionine was found. This number was used to quaniitate 
all remaining spots in conjunction with the number of methionines present in the 
protein. 

To ensure that proteins were labeled to equilibrium, parallel 2D gels weie 
prepared and run on yeas! mctabolicatly labeled for I, 2, 6, or 18 h. The 
corresponding 156 spots were excised from each gel, aad radioactivity was mea- 
sured by liquid scintillation counting for each spot. Calculated protein levels were 
highly reproducible for all time points measured after 1 h. 

Calculation of codou bias and predicted batf-life. Codoa bias values were 
extracted from the YPD spreadsheet (17). Protein biilf-lives were calculated 
based ou the N-eud rule (33). When the N-terminal processing was no; known 
experimentally, it was predicted based on the affinity of methionine aminopep- 
tidase (31). 

RESULTS 

Characteristics of proteonie approach. Nearly every facet of 
protcome analysis hinges on the unambiguous identification of 
large numbers of expressed proteins in cells. Several tech- 
niques have been described previously for the identification of 
proteins separated by 2DE, including N-terminal and internal 
sequencing (1, 2), amino acid analysis (38), and more recently 
mass spectrometry (25). We utilized techniques based on mass 
spectrometry because they afford the highest levels of sensitiv- 
ity and provide unambiguous identification. The specific pro- 
cedure used is schematically illustrated in Fig. 1 and is based 
on three principles. First, proteins are removed from the gel by 



proteolytic in-gel digestion, and the resulting peptides are sep- 
arated by on-line capillary high-performance liquid chromatog- 
raphy. Second, the during peptides are ionized and detected, and 
the specific peptide ions are selected and fragmented by the 
mass spectrometer. To achieve this, the mass spectrometer 
switches between the MS mode (for peptide mass identifica- 
tion) and the MS/MS mode (for peptide characterization and 
sequencing). Selected peptides are fragmented by a process 
called collision-induced dissociation (CYD) to generate a tan- 
dem mass spectrum (MS/MS spectrum) that contains the pep- 
tide sequence information. Third, individual CID mass spectra 
are then compared by computer algorithms to predicted spec- 
tra from a sequence database. This results in the identification 
of the peptide and, by association, the protein(s) in the spot. 
Unambiguous protein identification is attained in a single anal- 
ysis by the detection of multiple peptides derived from the 
same protein. 

Protein identification > Yeast total cell protein lysate (40 u.g), 
metabolically labeled with [ 35 S]methionine, was electro- 
phoretically separated by isoelectric focusing in the first dimen- 
sion and by SDS-10% polyacrylamide gel electrophoresis in 
the second dimension. Proteins were visualized by silver stain- 
ing and by autoradiography. Of the more than 1.000 proteins 
visible by silver staining, 156 spots were excised from the gel 
and subjected to in-gel tryptic digestion, and the resulting 
peptides were analyzed and identified by microspray LC- 
MS/MS techniques as described above. The proteins in this 
study were all identified automatically by computer software 
with no human interpretation of mass spectra. They are indi- 
cated in Fig. 2 and detailed in Table 1. 

The CID spectra shown in Fig. 3 indicate that the quality of 
the identification data generated was sui table for unambiguous 
protein identification. The spectra represent the amino acid 
sequences of tryptic peptides NSGDIVNLGSIAGR (Fig. 3A) 
and FA VG AFTDSLR (Fig. 3B). Both peptides were derived 
from protein S57593 (hypothetical protein YMR226C), which 
migrated to spot 114 (molecular weight, 29,156; pf, 6.59) in the 
2D gel in Fig. 2. Five other peptides from the same analysis 
were also computer matched to the same protein sequence. 

Protein and niRNA quantitation. For the 156 genes investi- 
gated, the protein expression levels ranged from 2,200 (PGM2) 
to 863,000 (TDH2/JDH3) copies/cell. The levels of mRNA for 
each of the genes identified were calculated from SAGE fre- 
quency tables (35). These tables contain the mRNA levels for 
4,665 genes in yeast strain YPH499 grown to mid-log phase in 
YPD medium on glucose as a carbon source. In some in- 
stances, the mRNA levels could not be calculated for reasons 
stated in Materials and Methods. For the proteias analyzed in 
this study, mean transcript levels varied from 0.7 to 473 copies/ 
cell. 

Selection of the sample population for mRNA-protein ex- 
pression level correlation. The protein spots selected for iden- 
tification were selected from spots visible by silver staining in 
the 2D geJ. An attempt was made not to include spots where 
overlap with other spots was readily apparent. The number of 
proteins identified was 156 (Table 1). Some proteins migrated 
to more than one spot (presumably due to differential protein 
processing or modifications), aiid protein levels from these 
spots were calculated by integrating the intensities of the dif- 
ferent spots. The 156 protein spots analyzed represented the 
products of 128 different genes. Genes were excluded from the 
correlation analysis only if part of the data set was missing; i.e., 
genes were excluded if (i) no mRNA expression data were 
available for the protein or putative SAGE tags were ambig- 
uous, (ii) the amino acid sequence did not contain methionine, 
(iii) more than a single protein was conclusively identified as 
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FIG. X Tandem mass (MS/MS) spectra resulting from analysis of a single spot on a 2D gel. The first quadrupole selected a single mass-to- charge ratio {mfz) of 687.2 
(A) or 592.6 (B), while the collision cell was filled with argon gas, and a voltage which caused the peptide to undergo fragmentation by C1D was applied. The third 
quadrupole scanned the mass range from 50 to 1,400 mfz. The computer program Sequest (8) was utilized to match MS/MS spectra to amino acid sequence by database 
searching. Both spectra matched peptides from the same protein, S57593 (yeast hypothetical protein YMR226C). Five other peptides from the same analysis were 
matched to the same protein. 



migrating to the same gel spot, or (iv) the theoretical and 
observed pis and molecular weights could not be reconciled. 
After these criteria were applied, the number of genes used in 
the correlation analysis was 106. 



Codon bias and predicted halMives. Codon bias is thought 
to be an indicator of protein expression, with highly expressed 
proteins having large codon bias values. The codon bias distri- 
bution for the entire set of more than 6,000 predicted yeast 
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gene ORFs is presented in Fig. AA. The interval with the 
largest frequency of genes is between the codon bias values of 
0.0 and 0.1. This segment contains more than 2,500 genes. The 
distribution of the codon bias values of the 128 different genes 
found in this study (all protein spots from Fig. 2) is shown in 
Fig. 4B, and protein half-lives (predicted from applying the 
N-end rule [33] to the experimentally determined or predicted 
protein N termini) are shown in Fig. 4C. No genes were iden- 
tified with codon bias values less than 0.1 even though thou- 
sands of genes exist in this category. In addition, nearly all of 
the proteins identified had long predicted half-lives (greater 
than 30 h). 

Correlation of mRNA and protein expression levels. The 
correlation between mRNA and protein levels of the genes 
selected as described above is shown in Fig. 5. For the entire 
group (106 genes) for which a complete data set was gener- 
ated, there was a general trend of increased protein levels 
resulting from increased mRNA levels. The Pearson product 
moment correlation coefficient for the whole data set (106 
genes) was 0.935. This number is highly biased by a small 
number of genes with very large protein and message levels. A 
more representative subset of the data is shown in the inset of 
Fig. 5. It shows genes for which the message level was below 10 
copies/cell and includes 69% (73 of 106 genes) of the data used 
in the study. The Pearson product moment correlation coeffi- 
cient for this data set was only 0.356. We also found that levels 
of protein expression coded for by mRNA with comparable 
abundance varied by as much as 30-fold. and that the mRNA 
levels coding for proteins with comparable expression levels 
varied by as much as 20-fold. 

The distortion of the correlation value induced by the un- 
even distribution of the data points along the x axis is further 
demonstrated by the analysis in Fig. 6. The 106 samples in- 
cluded in the study were ranked by protein abundance, and the 
Pearson product moment correlation coefficient was repeat- 
edly calculated after including progressively more, and higher- 
abundance, proteins in each calculation. The correlation values 
remained relatively stable in the range of 0.1 to 0,4 if the 
lowest-expressed 40 to 95 proteins used in this study were 
included. However, the correlation value steadily climbed by 
the inclusion of each of the 11 very highly expressed proteins. 

Correlation of protein and mRNA expression levels with 
codon bias. Codon bias is the propensity for a gene to utilize 
the same codon to encode an amino acid even though other 
codons would insert the identical amino acid in the growing 
polypeptide sequence. It is further thought that highly ex- 
pressed proteins have large codon biases (3). To assess the 
value of codon bias for predicting mRNA and protein levels in 
exponentially growing yeast cells, we plotted the two experi- 
mental sets of data versus the codon bias (Fig. 7). The distri- 
bution patterns for both mRNA and protein levels with respect 
to codon bias were lughiy similar. There was high variability in 
the data within the codon bias range of 0.8 to 1.0. Although a 
large codon bias generally resulted in higher protein and mes- 
sage expression levels, codon bias did not appear to be predic- 
tive of either protein levels or mRNA levels in the cell. 

DISCUSSION 

The desired end point for the description of a biological 
system is not the analysis of mRNA transcript levels alone but 
also the accurate measurement of protein expression levels and 
their respective activities. Quantitative analysis of global 
mRNA levels currently is a preferred method for the analysis 
of the state of cells and tissues (11). Several methods which 
either provide absolute mRNA abundance (34, 35) or relative 
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FIG. 4. Current p;oteowe analysis technology utilizing 2DE without pre en- 
richment samples mainly highly expressed and long-lived proteins. Genes encod- 
ing highly expressed proteins generally have large codon bias values. (A) Distri- 
bution of the yeast genome (more than 6,000 geaes) based on codon bias. The 
interval with the largest frequency of geaes is 0.0 to 0.1, with more than 2,500 
genes. (B) Distribution of the genes from identified proteins in this stud)* based 
on codon bias. No genes with codon bias values less than 0.1 were detected in this 
study. (C) Distribution of identified proteins in this study based on predicted 
half-life (estimated by N-cnd rule). 



mRNA levels in comparative analyses (20, 27) have been de- 
scribed elsewhere. The techniques are fast and exquisitely sen- 
sitive and can provide mRNA abundance tor potentially any 
expressed gene. Measured mRNA levels are often implicitly or 
explicitly extrapolated to indicate the levels of activity of the 
corresponding protein in the cell. Quantitative analysis of pro- 
tein expression levels (proteome analysis) is much more time- 
consuming because proteins are analyzed sequentially one by 
one and is not general because analyses are limited to the 
relatively highly expressed proteins. Proteome analysis does, 
however, provide types of data that are of critical importance 
for the description of the state of a biological system and that 
are not readily apparent from the sequence and (he level of 
expression of the mRNA transcript. This study attempts to 
examine the relationship between mRNA and protein expres- 
sion levels for a large number of expressed genes in cells 
representing the same state. 

Limits in the sensitivity of current protein analysis technol- 
ogy precluded a completely random sampling of yeast proteins. 
We therefore based the study on those proteins visible by silver 
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FIG. 5. Correlation between protein and mRNA levels for 106 genes in yeast growing at log phase with glucose as a carbon source. mRNA and protein levels were 
calculated as described in Materials and Methods. The data represent a population of genes with protein expression levels visible by silver staining on a 2D get chosen 
to include the entire range of molecular weights, isoelectric focusing points, and staining intensities. The inset shows the low-end portion of the main figure. It contains 
69% of the original data set. The Pearson product moment correlation for the entire data set was 0.935. The correlation for the inset containing 73 proteins (69%) was 
only 0.356. 



staining on a 2D gel. Of the more than 1,000 visible spots, 156 
were chosen to include the entire range of molecular weights, 
isoelectric focusing points, and staining intensities displayed on 
the 2D protein pattern. The genes identified in this study 
shared a number of properties. First, all of the proteins in this 
study had a cod on bias of greater than 0.1 and 93% were 
greater than 0.2 (Fig. 4B). Second, with few exceptions, the 
proteins in this study had long predicted half-lives according to 
the N-end rule (Fig. 4Q. Tliird, low-abundance proteins with 
regulatory functions such as transcription factors or protein 
kinases were not identified. 

Because the population of proteins used in this study ap- 
pears to be fairly homogeneous with respect to predicted half- 
life and codon bias, it might be expected that the correlation of 
the mRNA and protein expression levels would be stronger for 
this population than for a random sample of yeast proteins. We 
tested this assumption by evaluating the correlation value if 
different subsets of the available data were included in the 
calculation. The 106 proteins were ranked from lowest to high- 
est protein expression level, and the trend in the correlation 
value was evaluated by progressively including more of the 
higher-abundance proteins in the calculation (Fig. 6). The cor- 
relation value when only the lower-abundance 40 to 93 pro- 
teins were examined was consistently between 0.1 and 0.4. If 
the 11 most abundant proteins were included, the correlation 
steadily increased to 0.94. We therefore expect that the corre- 
lation for all yeast proteins or for a random selection would be 
less than 0.4. The observed level of correlation between 
mRNA and protein expression levels suggests the importance 



of posttranslational mechanisms controlling gene expression. 
Such mechanisms include transiational control (15) and con- 
trol of protein half-life (33). Since these mechanisms are also 
active in higher eukaryotic cells, we speculate that there is no 
predictive correlation between steady-state levels of mRNA 
and those of protein in mammalian cells. 

Like other large-scale analyses, the present study has several 
potential sources of error related to the methods used to de- 
termine mRNA and protein expression levels. The mRNA 
levels were calculated from, frequency tables of SAGE data. 
This method is highly quantitative because it is based on actual 
sequencing of unique tags from each gene, and the number of 
times that a tag is represented is proportional to the number of 
mRNA molecules for a specific gene. This method has some 
limitations including the following: (i) the magnitude of the 
error in the measurement of mRNA levels is inversely propor- 
tional to the mRNA levels, (h) SAGE tags from highly similar 
genes may not be distinguished and therefore are summed, (iii) 
some SAGE tags are from sequences in the 3' untranslated 
region of the transcript, (iv) incomplete cleavage at the SAGE 
tag site by the restriction enzyme can result in two tags repre- 
senting one mRNA, and (v) some transcripts actually do not 
generate a SAGE tag (34, 35). 

For the SAGE method, the error associated with a value 
increases with a decreasing number of transcripts per cell The 
conclusions drawn from tliis study are dependent on the qual- 
ity of the mRNA levels from previously published data (35). 
Since more than 65% of the mRNA levels included in this 
study were calculated to 10 copies/cell or less (40% were less 
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than 4 copies/cell), the error associated with these values may 
be quite large. The niRNA levels were calculated from more 
than 20,000 transcripts. Assuming that the estimate of 15,000 
mRNA molecules per ceil is correct (16), this would mean that 
mRNA. transcripts present at only a single copy per cell would 
be detected 12% of the time (35). The mRNA levels for each 
gene were carefully scrutinized, and only mRNA levels for 
which a high degree of confidence existed were included in the 
correlation value. 

Protein abundance was determined by metabolic radiolabel- 
ing with [ 35 S]methionine. The calculation required knowledge 
of three variables: the number of methionines in the mature 
protein, the radioactivity contained in the protein, and the 
specific activity of the radiolabel normalized per methionine. 
The number of methionines per protein was determined from 
the amino acid sequence of the proteins identified by tandem 
mass spectrometry. For some proteins, it was not known 
whether the methionine of the nascent polypeptide was pro- 
cessed away. The N termini of those proteins were predicted 
based oh the specificity of methionine aminopeptidase (31). If 
the N-terminal processing did not conform to the predicted 
specificity of processing enzymes, the calculation of the num- 
ber of methionines would be affected. This discrepancy would 
affect most the quantitation of a protein with a very low num- 
ber of methionines. The average number of calculated methi- 
onines per protein in this study was 7.2. We therefore expect 
the potential for erroneous protein quantitation due to un- 
usual N-terminal processing to be small. 



The amount of radioactivity contained in a single spot rnight 
be the sum of the radioactivity of comigrating proteins. Be- 
cause protein identification was based on tandem mass spcc- 
trometric techniques, comigrating proteins could be identified 
However, comigrating proteins were rarely detected in this 
study, most likely because relatively small amounts of total 
protein (40 u-g) were initially loaded onto the gels, which re- 
sulted in highly focused spots containing generally 1 to 25 ng of 
protein. Because of the relatively small amount loaded, the 
concentrations of any potentially comigrating protein would 
likely be below the limit of detection of the mass spectrometry 
technique used in this study (1 to 5 ng) and below the limit of 
visualization by silver staining (1 to 5 ng). In the overwhelming 
majority of the samples analyzed, numerous peptides from a 
single protein were detected. It is assumed that any comigrat- 
ing proteins were at levels too low to be detected and that their 
iniluence in the calculation would be small. 

The specific activity of the radiolabel was determined by 
relating the precise amount of protein present in selected spots 
of a parallel gel, as determined by quantitative amino acid 
composition analysis, to the number of methionines present in 
the sequence of those proteins and the radioactivity deter- 
mined by liquid scintillation counting. It is possible that the 
resulting number might be influenced by unavoidable losses 
inherent in the amino acid analysis procedure applied. Because 
four different proteins were utilized in the calculation and the 
experiment was done in duplicate, the specific activity calcu- 
lated is thought to be highly accurate. Indeed, the specific 
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FIG. 7. Relationship between codon bias and protein and mRNA levels in this study. Yeast mRNA and protein expression levels were calculated as described in 
Materials and Methods. The data represent the same 106 genes as in Fig: 5. 



activities calculated for each of the four proteins varied by less 
than 10%. Any inconsistencies in ihe calculation of the specific 
activity would result in differences in the absolute levels calcu- 
lated but not in the relative numbers and would therefore not 
influence the correlation value determined. 

The protein quantitative method used eliminates a number 
of potential errors inherent in previous methods for the quan- 
titation of proteins separated by 2DE, such as preferential 
protein staining and bias caused by inequalities in the number 
of radiolabeled residues per protein. Any 2D gel-based method 
of quantitation is complicated by the fact that in some cases the 
translation products of the same mRNA migrated to different 
spots. One major reason is posttraaslational modification or 
processing of the protein. Also, artifactual proteolysis during 
cell lysis and sample preparation can lead to multiple resolved 
forms of the protein. In such cases, the protein levels of spots 
coded for by the same mRNA were pooled.. In addition, the 
existence of other spots coded for by the same mRNA that 
were not analyzed by mass spectrometry or that were below the 
limit of detection for silver staining cannot be ruled out. How- 
ever, since this study is based on a class of highly expressed 
proteins, the presence of undetected minor spots below silver 
staining sensitivity corresponding to a protein analyzed in the 
study would generally cause a relatively small error in protein 
quantitation. 

Codon bias is a measure of the propensity of an organism to 
selectively utilize certain codons which result in the incorpo- 
ration of the same amino acid residue in a growing polypeptide 
chain There are 61 possible codons that code for 20 amino 
acids. The larger the codon bias value, the smaller the number 
of codons that are used to encode the protein (19). It is 



thought that codon bias is a measure of protein abundance 
because highly expressed proteins generally have large codon 
bias values (3, 13). 

Nearly all of the most highly expressed proteins had codon 
bias values of greater than 0.8. However, we detected a number 
of genes with high codon bias and relative low protein abun- 
dance (Fig. 7). For example, the expressed gene with both the 
second largest protein and mRNA levels in the study was 
EN02_YEAST (775,000 and 289.1 copies/cell, respectively). 
EN01_YEAST was also present in the gel at much lower 
protein and mRNA levels (44,200 and 0.7 copies/cell, respec- 
tively). The codon bias values for EN02 and ENOl are similar 
(0.96 and 0.93, respectively), but the expression of the two 
genes is differentially regulated. Specifically, EN01_YEAST is 
glucose repressed (6) and was therefore present in low abun- 
dance under the conditions used. Other genes with large codon 
bias values that were not of high protein abundance in the gel 
include EFT1, TIF1, HXK2, GSP1, EGD2, SHM2, and TALI. 
We conclude that merely determining the codon bias of a gene 
is not sufficient to predict its protein expression level. 

Interestingly, codon bias appears to be an excellent indicator 
of the boundaries of current 2D gel proteomc analysis tech- 
nology'. There are thousands of genes with expressed mRNA 
and likely expressed protein with codon bias values less than 
0.1 (Fig. 4A). In this study, we detected none of them, and only 
a very small percentage of the genes detected in this study had 
codon bias values between 0.1 and 0.2 (Fig. 4B). Indeed, in 
every examined yeast proteomc study (5, 7, 13, 28) where the 
combined total number of identified proteins is 300 to 400, this 
same observation is true, it is expected that for the more 
complex cells of higher eukaryotic organisms the detection of 
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low-abundance proteins would be even more challenging than 
for yeast. This indicates that highly abundant, long-lived pro- 
teins are overwhelmingly detected in proteome studies. If pro 
teome analysis is to provide truly meaningful information 
about cellular processes, it must be able to penetrate to the 
level of regulatory proteins, including transcription factors and 
protein kinases. A promising approach is the use of narrow- 
range focusing gels with immobilized pH gradients (IPG) (23). 
This would allow for the loading of significantly more protein 
per pH unit covered and also provide increased resolution of 
proteins with similar electrophoretic mobilities. A standard pH 
gradient in an isoelectric focusing gel covers a 7-pH-unit range 
(pH 3 to 10) over 18 cm. A narrow-range focusing gel might 
expand the range to 0,5 pH units over 18 cm or more. This 
could potentially increase by more than 10-fold the number of 
proteins that can be detected. Clearly, current proteome tech- 
.nology is incapable of analyzing low-abundance regulatory pro- 
teins" without employing an enrichment method for relatively 
low-abundance proteins. In conclusion, this study examined 
the relationship between yeast protein and message levels and 
revealed that transcript levels provide little predictive value 
with respect to the extent of protein expression. 
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Amplification and overexpression of putative oncogenes 
confer growth advantages for. tumor development. We 
used a functional gessomic approach that integrated 
simultaneous genomic and transcript microarray, proteo- 
mics, and tissue microarray analyses to directly identify 
putative oncogenes in lung adenocarcinoma. We first 
identified 183 genes with increases in both genomic copy 
number and transcript in six lung adenocarcinoma cell 
lines. Next, we used two-dimensional polyacrylamide gel 
electrophoresis and mass spectrometry to identify 42 
proteins that were overexpressed in the cancer cells 
relative to normal cells. Comparing the 183 genes with 
the 42 proteins, we identified four genes - PRDXI> 
EEF1A2, CALR, and KCIP-1 - in which elevated protein 
expression correlated with both increased DNA copy 
number and increased transcript levels (all r > 0.84, two- 
sided A><0.05). These findings were validated by South- 
ern, Northern, and Western blotting. Specific inhibition of 
EEFIA2 and KCIP-1 expression with siRNA in the four 
cell lines tested suppressed proliferation and induced 
apoptosis. Parallel fluorescence in situ hybridization and 
immunohistochemical analyses of EEFIA2 and KCIP-1 in 
tissue microarrays from patients with lung adenocarcinoma 
showed that gene amplification was associated with high 
protein expression fo r hnth gpnps a nd r h^ r protein 
ovcrexpression was related to tumor grade, disease stage, 
Ki-67 expression, and a shorter survival of patients. The 
amplification of EEF1A2 and KCIP-1 and the presence of 
overexpressed protein in tumor samples strongly suggest 
that these genes could be oncogenes and hence potential 
targets for diagnosis and therapy in lung adenocarcinoma. 
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published online 12 December 2005 
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Introduction 

In lung adenocarcinoma, as in other types of cancer, 
gene amplification and the consequent overexpression of 
the amplified oncogene play an important role in the 
development of tumors, because their overexpression 
confers a growth advantage. The ability to identify 
putative oncogenes that are activated during tumorigen- 
esis could facilitate the choice of molecular genetic 
targets for diagnosis and therapy of the disease. This 
concept has been exemplified by HER- 2, which was first 
found to be amplified in neuroblastomas and subse- 
quently shown to be associated with poor prognosis in 
breast cancer (Ross and Fletcher, 1999). Now, HER-2 
aberrations are used as a predictor of response to 
therapy, and treatment of HER-2-positive breast cancer 
with the monoclonal anti-HER-2 antibody trastuzumab 
has been shown to improve prognosis (Ross and 
Fletcher, 1999). Emerging evidence of common ampli- 
cons in lung adenocarcinomas (Luk et af., 2001; Jiang 
el ai, 2004; Tonon ei aL y 2005) suggests that additional 
oncogenes remain to be identified; however, conven- 
tional techniques are ineffective in pinpointing such 
oncogenes. Parallel measurement of DNA copy number 
and mRNA l evels in cDNA — m i croarrays permit s- 
changes in copy number to be compared with transcrip- 
tion levels on a gene-by-gene basis to generate lists of 
candidate genes within the defining amplicons (Hyman 
et <?/., 2002; Pollack et a/., 2002). However, use of 
transcript patterns does not allow assessment of the 
expression of protein products or identification of prolo- 
oncogenes. Another approach, identifying differentially 
expressed proteins by proteomic analysis and then 
comparing the proteins present with mRNA expression 
in cDNA microarrays from the same specimens, can 
clarify the extent to which changes in transcript patterns 
reflect changes in their cognate proteins and post- 
transcriptional mechanisms (Chen et a/., 2002), but this 
approach cannot be used to identify oncogenes driven 
by extensive increases of their gene copy number. 
Moreover, using individual microarrays or proteomic 
approaches alone cannot distinguish the cancer-driving 
oncogenes that directly propel tumor progression from 
the larger number of passenger genes that may be 
concurrently over- re presented but are not biologically 
relevant in tumor development. 
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In this study, we used a comprehensive approach that 
integrated simultaneous comparative genomic hybridi- 
zation (CGH) and transcript microarray with proteomic 
analyses of six lung adenocarcinoma cell lines. We 
directly and specifically identified Tour putative onco- 
genes that could have been activated through amplifica- 
tion and consequent elevation of transcript expression. 
We used small interfering RNA (siRNA) to inhibit the 
expression of two of these four genes in the lung cancer 
cell lines, which further implicated them in oncogenesis. 
We then explored the clinical significance of these 
findings by assessing the expression of these two genes 
in tissue microarrays of human lung cancer specimens. 
Our findings underscore the power of integrated 
functional genomic analyses for identifying putative 
oncogenes in lumorigenesis; such activated genes could 
be useful as targets for diagnosis or therapy , in lung 
cancer. 



Results 

Simultaneous global genomic and transcript analyses 
identify 183 genes with increases in genomic copv 
numbers and transcript expression levels 
To identify genes in which increased DNA copy number 
might contribute to increased transcript in lung adeno- 
carcinomas, first we used CGH with microarrays of six 
lung adenocarcinoma cell lines. We identified 587 genes 
showing increases in DNA copy number across all six 
ceil lines (Supplementary Table IS), which were 
distributed as 90 amplicons on all chromosomes except 
for chromosomes 13 and Y (Supplementary Table 2S). 
A subsequent transcript test with the identical arrays of 
the same cell lines revealed 275 genes that showed 
increased mRNA levels (Supplementary Table 3S). 
Using random permutation tests across all cancer cell 
lines, we identified 183 genes (31%) that showed 
elevated transcript levels from the 587 genes that were 
over-represented in th e genuiu e- (Table I ), suggesting 
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that elevated transcript levels of the 183 genes may 
reflect their genomic over-representation in the cancer 
cells. These findings are consistent with previous reports 
linking genomic changes with altered transcript patterns 
in breast cancer (Hyman et al, 2002; Pollack et al. 
2002). However, our finding that only 31% of the genes 
showing increased DNA copy numbers had cognate 
increases in transcript expression in lung adenocarcino- 
mas is different from the overall rates of 40-60% 
reported for breast cancer (Hyman et al, 2002; Pollack 
et aL 2002). This discordance may reflect methodologic 
differences between studies or biological differences 
between breast cancer and lung adenocarcinoma. 

Proteomic analyses identify four genes for which protein 
abundance was associated with increases in the cognate 
gene and transcript levels 

Analysis of transcript, patterns is insufficient for under- 
standing the expression of protein products and the 

eifeci of genomic o\cr-rcprcsciuaEion on the expression 



of their cognate proteins. To extend these findings 
Beyond genomic over-representation to expression of 
the protein products of those genes, we next assessed 
protein expression in the same cell Jines by two- 
dimensional polyacrylamide gel electrophoresis (PAGE) 
and found that 42 different proteins, representing 42 
individual genes, were significantly increased in the 
cancer cell lines (Table 2; Supplementary Figures IS and 
2S). Some of these proteins were identified as having 
multiple isoforms, and all individual isoforms exhibited 
increases in expression ranging from 4.6 to 12.8 times 
their expression in normal lung tissue cells. In compar- 
ing protein level of the 42 genes with changes in their 
cognate genomic and mRNA expression from the global 
microarray analyses, we found that four (9.5%) of those 
42 genes - PRDX1, EEFJA2. CALR % and KCIP-I - 
showed statistically significant correlations between 
elevated protein exr*ressioj\and increases, in both copy 
number and mRNA expression (all r>0.84; / > <0.05) 
(Table 2) in the cancer cell lines. These findings imply 
that the abundance of these four proteins is attributable 
to the amplification and consequent elevated transcrip- 
tion of their cognate genes. 



Validation of copy number, transcript, and protein 
expression ofPRDXI. EEFIA2, CALR. and KCIP-I 
in lung cancer cell lines 

To confirm our findings from the high-throughput 
analyses, we next used Southern, Northern, and Western 
blotting to assess DNA, RNA, and protein levels for the 
four genes identified in the six cell lines. For compar- 
ison, we arbitrarily chose one gene, NFKBI, \n which an 
increase in protein level did- not correlate with genetic 
changes. Overall, we found excellent concordance 
between the CGH microarray and Southern blotting 
analyses, transcript array and Northern blotting ana- 
lyses, and proteomic and Western blotting analyses for 
all five genes (Figure I). For example, KCIP-I showed 
fivefold amplification in five of the six cancer cell lines. 
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whereas NFKBI showed no such increase in any of the 
cell lines. As for transcript expression, Northern blotting 
of EEF1A2 showed high expression in five of the six 
cancer cell lines; again, levels of NFKBI transcript were 
not increased in any cancer cell line as compared with 
normal bronchial epithelial cells. The results of Western 
blotting were also consistent with the results of the 
proteomic experiments; for example, five of the cancer 
cell lines exhibited strong protein bands for PRDXI as 
compared with normal cells. These findings provide 
strong support for the validity of the results derived 
from the high-throughput techniques in this study. 

These parallel analyses also revealed close correla- 
tions in the extent of changes in gene copies, transcript, 
and protein of each of the four genes in the cancer cell 
lines. For example, in the five cancer cell lines that 
showed at least fourfold increases in EEFIA2 copy 
number, expression of transcript and protein was also 
increased by at least a factor of four as well (relative to 
their expression in normal cells) {Supplementary Figure 
3S). The protein abundance of the four genes showing 
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Table I (continued) 
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Table I {continued) 
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corresponding increases in both DNA copy number and 
mRNA provides further evidence that these could be 
oncogenes, the activation of which is reflected by 
genomic amplification and consequent increases in 
transcript level in lung adenocarcinoma cell lines. 

Specific inhibition of EEF1A2 and KC/P-J expression by 
siRNAs led to decreased cell proliferation and induction of 
apoptosis 

To further prove the oncogenic function of the identified 
genes in lung tumorigenesis, we used siRNAs to inhibit 
the endogenous expression of EEFIA2 and KCJP-1 
protein in four lung cancer cell lines (Hi 563, H229, 
H522, and SK-LU). Transfection of the cancer cells with 
specific siRNAs reduced the level of EEFIA2 and 
KCTP-I protein by 70 90% 48 h after transfection 



(Supplementary Figure 4S). In contrast, EEF1A2 and 
KCIP-I protein levels remained unchanged in mock- 
treated control cells and in cells transfected with a 
scrambled siRNA sequence. At 48 h after siRNA 
transfection, the percentage of proliferation of the 
transfected cancer cells was reduced to 1 5-30% as 
compared with 91-100% of cell proliferation of the 
same cell lines treated with PBS or scrambled siRNA 
(Supplementary Figure 5S). Apoptosis of siRNA- 
transfected cells was 27-34%, whereas only 4% of the 
same cell lines treated with PBS or scrambled siRNA 
showed apoptosis. These results strongly support an 
oncogenic role for the identified genes in lung cancer and 
confirm their potential usefulness as therapeutic targets 
for. the disease. 



Amplification and protein expression of KCIP-I and 
EEFIA2 in lung tissue 

To further validate these findings and to assess the 
possible clinical significance of the four potential 
putative oncogenes identified from the cell lines, we first 
applied fluorescence in situ hybridization and immuno- 
histochemical analysis, in parallel, to commercially 
available human lung tissue microarrays (Ambion, 
Austin, TX, USA) to evaluate the status of two of these 
four genes in lung cancer tissue specimens. (Commer- 
cially available antibodies to PRDXI or CALR were 
not suitable for use in immunohistochemical analysis 
when this report was written.) Overexpression of KCIP- 
I and EEFIA2 protein in the tumors was concordant 
with amplification of the corresponding genes 
(P-0.0003 for KCJP-1 and /> = 0.001i for EEF1A2). 
For example, 16 (35%) of the 46 lung adenocarcinomas 
in the microarray showed amplification of KCIP-L and 
strong cytoplasmic staining for KC1P-J protein was seen 
in 18 tumors (39%) (Figure 2). We next examined 
whether overexpression of these genes was associated 
with increased cell proliferation by analysing Ki-67 
expre ssi on in contiguou s sections of the tissue micro - 
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arrays. Positive Ki-67 expression was found to correlate 
with positive expression of both KCIP-I (P = 0.02) and 
EEFIA2 (/> = 0.0I). To extend these findings, we then 
studied 1 1 tissue microarray blocks comprising normal 
and tumor tissue specimens from 113 patients with 
pathologic stage I non-small-cell lung cancer who had 
undergone curative surgery (Wang et <?/., 2005). 
Immunohistochemical analysis showed that EEF1A2 
was expressed in 32 cases (28%) and KCIP-I in 29 cases 
(26%). Univariate and multivariate Cox proportional 
hazards models were used to detect possible associations 
between EEF1A2 and KCIP-I expression and clinico- 
pathologic variables. Expression of EEFI A2 or KCIP-I 
was associated with short overall survival time 
(P = 0.00 1 2 for EEFI A2 and /> = 0.0026 for KCIP-I) 
(Supplementary Figure 6S). Age at diagnosis, histologic 
type of cancer, degree of tumor differentiation, and 
smoking history were not associated with survival time. 

Although only two genes were validated in the lung 
tissue microarrays (because available antibodies to 
the other two genes were not suitable for use in 
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Tabic 2 Proteins showing significant ovcrexpression in cancer cell lines relative to those in normal bronchial epithelial cell lines and their 
. correlation coefficients with increased DNA copy number or mRNA values" 
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HCK 
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Hemopoietic cell kinase 
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CDC25B 
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Cell division cycle 25B 
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0.283214 


P50290 
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CDC42 
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Cell division cycle 42 (GTP-binding protein, 25kDa) 


-0.47636 
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RACI 


21.5/6.8 


Ras-related C3 botulinum toxin substrate I 


-0.05583 


-0.03566 


P07437 
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Tubulin, beta polypeptide 
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Cyclin HI 
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Colony stimulating factor 2 (granulocyte-macrophage) 
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Eukaryotic translation initiation factor SA 


-0 97893 
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Fatty acid-binding protein 5 
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a Only the gene showing statistically significant increased protein expression wilh increases in both genomic copy number and transcript 
simultaneously will be considered as potential putative oncogene in lung adenocarcinoma ceils. V, Spearman correlation coefficients between 
proteins ;ind genomic or mRNA values are based on all six cancer cell lines; bold indicates /><0.05, if r> 0.84000. Mw, molecular weight; p/, 
isoelectric point. 



immunohistochemical analysis), these findings are con- 
sistent with those from our cell lines, demonstrating 
again that genomic amplification and consequent 
increases in amounts of transcript may be, at least in 
part, driving the abundance of proteins in these lung 
tumors. The association between expression of these 
genes and that of Ki-67. a known indicator of poor 
prognosis in lung cancer (Martin et <//., 2004), suggests 
that activation of these genes may be an indicator of 
tumor aggressiveness. These results also suggest that 
expression of EEFIA2 and KC1P-1 proteins in stage I 
non-small-cell lung cancer may be useful as a marker for 
distinguishing patients with relatively poor prognosis 
from those who might benefit from adjuvant treatment. 



Discussion 

Our current study illustrates the power of integrated 
functional genomic analyses for identifying putative 
oncogenes and for evaluating their potential clinical 
significance. Among the four identified oncogenes, three 
genes (PRDXh CALR, and KCIP-J) have been im- 
plicated in lung tumorigenesis. PRDXI is an antioxidant 
protein involved in regulating cell proliferation, differ- 
entiation, and apoptosis. Kim et al. (2003) found 
PRDXI expression to be elevated in both lung cancer 
and adjacent normal lung tissue, suggesting that 
activation of PRDXI may enhance proliferation in lung 
cancer. CALR has a major role in Ca 2+ binding and the 
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Figure I Confirmation by Southern. Northern, and Western blot 
analyses ol* increased DN'A copies, transcript levels, and protein 
levels in the four genes identified in high-throughput analyses. For 
comparison, we arbitrarily chose one gene, NFKBl, in which an 
increased protein level did not correlate with genetic changes. The 
blotting results are consistent with the results from the CGH array, 
irunscripi array, and proieomie analyses. Nor, indicates normal 
bronchial epithelial cell line. All the experiments were repeated at 
least three times with each cell line. Means of normalized to /J-aclin 
signal intensities on Southern, Northern, and Western blots, along 
with 95% confidence intervals, were calculated (0-actin signals are 
not shown in the figure; two difTcrenl normal bronchial epithelial 
cell lines were used in the confirmation and only one normal cell 
line is .shown in the figure). 



transcriptional regulation of other genes and was 
-recently found to be overexpressed in 73% of 40 lung 
adenocarcinomas (Oates and Edwards, 2000). KCIP-i 
belongs to the 1 4-3-3 family, which participates via the 
M APK and Wnt signaling pathways in the regulation of 
many cellular processes including cell proliferation and 
differentiation as well as tumorigenesis (Thomas et aL, 
2005). KCIP-I was recently found to be expressed in all 
1 2 lung tumors tested in a single-institution study (Qi 
et ul r 2005). Interestingly, EEFIA2 was originally 
considered a putative oncogene in ovarian cancer on 
the basis of its being amplified in 25% and over- 
expressed in 30% of the same set of ovarian tumors 
(Anand et uL y 2002); functional analyses have estab- 
lished its oncogenic role in cellular transformation (Lee, 
2003). Our discovery that EEF1A2 may be a putative 
oncogene in lung adenocarcinoma demonstrates the 
power of our functional genomic strategy for rapidly 
identifying potential oncogenes. 

Although the main focus of this study was to 
specifically identify putative oncogenes, it should be 



noted that 90.7% of the genes showing high protein 
expression did not show corresponding increases 
in both DNA copy number and transcript, a finding 
consistent with that of others that transcriptional, 
translational, and post-translational regulatory mecha- 
nisms can greatly influence the abundance of protein 
in lung tumorigenesis (Chen et aL y 2002). For example, 
NFKBl is a critical arbiter of immune responses, 
cell survival, and transformation and is often activated 
in several types of tumors (Chen et aL 2002). De- 
regulation of NFKBl is thought to be modulated 
through phosphorylation of Ser337 by protein kinase 
A (Chen et aL, 2002). Tn our study, 68.8% of the 
genes showing over-representation in the genome 
did not show elevated transcript levels, implying 
that at least some of these genes are 'passenger* genes 
that are concurrently amplified because of their 
location with respect to arnplicons but lack bio- 
logical relevance in terms of the development of lung 
adenocarcinoma. 

Although the potential oncogenes we identified here 
are likely to be important, certainly other oncogenes 
could be involved in the development of lung adeno- 
carcinoma. The oligo microarray we used consists of 
22000 probes, which represent only about 60% of the 
human genome. Moreover, each probe was designed for 
the 3' region of expressed sequence lags of the selected 
genes. Also, our results were initially derived from 
cancer cell lines, although the findings were later 
confirmed in human tissue samples. Our ongoing study 
using microarrays with information on more genes 
and the development of high-resolution proteomic 
analyses for use with larger numbers of specimens will 
allow more comprehensive analyses of the molecular 
, consequences of gene amplifications. Such expanded 
analyses will very likely lead to the identification of 
additional oncogenes. 

Some of the results of our current study were 
comparable to those of other studies of lung cancer. 
For examp l e, genomi c copy number and protein l eve ls — 
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of KCIP-I were previously found to be amplified and 
overexpressed in primary lung cancers by cDNA clone- 
based CGH array analysis (Jiang et aL, 2004) and 
proteomic analysis (Chen et aL, 2002), respectively. Our 
functional genomic approach, which integrates simulta- 
neous CGH, transcript microarrys, proteomic analyses, 
and siRNA, allows us not only to quickly identify 
potential oncogenes but also to explore their significance 
as diagnostic and therapeutic targets in tumor progres- 
sion - more than could be achieved by any technique 
alone. 

Genes identified in this way may serve as promising 
targets for diagnosis and therapy in lung adenocarci- 
noma. Further research on the clinical implications of 
such genes is needed; experiments now underway in our 
laboratory include overexpression of the genes in 
normal cells, disruption of the function of these genes 
in cancer cells, and investigation of how interactions 
among these genes (or interactions with other known 
oncogenes) may mediate the expression of the trans- 
formed phenotype. 
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l J™Zl .^^amplification is associated with high EEFIA2 protein expression in lung adenocarcinomas, (a) Cells from a (una 
actenouMcmoma sample ,n which EEF1A2 is amplified show more green signals (EEFIA2) than red * 



centromeric probe) (original magnification, x 400). (b) Immunohist^heS slatog oT^om'Ihe sa^e^^ 

^ A '-^adenocarcinoma sampled two copies of EEF^JS^^^ 

tciHiomcrii. probe, indical ng no EEFIA2 amplification (original magnification, x 400). (d) Immunohistochcmical stainine of celU 
Iron, the same tissue sample as in panel c shows negative staining for EEFI A2. nonistouicmical staining of cells 



Materials and methods 

Cell lines 

Six human lung adenocarcinoma cell lines (H23, H229, H 1792, 
SK-LU-I, H.522, and HI563) were obtained from the 
American Type Culture Collection (Manassas, VA, USA). 
-HWttal bronchial epi thelia l ce ll lines we r e o btai n ed from 



» — ** VkSlUIIIV-U Ill/Ill 

Clonlech (Palo Alto t CA, USA). Genomic DNA, mRNA, and 
protein were derived from a single harvest of these cells/ 

ON A and RNA profiles by microarray analysis 
Genomic DNA labeling and hybridization'were performed as 
described previously (Barrett et <//., 2004) with Agilent's 
Human I A Oiigo Microarray (V2) (Agilent Technologies, 
Palo Aito. CA, USA), which, contains 22 000 unique 60-mer 
oligos. Details of the protocol for analysing transcripts are 
available at htlp://www.chen».agileni.com. Map positions for 
arrayed genes were assigned by identifying the DNA sequence 
represented in the UniGene cluster and matching it with the 
Golden Path genome assembly (http://genome.ucsc.edu/; Mat 
7, 2004 Freeze). Microarray images of DNA copy number and 
expression were analysed by using AgilcntCGH Analytics and 
Feature Extraction software. DNA copy number profiles that 
deviated significantly from background signal ratios (measured 
from normal control cell hybridization, as described elsewhere; 
Barrett et aL, 2004) were interpreted as evidence of true 
differences in DNA copy number, the criteria for defining 
genomic over-representation and amplicons are described 
vSsewbere (Hvnmn w ,;/.. 2002):- details are niven in the 



Supplementary Information. An increase in mRNA level was 
defined as a twofold increase in signal ratio relative to that of 
the control (log 2 > I). 

Quantitative two-dimensional PAGE and mass spectrometry 
Analysis of proteins by two 7 dimensional PAGE and their 



loentihcation by mass spectrometry were performed as 
previously described (Shen et ai. t 2004). Briefly, protein pellets 
were solubilized in rehydration bufier, after which the first- 
dimension isoelectric focusing was carried out with a Protean 
IEF Cell (Bio-Rad Laboratories) and the second-dimension 
separation was carried out with Bio-Rad's Ready Gel Precast 
Gels and the Bio-Rad Criterion Cell apparatus. Protein spots 
were visualized by silver-based staining, and all gels were 
assessed with Bio-Rad 's PDQuest 2D gei image analysis 
software. Selected spots were subjected to in-gel tryptic 
digestion and analysed on a Voyager-DE PRQ matrix-assisted 
laser desorptipn ionization/time-of-fiight mass spectrometer 
(Applied Biosystems, Foster City, CA, USA). The mass list of 
the 20 most intense monoisotopic peaks for each sample was 
entered in the MS-Fit search program (v3.2.l) (http:// 
prospector.ucsf.edu/ucsfhtmW.O/msfit.htm) and searched in 
the National Center for Biotechnology Information protein 
database. 

Southern, Northern, and Western blot analyses 
Southern, Northern, and Western blot hybridizations were 
performed according to standard protocols. cDNA clones for 
the tested genes were purchased from Invitrogen (Carlsbad, 
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CA. USA) and prepared as probes for the blot hybridizations. 
Antibodies used were obtained as follows: PRDXI, CALR, 
NFKBI, KCIP-I. and //-actin from Santa Cruz Biotechnology 
(Santa Cruz. CA, USA); and EEFI A2 from Upstate Biotech- 
nology (Wultham. MA. USA). 

Fluorescence in situ hybridization and imnumohisfochemical 
analyses of Inn a f issue microarrays 

Fluorescence //; si/ti hybridizations and immunohistochemical 
analyses of KCIP-I and EEFI A2 were carried out as described 
elsewhere (Jiang ct ah. 2002; Wang et aL 2005) with Lung 
Tissue Microarrays (Ambion, Austin, TX, USA) and II 
homemade microarray blocks containing tissue samples from 
113 patients with pathologic stage I non-small-ccll king cancer 
(Wang et aL 2005). DNA probes specific for KCIP-I und 
EEFIA2 were obtained by screening a Human BAC Clone 
library (Invitrogcn) by polymerase chain reaction as described 
previously (Jiang et aL. 2002). The antibodies used for the 
immunohistochemical analyses were the same as those used 
for the Western blotting. Ceil proliferation of the lung tissues 
was assessed with a Ki-67 monoclonal antibody from Santa 
Cruz Biotechnology. Definitions of the cutoff value for a 
positive result of each antibody are shown in Supplementary 
Information. 

siRNA transla tion, cellular proliferation assay, and apoptosis 
analysis 

Translations were carried out by using siPORT Lipid 
Transfcction Agent (Ambion) with siRNAs targeting KCIP-I 
or EEFIA2 or with a scrambled siRNA duplex (siControl) 
(Dharmacon Inc.. Lafayette, CO. USA), with PBS used as a 
negative control (Jiang et aL, 2002). Cells were fixed 24, 48, or 
96 h later and subjected to further tests. Ail siRNAs were 
prepared by using a transcription-based method with Silencer 
siRNA according to the manufacturer's instructions (Am- 
bion). Sequences of the individual siRNAs are listed in 
Supplementary Table 4S. Inhibition of cell growth by the 

References 



siRNAs was determined by MTT staining, and cell growth 
rate was plotted against the percentage of viable cells in the 
saline-treated controls (a value arbitrarily set at 100%) (Jiang 
et aL, 2002). Apoptosis was analysed by fluorescence cell 
cycle analysis of terminal deoxynucleotidyl transferase- 
mediated dUTP nick-end labeling with FITC-labeled dUTP 
(Boehringer Mannheim Biochemicals, Mannheim, Germany) 
(Jiang et aL: 2005). 

Statistical analyses 

Relationships between gene copy number and mRNA level 
were examined as described elsewhere (Hyman et aL. 2002, 
Supplementary Information). Correlations between protein 
abundance and DNA copy number and mRNA expression of 
the corresponding genes were evaluated with the Spearman 
correlation coefficient. Fisher's exact test and % l - tests were 
used to analyse associations between amplification and 
expression of the candidate genes with various histopathologic 
variables of the samples in the tissue microarrays. Univariate 
and multivariate analyses were carried out with Cox's 
proportional hazards model to determine which independent 
factors might have a joint significant influence on survival. A 
P-value s^O.05 was considered statistically significant; all 
statistical tests were based on a two-sided significance level. 
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